Week 1 (Sep 17): Introduction and Course Overview
Admin details
Intro Lecture based on ICSE 2008 Tutorial by Ahmed E. Hassan and Tao Xie.
Assigned Reading:The Road Ahead for Mining Software Repositories by Ahmed E. Hassan slides

You need to access the papers below using an on campus machine or using the Queen's Libary proxy
Week 2 (Sep 24): Evolution I
Mining Software Repositories ICSE 2008 Tutorial Contiued.
Assigned Reading: Future of Mining Software Archives: A Roundtable
An Empirical Study of Operating Systems Errors
Andy Chou, Junfeng Yang, Benjamin Chelf, Seth Hallem, Dawson Engler
Analysis Techniques: Fitting distributions, Kaplan-Meir Survival Curve
Listening to programmers Taxonomies and characteristics of comments in operating system code
Yoann Padioleau, Lin Tan, Yuanyuan Zhou
Week 3 (Oct 1): Predicting Bugs I
Predicting fault incidence using software change history
Todd L. Graves, Alan F. Karr, J. S. Marron, and Harvey P. Siy
Analysis Techniques: Basic linear regression, GLM, R2, model error, exponential decay
Predictors of customer perceived software quality
Audris Mockus, Ping Zhang, and Paul Luo Li
Analysis Techniques: Classification, Logistic Regression (Building and Interpreting Co-efficients), R2, model error
Predicting Bugs from History
Thomas Zimmermann, Nachiappan Nagappan, and Andreas Zeller (Evolution Book)
[READING]
Week 4 (Oct 8): Predicting Bugs and Helping Fix Bugs
Using Software Dependencies and Churn Metrics to Predict Field Failures: An Empirical Case Study
Nachiappan Nagappan, Thomas Ball
Analysis Techniques: PCA, classification, ranking, data splitting
Who should fix this bug?
John Anvik, Lyndon Hiew, and Gail C. Murphy
Analysis Techniques: C4.5, SVM, Naive Bayes
Novel Applications of Machine Learning in Software Testing
Lionel C. Briand
Predicting Defects for Eclipse
Thomas Zimmermann, Rahul Premraj, and Andreas Zeller
Analysis Techniques: Using R, Classification, Ranking
[READING]
[ASSIGNMENT]
Week 5 (Oct 15): WCRE




Week 6 (Oct 22): Guiding Development
Assignment Presentation DUE -- OCT 22
Mining version histories to guide software changes
Thomas Zimmermann, Peter Weißgerber, Stephan Diehl, Andreas Zeller
What's Hot and What's Not: Windowing Developer Topic Analysis?
Abram J. Hindle, Michael W. Godfrey, Richard C. Holt.
Analysis Techniques: LSI, LDA
Hipikat: recommending pertinent software development artifacts
Davor Cubranic and Gail C. Murphy
Week 7 (Oct 29): Visualization of Software Repositories
Assignment Report DUE -- OCT 29
Analysing Software Repositories to Understand Software Evolution
Marco D’Ambros, Harald Gall, Michele Lanza, and Martin Pinzger
Visualizing software changes
Stephen G. Eick, Todd L. Graves, Alan F. Karr, Audris Mockus, and Paul Schuster
Seeking the source: software source code as a social and technical artifact
Cleidson de Souza, Jon Froehlich, and Paul Dourish
Project Proposal DUE -- Nov 2
Week 8 (Nov 5): Tools and Mining Challenges
Automatic identification of bug-introducing changes
Sunghun Kim, Thomas Zimmermann, Kai Pan, E., and James Whitehead, Jr.
Identifying reasons for software change using historic databases.
Audris Mockus and Larry G. Votta
Preprocessing CVS Data for Fine-Grained Analysis
Thomas Zimmermann and Peter Weißgerber
Week 9 (Nov 12): Bug Detection
Capturing, indexing, clustering, and retrieving system history
Ira Cohen, Steve Zhang, Moises Goldszmidt, Julie Symons, Terence Kelly, and Armando Fox
Scalable statistical bug isolation
Ben Liblit, Mayur Naik, Alice X. Zheng, Alex Aiken, and Michael I. Jordan
Bugs as deviant behavior: A general approach to inferring errors in systems code
Dawson Engler, David Yu Chen, Seth Hallem, Andy Chou, and Benjamin Chelf
Week 10 (Nov 19): Code Reuse, Reuse Patterns, and Searching (might change)
CVSSearch: searching through source code using CVS comments
Annie Chen, Eric Chou, Joshua Wong, Andrew Y. Yao, Qing Zhang, Shao Zhang, and Amir Michail
XSnippet: Mining for Sample Code
Naiyana Tansalarak and Kajal T. Claypool
Symbolic mining of temporal specifications
Mark Gabel and Zhendong Su
Week 11 (Nov 26): Bring A Paper To Class Week
Have things changed now?: an empirical study of bug characteristics in modern open source software
Zhenmin Li, Lin Tan, Xuanhui Wang, Shan Lu, Yuanyuan Zhou, and Chengxiang Zhai
License integration patterns: Addressing license mismatches in component-based development.
Daniel M. Germán and Ahmed E. Hassan


Week 12 (Dec 3): Project Presentations

Presenter

Presenter

Presenter

Presenter
Project Report DUE -- DEC 18