MSR 2014- Proceedings of the 11th Working Conference on Mining Software Repositories

Full Citation in the ACM Digital Library

SESSION: Keynote

Is mining software repositories data science? (keynote)

Audris Mockus

SESSION: Green Mining

Mining energy-greedy API usage patterns in Android apps: an empirical study

Mario Linares-Vásquez
Gabriele Bavota
Carlos Bernal-Cárdenas
Rocco Oliveto
Massimiliano Di Penta
Denys Poshyvanyk

GreenMiner: a hardware based mining software repositories software energy consumption framework

Abram Hindle
Alex Wilson
Kent Rasmussen
E. Jed Barlow
Joshua Charles Campbell
Stephen Romansky

Mining questions about software energy consumption

Gustavo Pinto
Fernando Castor
Yu David Liu

SESSION: Code Clones and Origin Analysis

Prediction and ranking of co-change candidates for clones

Manishankar Mondal
Chanchal K. Roy
Kevin A. Schneider

Incremental origin analysis of source code files

Daniela Steidl
Benjamin Hummel
Elmar Juergens

Oops! where did that code snippet come from?

Lisong Guo
Julia Lawall
Gilles Muller

SESSION: Bug Characterizing

Works for me! characterizing non-reproducible bug reports

Mona Erfani Joorabchi
Mehdi Mirzaaghaei
Ali Mesbah

Characterizing and predicting blocking bugs in open source projects

Harold Valdivia Garcia
Emad Shihab

An empirical study of dormant bugs

Tse-Hsun Chen
Meiyappan Nagappan
Emad Shihab
Ahmed E. Hassan

SESSION: Mining Repos and QA Sites

The promises and perils of mining GitHub

Eirini Kalliamvakou
Georgios Gousios
Kelly Blincoe
Leif Singer
Daniel M. German
Daniela Damian

Mining StackOverflow to turn the IDE into a self-confident programming prompter

Luca Ponzanelli
Gabriele Bavota
Massimiliano Di Penta
Rocco Oliveto
Michele Lanza

Mining questions asked by web developers

Kartik Bajaj
Karthik Pattabiraman
Ali Mesbah

Process mining multiple repositories for software defect resolution from control and organizational perspective

Monika Gupta
Ashish Sureka
Srinivas Padmanabhuni

SESSION: Mining Applications

MUX: algorithm selection for software model checkers

Varun Tulsian
Aditya Kanade
Rahul Kumar
Akash Lal
Aditya V. Nori

Improving the effectiveness of test suite through mining historical data

Jeff Anderson
Saeed Salem
Hyunsook Do

Finding patterns in static analysis alerts: improving actionable alert ranking

Quinn Hanam
Lin Tan
Reid Holmes
Patrick Lam

Impact analysis of change requests on source code based on interaction and commit histories

Motahareh Bahrami Zanjani
George Swartzendruber
Huzefa Kagdi

SESSION: Defect Prediction

An empirical study of just-in-time defect prediction using cross-project models

Takafumi Fukushima
Yasutaka Kamei
Shane McIntosh
Kazuhiro Yamashita
Naoyasu Ubayashi

Towards building a universal defect prediction model

Feng Zhang
Audris Mockus
Iman Keivanloo
Ying Zou

SESSION: Code Review and Code Search

The impact of code review coverage and code review participation on software quality: a case study of the qt, VTK, and ITK projects

Shane McIntosh
Yasutaka Kamei
Bram Adams
Ahmed E. Hassan

Modern code reviews in open-source projects: which problems do they fix?

Moritz Beller
Alberto Bacchelli
Andy Zaidman
Elmar Juergens

Thesaurus-based automatic query expansion for interface-driven code search

Otávio A. L. Lemos
Adriano C. de Paula
Felipe C. Zanichelli
Cristina V. Lopes

SESSION: Effort Estimation and Reuse

Estimating development effort in Free/Open source software projects by mining software repositories: a case study of OpenStack

Gregorio Robles
Jesús M. González-Barahona
Carlos Cervigón
Andrea Capiluppi
Daniel Izquierdo-Cortázar

An industrial case study of automatically identifying performance regression-causes

Thanh H. D. Nguyen
Meiyappan Nagappan
Ahmed E. Hassan
Mohamed Nasser
Parminder Flora

Revisiting Android reuse studies in the context of code obfuscation and library usages

Mario Linares-Vásquez
Andrew Holtzhauer
Carlos Bernal-Cárdenas
Denys Poshyvanyk

SESSION: Mining Mix

Syntax errors just aren't natural: improving error reporting with language models

Joshua Charles Campbell
Abram Hindle
José Nelson Amaral

Do developers feel emotions? an exploratory analysis of emotions in software artifacts

Alessandro Murgia
Parastou Tourani
Bram Adams
Marco Ortu

How does a typical tutorial for mobile development look like?

Rebecca Tiarks
Walid Maalej

Unsupervised discovery of intentional process models from event logs

Ghazaleh Khodabandelou
Charlotte Hug
Rebecca Deneckère
Camille Salinesi

SESSION: Short Research/Practice Papers

Tracing dynamic features in python programs

Beatrice Åkerblom
Jonathan Stendahl
Mattias Tumlin
Tobias Wrigstad

It's not a bug, it's a feature: does misclassification affect bug localization?

Pavneet Singh Kochhar
Tien-Duy B. Le
David Lo

Classifying unstructured data into natural language text and technical information

Thorsten Merten
Bastian Mager
Simone Bürsner
Barbara Paech

Collaboration in open-source projects: myth or reality?

Yuriy Tymchuk
Andrea Mocci
Michele Lanza

Improving the accuracy of duplicate bug report detection using textual similarity measures

Alina Lazar
Sarah Ritchey
Bonita Sharif

Undocumented and unchecked: exceptions that spell trouble

Maria Kechagia
Diomidis Spinellis

Innovation diffusion in open source software: preliminary analysis of dependency changes in the gentoo portage package database

Remco Bloemen
Chintan Amrit
Stefan Kuhlmann
Gonzalo Ordóñez–Matamoros

A dictionary to translate change tasks to source code

Katja Kevic
Thomas Fritz

New features for duplicate bug detection

Nathan Klein
Christopher S. Corley
Nicholas A. Kraft

Mining modern repositories with elasticsearch

Oleksii Kononenko
Olga Baysal
Reid Holmes
Michael W. Godfrey

SESSION: Mining Challenge

A study of external community contribution to open-source projects on GitHub

Rohan Padhye
Senthil Mani
Vibha Singhal Sinha

Understanding "watchers" on GitHub

Jyoti Sheoran
Kelly Blincoe
Eirini Kalliamvakou
Daniela Damian
Jordan Ell

Do developers discuss design?

João Brunet
Gail C. Murphy
Ricardo Terra
Jorge Figueiredo
Dalton Serey

Magnet or sticky? an OSS project-by-project typology

Kazuhiro Yamashita
Shane McIntosh
Yasutaka Kamei
Naoyasu Ubayashi

Security and emotion: sentiment analysis of security discussions on GitHub

Daniel Pletea
Bogdan Vasilescu
Alexander Serebrenik

Sentiment analysis of commit comments in GitHub: an empirical study

Emitza Guzman
David Azócar
Yang Li

Analysing the 'biodiversity' of open source ecosystems: the GitHub case

Nicholas Matragkas
James R. Williams
Dimitris S. Kolovos
Richard F. Paige

Co-evolution of project documentation and popularity within github

Karan Aggarwal
Abram Hindle
Eleni Stroulia

An insight into the pull requests of GitHub

Mohammad Masudur Rahman
Chanchal K. Roy

SESSION: Data Showcase

A dataset for pull-based development research

Georgios Gousios
Andy Zaidman

The bug catalog of the maven ecosystem

Dimitris Mitropoulos
Vassilios Karakoidas
Panos Louridas
Georgios Gousios
Diomidis Spinellis

A dataset of feature additions and feature removals from the Linux kernel

Leonardo Passos
Krzysztof Czarnecki

Kataribe: a hosting service of historage repositories

Kenji Fujiwara
Hideaki Hata
Erina Makihara
Yusuke Fujihara
Naoki Nakayama
Hajimu Iida
Kenichi Matsumoto

Lean GHTorrent: GitHub data on demand

Georgios Gousios
Bogdan Vasilescu
Alexander Serebrenik
Andy Zaidman

A code clone oracle

Daniel E. Krutz
Wei Le

Generating duplicate bug datasets

Alina Lazar
Sarah Ritchey
Bonita Sharif

FLOSS 2013: a survey dataset about free software contributors: challenges for curating, sharing, and combining

Gregorio Robles
Laura Arjona Reina
Alexander Serebrenik
Bogdan Vasilescu
Jesús M. González-Barahona

A green miner's dataset: mining the impact of software change on energy consumption

Chenlei Zhang
Abram Hindle

Gentoo package dependencies over time

Remco Bloemen
Chintan Amrit
Stefan Kuhlmann
Gonzalo Ordóñez–Matamoros

Models of OSS project meta-information: a dataset of three forges

James R. Williams
Davide Di Ruscio
Nicholas Matragkas
Juri Di Rocco
Dimitris S. Kolovos

A dataset of clone references with gaps

Hiroaki Murakami
Yoshiki Higo
Shinji Kusumoto

A dataset for maven artifacts and bug patterns found in them

Vaibhav Saini
Hitesh Sajnani
Joel Ossher
Cristina V. Lopes

OpenHub: a scalable architecture for the analysis of software quality attributes

Gabriel Farah
Juan Sebastian Tejada
Dario Correal

Understanding software evolution: the maisqual ant data set

Boris Baldassari
Philippe Preux

SIGSOFT Awards

Conference Awards

Recognition

MSR 2014- Proceedings of the 11th Working Conference on Mining Software Repositories

SESSION: Keynote

SESSION: Green Mining

SESSION: Code Clones and Origin Analysis

SESSION: Bug Characterizing

SESSION: Mining Repos and QA Sites

SESSION: Mining Applications

SESSION: Defect Prediction

SESSION: Code Review and Code Search

SESSION: Effort Estimation and Reuse

SESSION: Mining Mix

SESSION: Short Research/Practice Papers

SESSION: Mining Challenge

SESSION: Data Showcase

Search