MSR 2014- Proceedings of the 11th Working Conference on Mining Software Repositories

MSR 2014- Proceedings of the 11th Working Conference on Mining Software Repositories

Full Citation in the ACM Digital Library

SESSION: Keynote

Is mining software repositories data science? (keynote)

  • Audris Mockus

SESSION: Green Mining

Mining energy-greedy API usage patterns in Android apps: an empirical study

  • Mario Linares-Vásquez
  • Gabriele Bavota
  • Carlos Bernal-Cárdenas
  • Rocco Oliveto
  • Massimiliano Di Penta
  • Denys Poshyvanyk

GreenMiner: a hardware based mining software repositories software energy consumption framework

  • Abram Hindle
  • Alex Wilson
  • Kent Rasmussen
  • E. Jed Barlow
  • Joshua Charles Campbell
  • Stephen Romansky

Mining questions about software energy consumption

  • Gustavo Pinto
  • Fernando Castor
  • Yu David Liu

SESSION: Code Clones and Origin Analysis

Prediction and ranking of co-change candidates for clones

  • Manishankar Mondal
  • Chanchal K. Roy
  • Kevin A. Schneider

Incremental origin analysis of source code files

  • Daniela Steidl
  • Benjamin Hummel
  • Elmar Juergens

Oops! where did that code snippet come from?

  • Lisong Guo
  • Julia Lawall
  • Gilles Muller

SESSION: Bug Characterizing

Works for me! characterizing non-reproducible bug reports

  • Mona Erfani Joorabchi
  • Mehdi Mirzaaghaei
  • Ali Mesbah

Characterizing and predicting blocking bugs in open source projects

  • Harold Valdivia Garcia
  • Emad Shihab

An empirical study of dormant bugs

  • Tse-Hsun Chen
  • Meiyappan Nagappan
  • Emad Shihab
  • Ahmed E. Hassan

SESSION: Mining Repos and QA Sites

The promises and perils of mining GitHub

  • Eirini Kalliamvakou
  • Georgios Gousios
  • Kelly Blincoe
  • Leif Singer
  • Daniel M. German
  • Daniela Damian

Mining StackOverflow to turn the IDE into a self-confident programming prompter

  • Luca Ponzanelli
  • Gabriele Bavota
  • Massimiliano Di Penta
  • Rocco Oliveto
  • Michele Lanza

Mining questions asked by web developers

  • Kartik Bajaj
  • Karthik Pattabiraman
  • Ali Mesbah

Process mining multiple repositories for software defect resolution from control and organizational perspective

  • Monika Gupta
  • Ashish Sureka
  • Srinivas Padmanabhuni

SESSION: Mining Applications

MUX: algorithm selection for software model checkers

  • Varun Tulsian
  • Aditya Kanade
  • Rahul Kumar
  • Akash Lal
  • Aditya V. Nori

Improving the effectiveness of test suite through mining historical data

  • Jeff Anderson
  • Saeed Salem
  • Hyunsook Do

Finding patterns in static analysis alerts: improving actionable alert ranking

  • Quinn Hanam
  • Lin Tan
  • Reid Holmes
  • Patrick Lam

Impact analysis of change requests on source code based on interaction and commit histories

  • Motahareh Bahrami Zanjani
  • George Swartzendruber
  • Huzefa Kagdi

SESSION: Defect Prediction

An empirical study of just-in-time defect prediction using cross-project models

  • Takafumi Fukushima
  • Yasutaka Kamei
  • Shane McIntosh
  • Kazuhiro Yamashita
  • Naoyasu Ubayashi

Towards building a universal defect prediction model

  • Feng Zhang
  • Audris Mockus
  • Iman Keivanloo
  • Ying Zou

SESSION: Code Review and Code Search

The impact of code review coverage and code review participation on software quality: a case study of the qt, VTK, and ITK projects

  • Shane McIntosh
  • Yasutaka Kamei
  • Bram Adams
  • Ahmed E. Hassan

Modern code reviews in open-source projects: which problems do they fix?

  • Moritz Beller
  • Alberto Bacchelli
  • Andy Zaidman
  • Elmar Juergens

Thesaurus-based automatic query expansion for interface-driven code search

  • Otávio A. L. Lemos
  • Adriano C. de Paula
  • Felipe C. Zanichelli
  • Cristina V. Lopes

SESSION: Effort Estimation and Reuse

Estimating development effort in Free/Open source software projects by mining software repositories: a case study of OpenStack

  • Gregorio Robles
  • Jesús M. González-Barahona
  • Carlos Cervigón
  • Andrea Capiluppi
  • Daniel Izquierdo-Cortázar

An industrial case study of automatically identifying performance regression-causes

  • Thanh H. D. Nguyen
  • Meiyappan Nagappan
  • Ahmed E. Hassan
  • Mohamed Nasser
  • Parminder Flora

Revisiting Android reuse studies in the context of code obfuscation and library usages

  • Mario Linares-Vásquez
  • Andrew Holtzhauer
  • Carlos Bernal-Cárdenas
  • Denys Poshyvanyk

SESSION: Mining Mix

Syntax errors just aren't natural: improving error reporting with language models

  • Joshua Charles Campbell
  • Abram Hindle
  • José Nelson Amaral

Do developers feel emotions? an exploratory analysis of emotions in software artifacts

  • Alessandro Murgia
  • Parastou Tourani
  • Bram Adams
  • Marco Ortu

How does a typical tutorial for mobile development look like?

  • Rebecca Tiarks
  • Walid Maalej

Unsupervised discovery of intentional process models from event logs

  • Ghazaleh Khodabandelou
  • Charlotte Hug
  • Rebecca Deneckère
  • Camille Salinesi

SESSION: Short Research/Practice Papers

Tracing dynamic features in python programs

  • Beatrice Åkerblom
  • Jonathan Stendahl
  • Mattias Tumlin
  • Tobias Wrigstad

It's not a bug, it's a feature: does misclassification affect bug localization?

  • Pavneet Singh Kochhar
  • Tien-Duy B. Le
  • David Lo

Classifying unstructured data into natural language text and technical information

  • Thorsten Merten
  • Bastian Mager
  • Simone Bürsner
  • Barbara Paech

Collaboration in open-source projects: myth or reality?

  • Yuriy Tymchuk
  • Andrea Mocci
  • Michele Lanza

Improving the accuracy of duplicate bug report detection using textual similarity measures

  • Alina Lazar
  • Sarah Ritchey
  • Bonita Sharif

Undocumented and unchecked: exceptions that spell trouble

  • Maria Kechagia
  • Diomidis Spinellis

Innovation diffusion in open source software: preliminary analysis of dependency changes in the gentoo portage package database

  • Remco Bloemen
  • Chintan Amrit
  • Stefan Kuhlmann
  • Gonzalo Ordóñez–Matamoros

A dictionary to translate change tasks to source code

  • Katja Kevic
  • Thomas Fritz

New features for duplicate bug detection

  • Nathan Klein
  • Christopher S. Corley
  • Nicholas A. Kraft

Mining modern repositories with elasticsearch

  • Oleksii Kononenko
  • Olga Baysal
  • Reid Holmes
  • Michael W. Godfrey

SESSION: Mining Challenge

A study of external community contribution to open-source projects on GitHub

  • Rohan Padhye
  • Senthil Mani
  • Vibha Singhal Sinha

Understanding "watchers" on GitHub

  • Jyoti Sheoran
  • Kelly Blincoe
  • Eirini Kalliamvakou
  • Daniela Damian
  • Jordan Ell

Do developers discuss design?

  • João Brunet
  • Gail C. Murphy
  • Ricardo Terra
  • Jorge Figueiredo
  • Dalton Serey

Magnet or sticky? an OSS project-by-project typology

  • Kazuhiro Yamashita
  • Shane McIntosh
  • Yasutaka Kamei
  • Naoyasu Ubayashi

Security and emotion: sentiment analysis of security discussions on GitHub

  • Daniel Pletea
  • Bogdan Vasilescu
  • Alexander Serebrenik

Sentiment analysis of commit comments in GitHub: an empirical study

  • Emitza Guzman
  • David Azócar
  • Yang Li

Analysing the 'biodiversity' of open source ecosystems: the GitHub case

  • Nicholas Matragkas
  • James R. Williams
  • Dimitris S. Kolovos
  • Richard F. Paige

Co-evolution of project documentation and popularity within github

  • Karan Aggarwal
  • Abram Hindle
  • Eleni Stroulia

An insight into the pull requests of GitHub

  • Mohammad Masudur Rahman
  • Chanchal K. Roy

SESSION: Data Showcase

A dataset for pull-based development research

  • Georgios Gousios
  • Andy Zaidman

The bug catalog of the maven ecosystem

  • Dimitris Mitropoulos
  • Vassilios Karakoidas
  • Panos Louridas
  • Georgios Gousios
  • Diomidis Spinellis

A dataset of feature additions and feature removals from the Linux kernel

  • Leonardo Passos
  • Krzysztof Czarnecki

Kataribe: a hosting service of historage repositories

  • Kenji Fujiwara
  • Hideaki Hata
  • Erina Makihara
  • Yusuke Fujihara
  • Naoki Nakayama
  • Hajimu Iida
  • Kenichi Matsumoto

Lean GHTorrent: GitHub data on demand

  • Georgios Gousios
  • Bogdan Vasilescu
  • Alexander Serebrenik
  • Andy Zaidman

A code clone oracle

  • Daniel E. Krutz
  • Wei Le

Generating duplicate bug datasets

  • Alina Lazar
  • Sarah Ritchey
  • Bonita Sharif

FLOSS 2013: a survey dataset about free software contributors: challenges for curating, sharing, and combining

  • Gregorio Robles
  • Laura Arjona Reina
  • Alexander Serebrenik
  • Bogdan Vasilescu
  • Jesús M. González-Barahona

A green miner's dataset: mining the impact of software change on energy consumption

  • Chenlei Zhang
  • Abram Hindle

Gentoo package dependencies over time

  • Remco Bloemen
  • Chintan Amrit
  • Stefan Kuhlmann
  • Gonzalo Ordóñez–Matamoros

Models of OSS project meta-information: a dataset of three forges

  • James R. Williams
  • Davide Di Ruscio
  • Nicholas Matragkas
  • Juri Di Rocco
  • Dimitris S. Kolovos

A dataset of clone references with gaps

  • Hiroaki Murakami
  • Yoshiki Higo
  • Shinji Kusumoto

A dataset for maven artifacts and bug patterns found in them

  • Vaibhav Saini
  • Hitesh Sajnani
  • Joel Ossher
  • Cristina V. Lopes

OpenHub: a scalable architecture for the analysis of software quality attributes

  • Gabriel Farah
  • Juan Sebastian Tejada
  • Dario Correal

Understanding software evolution: the maisqual ant data set

  • Boris Baldassari
  • Philippe Preux