Opentocs

MSR '18- Proceedings of the 15th International Conference on Mining Software Repositories

MSR '18- Proceedings of the 15th International Conference on Mining Software Repositories

Full Citation in the ACM Digital Library

SESSION: Data showcase

50K-C: a dataset of compilable, and compiled, Java projects

  • Pedro Martins
  • Rohan Achar
  • Cristina V. Lopes

We provide a repository of 50,000 compilable Java projects. Each project in this dataset comes with references to all the dependencies required to compile it, the resulting bytecode, and the scripts with which the projects were built.

Continue reading

MSR '20: Proceedings of the 17th International Conference on Mining Software Repositories

MSR '20: Proceedings of the 17th International Conference on Mining Software Repositories

Full Citation in the ACM Digital Library

SESSION: Mining Challenge

The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History

  • Antoine Pietri
  • Diomidis Spinellis
  • Stefano Zacchiroli

Software Heritage is the largest existing public archive of software source code and accompanying development history. It spans more than five billion unique source code files and one billion unique commits, coming from more than 80 million software projects. These software artifacts were retrieved from major collaborative development platforms (e.g., GitHub, GitLab) and package repositories (e.g., PyPI, Debian, NPM), and stored in a uniform representation linking together source code files, directories, commits, and full snapshots of version control systems (VCS) repositories as observed by Software Heritage during periodic crawls. This dataset is unique in terms of accessibility and scale, and allows to explore a number of research questions on the long tail of public software development, instead of solely focusing on "most starred" repositories as it often happens.

Continue reading

NL4SE 2018- Proceedings of the 4th ACM SIGSOFT International Workshop on NLP for Software Engineering

NL4SE 2018- Proceedings of the 4th ACM SIGSOFT International Workshop on NLP for Software Engineering

Full Citation in the ACM Digital Library

SESSION: Keynote

Learning from code with graphs (keynote)

  • Marc Brockschmidt

Learning from large corpora of source code ("Big Code") has seen increasing interest over the past few years. A first wave of work has focused on leveraging off-the-shelf methods from other machine learning fields such as natural language processing. While these techniques have succeeded in showing the feasibility of learning from code, and led to some initial practical solutions, they forego explicit use of known program semantics. In a range of recent work, we have tried to solve this issue by integrating deep learning techniques with program analysis methods in graphs. Graphs are a convenient, general formalism to model entities and their relationships, and are seeing increasing interest from machine learning researchers as well. In this talk, I present two applications of graph-based learning to understanding and generating programs and discuss a range of future work building on the success of this work.

Continue reading

PESOS 2014- Proceedings of the 6th International Workshop on Principles of Engineering Service-Oriented and Cloud Systems

PESOS 2014- Proceedings of the 6th International Workshop on Principles of Engineering Service-Oriented and Cloud Systems

Full Citation in the ACM Digital Library

SLA evaluation with on-the-fly measurements of distributed service implementation over clouds

  • Kaliappa Ravindran
  • Arun Adiththan
  • Michael Iannelli

A survey on checkpointing web services

  • A. Vani Vathsala
  • Hrushikesha Mohanty

Consumer-centric non-functional properties of SOA-based services

  • Hanane Becha
  • Daniel Amyot

Interaction patterns based checkpointing of choreographed web services

  • A. Vani Vathsala
  • Hrushikesha Mohanty

Towards the formalization of properties of cloud-based elastic systems

  • Marcello M. Bersani
  • Domenico Bianculli
  • Schahram Dustdar
  • Alessio Gambi
  • Carlo Ghezzi
  • Srđan Krstić

Towards exploiting the full adaptation potential of cloud applications

  • Clarissa Cassales Marquezan
  • Florian Wessling
  • Andreas Metzger
  • Klaus Pohl
  • Chris Woods
  • Karl Wallbom

Continue reading

PROMISE 2021: Proceedings of the 17th International Conference on Predictive Models and Data Analytics in Software Engineering

PROMISE 2021: Proceedings of the 17th International Conference on Predictive Models and Data Analytics in Software Engineering

Full Citation in the ACM Digital Library

SESSION: Papers

Heterogeneous ensemble imputation for software development effort estimation

  • Ibtissam Abnane
  • Ali Idri
  • Mohamed Hosni
  • Alain Abran

Choosing the appropriate Missing Data (MD) imputation technique for a given Software development effort estimation (SDEE) technique is not a trivial task. In fact, the impact of the MD imputation on the estimation output depends on the dataset and the SDEE technique used and there is no best imputation technique in all contexts. Thus, an attractive solution is to use more than one single imputation technique and combine their results for a final imputation outcome. This concept is called ensemble imputation and can help to significantly improve the estimation accuracy. This paper develops and evaluates a heterogeneous ensemble imputation whose members were the four single imputation techniques: K-Nearest Neighbors (KNN), Expectation Maximization (EM), Support Vector Regression (SVR), and Decision Trees (DT). The impact of the ensemble imputation was evaluated and compared with those of the four single imputation techniques on the accuracy measured in terms of the standardized accuracy criterion of four SDEE techniques: Case Based Reasoning (CBR), Multi-Layers Perceptron (MLP), Support Vector Regression (SVR) and Reduced Error Pruning Tree (REPTree). The Wilcoxon statistical test was also performed in order to assess whether the results are significant. All the empirical evaluations were carried out over the six datasets, namely, ISBSG, China, COCOMO81, Desharnais, Kemerer, and Miyazaki. Results show that the use of heterogeneous ensemble-based imputation instead single imputation significantly improved the accuracy of the four SDEE techniques. Indeed, the ensemble imputation technique was ranked either first or second in all contexts.

Continue reading

QoSA '14- Proceedings of the 10th international ACM Sigsoft conference on Quality of software architectures

QoSA '14- Proceedings of the 10th international ACM Sigsoft conference on Quality of software architectures

Full Citation in the ACM Digital Library

SESSION: Keynote address

  • Tomas Bures

Trust or verify?

  • Bertrand Meyer

SESSION: Performance of architectures

  • Bholanathsingh Surajbali

Automatic detection of performance anti-patterns in inter-component communications

  • Alexander Wert
  • Marius Oehler
  • Christoph Heger
  • Roozbeh Farahbod

Architectural tactics support in cloud computing providers: the jelastic case

  • Jaime Chavarriaga
  • Carlos A. Noguera
  • Rubby Casallas
  • Viviane Jonckers

Performance-based selection of software and hardware features under parameter uncertainty

  • Leire Etxeberria
  • Catia Trubiani
  • Vittorio Cortellessa
  • Goiuria Sagardui

SESSION: Performance modelling

  • Catia Trubiani

Dealing with uncertainties in the performance modelling of software systems

  • Diego Perez-Palacin
  • Raffaela Mirandola

Experiences with modeling memory contention for multi-core industrial real-time systems

  • Thijmen de Gooijer
  • K. Eric Harper

Using architecture-level performance models as resource profiles for enterprise applications

  • Andreas Brunnert
  • Kilian Wischer
  • Helmut Krcmar

SESSION: Architecture evaluation

  • Ralf Reussner

Empirical resilience evaluation of an architecture-based self-adaptive software system

  • Javier Cámara
  • Pedro Correia
  • Rogério de Lemos
  • Marco Vieira

Architecture management and evaluation in mature products: experiences from a lightweight approach

  • Mikko Raatikainen
  • Juha Savolainen
  • Tomi Männistö

Failure data collection for reliability prediction models: a survey

  • Barbora Buhnova
  • Stanislav Chren
  • Lucie Fabriková

SESSION: Architecture analysis I

  • Barbora Bühnová

Efficient re-resolution of SMT specifications for evolving software architectures

  • Kenneth Johnson
  • Radu Calinescu

Regression verification of AADL models through slicing of system dependence graphs

  • Andreas Johnsen
  • Kristina Lundqvist
  • Paul Pettersson
  • Kaj Hänninen

Evaluation of a static architectural conformance checking method in a line of computer games

  • Tobias Olsson
  • Daniel Toll
  • Anna Wingkvist
  • Morgan Ericsson

SESSION: Architecture analysis II

  • Ivica Crnkovic

An empirical investigation of modularity metrics for indicating architectural technical debt

  • Zengyang Li
  • Peng Liang
  • Paris Avgeriou
  • Nicolas Guelfi
  • Apostolos Ampatzoglou

Formalizing correspondence rules for automotive architecture views

  • Yanja Dajsuren
  • Christine M. Gerpheide
  • Alexander Serebrenik
  • Anton Wijs
  • Bogdan Vasilescu
  • Mark G.J. van den Brand

SRMP: a software pattern for deadlocks prevention inreal-time concurrency models

  • Rania Mzid
  • Chokri Mraidha
  • Jean-Philippe Babau
  • Mohamed Abid

TUTORIAL SESSION: Tutorials

Software QoS enhancement through self-adaptation and formal models

  • Raffaela Mirandola
  • Diego Perez-Palacin

Designing and evolving distributed architecture using kevoree

  • François Fouquet
  • Grégory Nain
  • Erwan Daubert
  • Johann Bourcier
  • Olivier Barais
  • Noel Plouzeau
  • Brice Morin

Continue reading

RAISE 2014- Proceedings of the 3rd International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

Continue reading

RAISE '18- Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

RAISE '18- Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

Full Citation in the ACM Digital Library

SESSION: Natural language and text data

Integrating a dialog component into a framework for spoken language understanding

  • Sebastian Weigelt
  • Tobias Hey
  • Mathias Landhäußer

Spoken language interfaces are the latest trend in human computer interaction. Users enjoy the newly found freedom but developers face an unfamiliar and daunting task. Creating reactive spoken language interfaces requires skills in natural language processing. We show how a developer can integrate a dialog component in a natural language processing system by means of software engineering methods. Our research project PARSE that aims at naturalistic end-user programming in spoken natural language serves as an example. We integrate a dialog component with PARSE without affecting its other components: We modularize the dialog management and introduce dialog acts that bundle a trigger for the dialog and the reaction of the system. We implemented three dialog acts to address the following issues: speech recognition uncertainties, coreference ambiguities, and incomplete conditionals.

Continue reading

RCoSE 2014- Proceedings of the 1st International Workshop on Rapid Continuous Software Engineering

RCoSE 2014- Proceedings of the 1st International Workshop on Rapid Continuous Software Engineering

Full Citation in the ACM Digital Library

SESSION: Overview

Continuous software engineering and beyond: trends and challenges

  • Brian Fitzgerald
  • Klaas-Jan Stol

SESSION: Technology Aspects

Rapid requirements checks with requirements smells: two case studies

  • Henning Femmer
  • Daniel Méndez Fernández
  • Elmar Juergens
  • Michael Klose
  • Ilona Zimmer
  • Jörg Zimmer

Rapidly locating and understanding errors using runtime monitoring of architecture-carrying code

  • Marco Konersmann

Building blocks for continuous experimentation

  • Fabian Fagerholm
  • Alejandro Sanchez Guinea
  • Hanna Mäenpää
  • Jürgen Münch

SESSION: Process Aspects

Supported approach for agile methods adaptation: an adoption study

  • Hajer Ayed
  • Benoît Vanderose
  • Naji Habra

Rugby: an agile process model based on continuous delivery

  • Stephan Krusche
  • Lukas Alperowitz
  • Bernd Bruegge
  • Martin O. Wagner

Scrum for cyber-physical systems: a process proposal

  • Stefan Wagner

SESSION: Cross-Cutting Concerns

Personalised continuous software engineering

  • Efi Papatheocharous
  • Marios Belk
  • Jaana Nyfjord
  • Panagiotis Germanakos
  • George Samaras

Continue reading

RCoSE '18- Proceedings of the 4th International Workshop on Rapid Continuous Software Engineering

RCoSE '18- Proceedings of the 4th International Workshop on Rapid Continuous Software Engineering

Full Citation in the ACM Digital Library

SESSION: Tooling and applications

Designing a next-generation continuous software delivery system: concepts and architecture

  • Andreas Steffens
  • Horst Lichter
  • Jan Simon Döring

Continuous Integration and Continuous Delivery are established practices in modern agile software development. The DevOps movement adapted theses practices and places the deployment pipeline at its heart as one of the main requirements to automate the software development process and to deliver and operate software in a more robust way with higher quality.

Continue reading