Opentocs

MSR '18- Proceedings of the 15th International Conference on Mining Software Repositories

Full Citation in the ACM Digital Library

SESSION: Data showcase

50K-C: a dataset of compilable, and compiled, Java projects

Pedro Martins
Rohan Achar
Cristina V. Lopes

We provide a repository of 50,000 compilable Java projects. Each project in this dataset comes with references to all the dependencies required to compile it, the resulting bytecode, and the scripts with which the projects were built.

MSR '20: Proceedings of the 17th International Conference on Mining Software Repositories

Full Citation in the ACM Digital Library

SESSION: Mining Challenge

The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History

Antoine Pietri
Diomidis Spinellis
Stefano Zacchiroli

Software Heritage is the largest existing public archive of software source code and accompanying development history. It spans more than five billion unique source code files and one billion unique commits, coming from more than 80 million software projects. These software artifacts were retrieved from major collaborative development platforms (e.g., GitHub, GitLab) and package repositories (e.g., PyPI, Debian, NPM), and stored in a uniform representation linking together source code files, directories, commits, and full snapshots of version control systems (VCS) repositories as observed by Software Heritage during periodic crawls. This dataset is unique in terms of accessibility and scale, and allows to explore a number of research questions on the long tail of public software development, instead of solely focusing on "most starred" repositories as it often happens.

NL4SE 2018- Proceedings of the 4th ACM SIGSOFT International Workshop on NLP for Software Engineering

Full Citation in the ACM Digital Library

SESSION: Keynote

Learning from code with graphs (keynote)

Marc Brockschmidt

Learning from large corpora of source code ("Big Code") has seen increasing interest over the past few years. A first wave of work has focused on leveraging off-the-shelf methods from other machine learning fields such as natural language processing. While these techniques have succeeded in showing the feasibility of learning from code, and led to some initial practical solutions, they forego explicit use of known program semantics. In a range of recent work, we have tried to solve this issue by integrating deep learning techniques with program analysis methods in graphs. Graphs are a convenient, general formalism to model entities and their relationships, and are seeing increasing interest from machine learning researchers as well. In this talk, I present two applications of graph-based learning to understanding and generating programs and discuss a range of future work building on the success of this work.

PESOS 2014- Proceedings of the 6th International Workshop on Principles of Engineering Service-Oriented and Cloud Systems

Full Citation in the ACM Digital Library

SLA evaluation with on-the-fly measurements of distributed service implementation over clouds

Kaliappa Ravindran
Arun Adiththan
Michael Iannelli

A survey on checkpointing web services

A. Vani Vathsala
Hrushikesha Mohanty

Consumer-centric non-functional properties of SOA-based services

Hanane Becha
Daniel Amyot

Interaction patterns based checkpointing of choreographed web services

A. Vani Vathsala
Hrushikesha Mohanty

Towards the formalization of properties of cloud-based elastic systems

Marcello M. Bersani
Domenico Bianculli
Schahram Dustdar
Alessio Gambi
Carlo Ghezzi
Srđan Krstić

Towards exploiting the full adaptation potential of cloud applications

Clarissa Cassales Marquezan
Florian Wessling
Andreas Metzger
Klaus Pohl
Chris Woods
Karl Wallbom

PROMISE 2021: Proceedings of the 17th International Conference on Predictive Models and Data Analytics in Software Engineering

Full Citation in the ACM Digital Library

SESSION: Papers

Heterogeneous ensemble imputation for software development effort estimation

Ibtissam Abnane
Ali Idri
Mohamed Hosni
Alain Abran

Choosing the appropriate Missing Data (MD) imputation technique for a given Software development effort estimation (SDEE) technique is not a trivial task. In fact, the impact of the MD imputation on the estimation output depends on the dataset and the SDEE technique used and there is no best imputation technique in all contexts. Thus, an attractive solution is to use more than one single imputation technique and combine their results for a final imputation outcome. This concept is called ensemble imputation and can help to significantly improve the estimation accuracy. This paper develops and evaluates a heterogeneous ensemble imputation whose members were the four single imputation techniques: K-Nearest Neighbors (KNN), Expectation Maximization (EM), Support Vector Regression (SVR), and Decision Trees (DT). The impact of the ensemble imputation was evaluated and compared with those of the four single imputation techniques on the accuracy measured in terms of the standardized accuracy criterion of four SDEE techniques: Case Based Reasoning (CBR), Multi-Layers Perceptron (MLP), Support Vector Regression (SVR) and Reduced Error Pruning Tree (REPTree). The Wilcoxon statistical test was also performed in order to assess whether the results are significant. All the empirical evaluations were carried out over the six datasets, namely, ISBSG, China, COCOMO81, Desharnais, Kemerer, and Miyazaki. Results show that the use of heterogeneous ensemble-based imputation instead single imputation significantly improved the accuracy of the four SDEE techniques. Indeed, the ensemble imputation technique was ranked either first or second in all contexts.

QoSA '14- Proceedings of the 10th international ACM Sigsoft conference on Quality of software architectures

Full Citation in the ACM Digital Library

SESSION: Keynote address

Tomas Bures

Trust or verify?

Bertrand Meyer

SESSION: Performance of architectures

Bholanathsingh Surajbali

Automatic detection of performance anti-patterns in inter-component communications

Alexander Wert
Marius Oehler
Christoph Heger
Roozbeh Farahbod

Architectural tactics support in cloud computing providers: the jelastic case

Jaime Chavarriaga
Carlos A. Noguera
Rubby Casallas
Viviane Jonckers

Performance-based selection of software and hardware features under parameter uncertainty

Leire Etxeberria
Catia Trubiani
Vittorio Cortellessa
Goiuria Sagardui

SESSION: Performance modelling

Catia Trubiani

Dealing with uncertainties in the performance modelling of software systems

Diego Perez-Palacin
Raffaela Mirandola

Experiences with modeling memory contention for multi-core industrial real-time systems

Thijmen de Gooijer
K. Eric Harper

Using architecture-level performance models as resource profiles for enterprise applications

Andreas Brunnert
Kilian Wischer
Helmut Krcmar

SESSION: Architecture evaluation

Ralf Reussner

Empirical resilience evaluation of an architecture-based self-adaptive software system

Javier Cámara
Pedro Correia
Rogério de Lemos
Marco Vieira

Architecture management and evaluation in mature products: experiences from a lightweight approach

Mikko Raatikainen
Juha Savolainen
Tomi Männistö

Failure data collection for reliability prediction models: a survey

Barbora Buhnova
Stanislav Chren
Lucie Fabriková

SESSION: Architecture analysis I

Barbora Bühnová

Efficient re-resolution of SMT specifications for evolving software architectures

Kenneth Johnson
Radu Calinescu

Regression verification of AADL models through slicing of system dependence graphs

Andreas Johnsen
Kristina Lundqvist
Paul Pettersson
Kaj Hänninen

Evaluation of a static architectural conformance checking method in a line of computer games

Tobias Olsson
Daniel Toll
Anna Wingkvist
Morgan Ericsson

SESSION: Architecture analysis II

Ivica Crnkovic

An empirical investigation of modularity metrics for indicating architectural technical debt

Zengyang Li
Peng Liang
Paris Avgeriou
Nicolas Guelfi
Apostolos Ampatzoglou

Formalizing correspondence rules for automotive architecture views

Yanja Dajsuren
Christine M. Gerpheide
Alexander Serebrenik
Anton Wijs
Bogdan Vasilescu
Mark G.J. van den Brand

SRMP: a software pattern for deadlocks prevention inreal-time concurrency models

Rania Mzid
Chokri Mraidha
Jean-Philippe Babau
Mohamed Abid

TUTORIAL SESSION: Tutorials

Software QoS enhancement through self-adaptation and formal models

Raffaela Mirandola
Diego Perez-Palacin

Designing and evolving distributed architecture using kevoree

François Fouquet
Grégory Nain
Erwan Daubert
Johann Bourcier
Olivier Barais
Noel Plouzeau
Brice Morin

RAISE 2014- Proceedings of the 3rd International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

Full Citation in the ACM Digital Library

Towards more intelligent trace retrieval algorithms

Jane Cleland-Huang
Jin Guo

A mapping study on bayesian networks for software quality prediction

Ayse Tosun Misirli
Ayşe Başar Bener

SANAYOJAN: a framework for traceability link recovery between use-cases in software requirement specification and regulatory documents

Ritika Jain
Smita Ghaisas
Ashish Sureka

Supporting comprehension of unfamiliar programs by modeling an expert's perception

Naveen Kulkarni
Vasudeva Varma

Machine learning for constituency test of coordinating conjunctions in requirements specifications

Richa Sharma
Jaspreet Bhatia
K. K. Biswas

OCL usability: a major challenge in adopting UML

Imran Sarwar Bajwa
Behzad Bordbar
Mark Lee

A self-learning approach for validation of communication in embedded systems

Falk Langer
Erik Oswald

Deriving time lines from texts

Mathias Landhäußer
Tobias Hey
Walter F. Tichy

Mining issue tracking systems using topic models for trend analysis, corpus exploration, and understanding evolution

Ayushi Aggarwal
Gajendra Waghmare
Ashish Sureka

RAISE '18- Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

Full Citation in the ACM Digital Library

SESSION: Natural language and text data

Integrating a dialog component into a framework for spoken language understanding

Sebastian Weigelt
Tobias Hey
Mathias Landhäußer

Spoken language interfaces are the latest trend in human computer interaction. Users enjoy the newly found freedom but developers face an unfamiliar and daunting task. Creating reactive spoken language interfaces requires skills in natural language processing. We show how a developer can integrate a dialog component in a natural language processing system by means of software engineering methods. Our research project PARSE that aims at naturalistic end-user programming in spoken natural language serves as an example. We integrate a dialog component with PARSE without affecting its other components: We modularize the dialog management and introduce dialog acts that bundle a trigger for the dialog and the reaction of the system. We implemented three dialog acts to address the following issues: speech recognition uncertainties, coreference ambiguities, and incomplete conditionals.

RCoSE 2014- Proceedings of the 1st International Workshop on Rapid Continuous Software Engineering

Full Citation in the ACM Digital Library

SESSION: Overview

Continuous software engineering and beyond: trends and challenges

Brian Fitzgerald
Klaas-Jan Stol

SESSION: Technology Aspects

Rapid requirements checks with requirements smells: two case studies

Henning Femmer
Daniel Méndez Fernández
Elmar Juergens
Michael Klose
Ilona Zimmer
Jörg Zimmer

Rapidly locating and understanding errors using runtime monitoring of architecture-carrying code

Marco Konersmann

Building blocks for continuous experimentation

Fabian Fagerholm
Alejandro Sanchez Guinea
Hanna Mäenpää
Jürgen Münch

SESSION: Process Aspects

Supported approach for agile methods adaptation: an adoption study

Hajer Ayed
Benoît Vanderose
Naji Habra

Rugby: an agile process model based on continuous delivery

Stephan Krusche
Lukas Alperowitz
Bernd Bruegge
Martin O. Wagner

Scrum for cyber-physical systems: a process proposal

Stefan Wagner

SESSION: Cross-Cutting Concerns

Personalised continuous software engineering

Efi Papatheocharous
Marios Belk
Jaana Nyfjord
Panagiotis Germanakos
George Samaras

RCoSE '18- Proceedings of the 4th International Workshop on Rapid Continuous Software Engineering

Full Citation in the ACM Digital Library

SESSION: Tooling and applications

Designing a next-generation continuous software delivery system: concepts and architecture

Andreas Steffens
Horst Lichter
Jan Simon Döring

Continuous Integration and Continuous Delivery are established practices in modern agile software development. The DevOps movement adapted theses practices and places the deployment pipeline at its heart as one of the main requirements to automate the software development process and to deliver and operate software in a more robust way with higher quality.

SIGSOFT Awards

Conference Awards

Opentocs

MSR '18- Proceedings of the 15th International Conference on Mining Software Repositories

SESSION: Data showcase

MSR '20: Proceedings of the 17th International Conference on Mining Software Repositories

SESSION: Mining Challenge

NL4SE 2018- Proceedings of the 4th ACM SIGSOFT International Workshop on NLP for Software Engineering

SESSION: Keynote

PESOS 2014- Proceedings of the 6th International Workshop on Principles of Engineering Service-Oriented and Cloud Systems

PROMISE 2021: Proceedings of the 17th International Conference on Predictive Models and Data Analytics in Software Engineering

SESSION: Papers

QoSA '14- Proceedings of the 10th international ACM Sigsoft conference on Quality of software architectures

SESSION: Keynote address

SESSION: Performance of architectures

SESSION: Performance modelling

SESSION: Architecture evaluation

SESSION: Architecture analysis I

SESSION: Architecture analysis II

TUTORIAL SESSION: Tutorials

RAISE 2014- Proceedings of the 3rd International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

RAISE '18- Proceedings of the 6th International Workshop on Realizing Artificial Intelligence Synergies in Software Engineering

SESSION: Natural language and text data

RCoSE 2014- Proceedings of the 1st International Workshop on Rapid Continuous Software Engineering

SESSION: Overview

SESSION: Technology Aspects

SESSION: Process Aspects

SESSION: Cross-Cutting Concerns

RCoSE '18- Proceedings of the 4th International Workshop on Rapid Continuous Software Engineering

SESSION: Tooling and applications

Search