Cold-start Software Analytics

Jin Guo, Mona Rahimi, Jane Cleland-Huang, Alexander Rasin, Jane Huffman Hayes, Michael Vierhauser

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

22 Scopus citations

Abstract

Software project artifacts such as source code, requirements, and change logs represent a gold-mine of actionable information. As a result, software analytic solutions have been developed to mine repositories and answer questions such as "who is the expert?," "which classes are fault prone?," or even "who are the domain experts for these fault-prone classes?" Analytics often require training and configuring in order to maximize performance within the context of each project. A cold-start problem exists when a function is applied within a project context without first configuring the analytic functions on project-specific data. This scenario exists because of the non-trivial effort necessary to instrument a project environment with candidate tools and algorithms and to empirically evaluate alternate configurations. We address the cold-start problem by comparatively evaluating 'best-of-breed' and 'profile-driven' solutions, both of which reuse known configurations in new project contexts. We describe and evaluate our approach against 20 project datasets for the three analytic areas of artifact connectivity, fault-prediction, and finding the expert, and show that the best-of-breed approach outperformed the profile-driven approach in all three areas; however, while it delivered acceptable results for artifact connectivity and find the expert, both techniques underperformed for cold-start fault prediction.

Original languageEnglish
Title of host publicationProceedings - 13th Working Conference on Mining Software Repositories, MSR 2016
Pages142-153
Number of pages12
ISBN (Electronic)9781450341868
DOIs
StatePublished - May 14 2016
Event13th Working Conference on Mining Software Repositories, MSR 2016 - Austin, United States
Duration: May 14 2016May 15 2016

Publication series

NameProceedings - 13th Working Conference on Mining Software Repositories, MSR 2016

Conference

Conference13th Working Conference on Mining Software Repositories, MSR 2016
Country/TerritoryUnited States
CityAustin
Period5/14/165/15/16

Bibliographical note

Publisher Copyright:
© 2016 ACM.

Keywords

  • Cold-start
  • Configuration
  • Software analytics

ASJC Scopus subject areas

  • Software
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Cold-start Software Analytics'. Together they form a unique fingerprint.

Cite this