ABSTRACT
Proposing a new method for automatically detecting, localising, or repairing software faults requires a fair, reproducible evaluation of the effectiveness of the method relative to existing alternatives. Measuring effectiveness requires both an indicative set of bugs, and a mechanism for reliably reproducing and interacting with those bugs. We present BugZoo: a decentralised platform for distributing, reproducing, and interacting with historical software bugs. BugZoo connects existing datasets and tools to developers and researchers, and provides a controlled environment for conducting experiments. To ensure reproducibility, extensibility, and usability, BugZoo uses Docker containers to package, deliver, and interact with bugs and tools. Adding BugZoo support to existing datasets and tools is simple and non-invasive, requiring only a small number of supplementary files. BugZoo is open-source and available to download at: https://github.com/squaresLab/BugZoo
- H Do, S Elbaum, and G Rothermel. 2005. Supporting Controlled Experimentation with Testing Techniques: An Infrastructure and its Potential Impact. Empirical Software Engineering 10, 4 (1 Oct. 2005), 405--435. Google ScholarDigital Library
- W Felter, A Ferreira, R Rajamony, and J Rubio. 2015. An updated performance comparison of virtual machines and Linux containers. In International Symposium on Performance Analysis of Systems and Software (ISPASS '15). 171--172.Google ScholarCross Ref
- R Just, D Jalali, and M D Ernst. 2014. Defects4J: A Database of Existing Faults to Enable Controlled Testing Studies for Java Programs. In Proceedings of the 2014 International Symposium on Software Testing and Analysis (ISSTA '14). ACM, New York, NY, USA, 437--440. Google ScholarDigital Library
- J Lacomis, A Jaffe, E J Schwartz, C Le Goues, and B Vasilescu. 2018. Statistical Machine Translation is a Natural Fit for Identifier Renaming in Software Source Code. In Statistical Modeling of Natural Software Corpora, 2018 AAAI Workshop. (To appear).Google Scholar
- C Le Goues, N Holtschulte, E K Smith, Y Brun, P Devanbu, S Forrest, and W Weimer. 2015. The ManyBugs and IntroClass Benchmarks for Automated Repair of C Programs. Transactions on Software Engineering 41, 12 (Dec. 2015), 1236--1256.Google ScholarCross Ref
- J Petke, M Harman, W B Langdon, and W Westley 2014. Using genetic improvement and code transplants to specialise a C++ program to a problem class. In European Conference on Genetic Programming. Springer, Berlin, Heidelberg, 137--149. Google ScholarDigital Library
- C S Timperley, S Stepney and C Le Goues. 2017. An Investigation into the Use of Mutation Analysis for Automated Program Repair. In Symposium on Search-Based Software Engineering (SSBSE '17). 99--114.Google ScholarCross Ref
- BugZoo: a platform for studying software bugs
Recommendations
A probabilistic method for the detection and genotyping of small indels from population-scale sequence data
Motivation: High-throughput sequencing technologies have made population-scale studies of human genetic variation possible. Accurate and comprehensive detection of DNA sequence variants is crucial for the success of these studies. Small insertions ...
Comments