skip to main content
10.1145/2245276.2231975acmconferencesArticle/Chapter ViewAbstractPublication PagessacConference Proceedingsconference-collections
poster

Investigating fault prediction capabilities of five prediction models for software quality

Authors Info & Claims
Published:26 March 2012Publication History

ABSTRACT

Predicting faults in software modules can lead to a high quality and more effective software development process to follow. However, the results of a fault prediction model have to be properly interpreted before incorporating them into any decision making. Most of the earlier studies have used the prediction accuracy as the main criteria to compare amongst competing fault prediction models. However, we show that besides accuracy, other criteria like number of false positives and false negatives can equally be important to choose a candidate model for fault prediction. We have used five NASA software data sets in our experiment. Our results suggest that the performance of Simple Logistic is better than the others on raw data sets whereas the performance of Neural Network was found to be better when we applied dimensionality reduction method on raw data sets. When we used data pre-processing techniques, the prediction accuracy of Random Forest was found to be better in both cases i.e. with and without dimensionality reduction but reliability of Simple Logistic was better than Random Forest because it had less number of fault negatives.

References

  1. NASA Data Repository http://mdp.ivv.nasa.govGoogle ScholarGoogle Scholar
  2. T. M. Khoshgoftaar and N. Seliya. Tree-Based Software Quality Estimation Models for Fault Prediction. Proc. The 8th IIIE Symposium on Software Metrics, p. 203--214, Jun 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations, Morgan Kaufmann, October 1999. http://www.cs.waikato.ac.nz/ml/weka/ Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Thwin M. M. and Quah T. Application of Neural Networks for Software Quality Prediction Using Object-Oriented Metrics. Proc. The 19th International Conference on Software Maintenance, Amsterdam, The Netherlands. p. 113--122, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bibi S., Tsoumakas G., Stamelos I. and Vlahvas I. Software Defect Prediction Using Regression via Classification. Proc IEEE International Conference on Computer Systems and Applications IEEE Computer Society. Dubai, UAE. p. 330--336, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Lan Guo, Yan Ma, Bojan Cukic and Harshinder Singh, Robust Prediction of Fault-Proneness by Random Forests, Proc. The 15th International Symposium on Software Reliability Engineering (ISSRE'04), Brittany, France. p. 417--428, November 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Recommendations

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Sign in
  • Published in

    cover image ACM Conferences
    SAC '12: Proceedings of the 27th Annual ACM Symposium on Applied Computing
    March 2012
    2179 pages
    ISBN:9781450308571
    DOI:10.1145/2245276
    • Conference Chairs:
    • Sascha Ossowski,
    • Paola Lecca

    Copyright © 2012 Authors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    • Published: 26 March 2012

    Permissions

    Request permissions about this article.

    Request Permissions

    Check for updates

    Qualifiers

    • poster

    Acceptance Rates

    SAC '12 Paper Acceptance Rate270of1,056submissions,26%Overall Acceptance Rate1,650of6,669submissions,25%
  • Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0

    Other Metrics

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader