skip to main content
10.1145/1101908.1101976acmconferencesArticle/Chapter ViewAbstractPublication PagesaseConference Proceedingsconference-collections
Article

Specialization and extrapolation of software cost models

Published: 07 November 2005 Publication History

Abstract

Despite the widespread availability of software effort estimation models (e.g. COCOMO [2], Price-S [12], SEER-SEM [13], SLIM [14]), most managers still estimate new projects by extrapolating from old projects [3, 5, 7]. In this delta method, the cost of the next project is the cost of the last project multiplied by some factors modeling the difference between old and new projects [2].Delta estimation is simple, fast, and best of all, can take full advantage of local costing information. However delta estimation fails when the experience base (the old projects) can not be extrapolated to the new projects. Previously [10], we have shown that for a set of NASA projects, delta estimation would usually fail since most of the features and coefficients of the learned model vary wildly across sub-samples of the training data. In that prior work, no solution was offered for this problem.Here, we offer a solution and report the results of experiment with feature subset selection (FSS) and extrapolation. FSS methods are usually assessed via the mean change in model performance. However, as shown below, FSS can significantly reduce the variance as well. Hence, FSS should be routinely used in cost estimation.Our results should stop the trend in the effort modeling community of continually adding to the number of features in a model in order to improve estimation performance. Here we show that there are benefits in intelligently subtracting model features.

References

[1]
B. Boehm. Software Engineering Economics. Prentice Hall, 1981.
[2]
B. Boehm. Safe and simple software cost analysis. IEEE Software, pages 14-17, September/October 2000. Available from http://www.computer.org/ certification/beta/Boehm_Safe.pdf.
[3]
B. Boehm. Personnel communication, 2003.
[4]
B. Boehm, E. Horowitz, R. Madachy, D. Reifer, B. K. Clark, B. Steece, A. W. Brown, S. Chulani, and C. Abts. Software Cost Estimation with Cocomo II. Prentice Hall, 2000.
[5]
A. Griesel, J. Hihn, K. Bruno, and R. Tausworthe. Software Forecasting: As it is Really Done: A Study of JPL Software Engineers. In Proceedings of the Eighteenth Annual Software Engineering Workshop, Goddard Space Flight Center, Decemeber 1993.
[6]
M. Hall and G. Holmes. Benchmarking attribute selection techniques for discrete class data mining. IEEE Transactions On Knowledge And Data Engineering, 15(6):1437- 1447, 2003.
[7]
J. Hihn and H. Habib-agahi. Cost estimation of software intensive projects: A survey of current practices. In Proceedings of the Thirteenth IEEE International Conference of Software Engineering, May 1991.
[8]
C. Kirsopp and M. Shepperd. Case and feature subset selection in case-based software project effort prediction. In Proc. of 22nd SGAI International Conference on Knowledge-Based Systems and Applied Artificial Intelligence, Cambridge, UK, 2002.
[9]
R. Kohavi and G. H. John. Wrappers for feature subset selection. Artificial Intelligence, 97(1-2):273-324, 1997.
[10]
T. Menzies, Z. Chen, D. Port, and J. Hihn. Simple software cost estimation: Safe or unsafe? In Proceedings, PROMISE workshop, ICSE 2005, 2005. Available from http://menzies.us/pdf/05safewhen.pdf.
[11]
A. Miller. Subset Selection in Regression (second edition). Chapman & Hall, 2002.
[12]
P. S. L. M. L. NJ. Your guide to price-s: Estimating cost and schedule of software development and support, 1998.
[13]
D. of USA. Parametric cost estimating handbook, second edition, 1999.
[14]
L. H. Putnam. Software Cost Estimating and Life-Cycle Control: Getting the Software Numbers, New York. The Institute of Electrical and Electronics Engineers, Inc., 1980.
[15]
J. R. Quinlan. Learning with Continuous Classes. In 5th Australian Joint Conference on Artificial Intelligence, pages 343-348, 1992. Available from http://citeseer.nj.nec.com/quinlan92learning.html.
[16]
I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, 1999.

Cited By

View all
  • (2016)Improving effort estimation of Fuzzy Analogy using feature subset selection2016 IEEE Symposium Series on Computational Intelligence (SSCI)10.1109/SSCI.2016.7849928(1-8)Online publication date: Dec-2016
  • (2010)Regularities in learning defect predictorsProceedings of the 11th international conference on Product-Focused Software Process Improvement10.1007/978-3-642-13792-1_11(116-130)Online publication date: 21-Jun-2010
  • (2008)The effects of data mining techniques on software cost estimation2008 IEEE International Engineering Management Conference10.1109/IEMCE.2008.4617949(1-5)Online publication date: Jun-2008

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ASE '05: Proceedings of the 20th IEEE/ACM International Conference on Automated Software Engineering
November 2005
482 pages
ISBN:1581139934
DOI:10.1145/1101908
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2005

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Article

Conference

ASE05

Acceptance Rates

Overall Acceptance Rate 82 of 337 submissions, 24%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 07 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2016)Improving effort estimation of Fuzzy Analogy using feature subset selection2016 IEEE Symposium Series on Computational Intelligence (SSCI)10.1109/SSCI.2016.7849928(1-8)Online publication date: Dec-2016
  • (2010)Regularities in learning defect predictorsProceedings of the 11th international conference on Product-Focused Software Process Improvement10.1007/978-3-642-13792-1_11(116-130)Online publication date: 21-Jun-2010
  • (2008)The effects of data mining techniques on software cost estimation2008 IEEE International Engineering Management Conference10.1109/IEMCE.2008.4617949(1-5)Online publication date: Jun-2008

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media