Article

Specialization and extrapolation of software cost models

Authors:

Jairus Hihn

ASE '05: Proceedings of the 20th IEEE/ACM International Conference on Automated Software Engineering

Pages 384 - 387

https://doi.org/10.1145/1101908.1101976

Published: 07 November 2005

Abstract

Despite the widespread availability of software effort estimation models (e.g., COCOMO [2], Price-S [12], SEER-SEM [13], SLIM [14]), most managers still estimate new projects by extrapolating from old projects [3, 5, 7]. In this delta method, the cost of the next project is the cost of the last project multiplied by some factors modeling the difference between old and new projects [2].

Delta estimation is simple, fast, and, best of all, can take full advantage of local costing information. However, delta estimation fails when the experience base (the old projects) cannot be extrapolated to the new projects. Previously [10], we showed that, for a set of NASA projects, delta estimation would usually fail because most of the features and coefficients of the learned model vary wildly across sub-samples of the training data. That prior work offered no solution to this problem.

Here, we offer a solution and report the results of experiments with feature subset selection (FSS) and extrapolation. FSS methods are usually assessed via the mean change in model performance. However, as shown below, FSS can significantly reduce the variance as well. Hence, FSS should be routinely used in cost estimation.

Our results should stop the trend in the effort modeling community of continually adding to the number of features in a model in order to improve estimation performance. Here we show that there are benefits in intelligently subtracting model features.
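
The delta method described in the abstract amounts to a single multiplication: the cost of the last project scaled by a set of adjustment factors. The following is a minimal sketch of that idea, not the authors' implementation. The function name `delta_estimate`, the cost units, and the example factor values are hypothetical and chosen only for illustration.

```python
from math import prod


def delta_estimate(last_cost, factors):
    """Estimate the next project's cost with the delta method.

    last_cost: cost of the previous project (any consistent unit,
               e.g. staff-months; the unit is an assumption here).
    factors:   multiplicative adjustments modeling the differences
               between the old and the new project. A factor above 1
               raises the estimate, a factor below 1 lowers it.
    """
    if last_cost < 0:
        raise ValueError("last_cost must be non-negative")
    if any(f <= 0 for f in factors):
        raise ValueError("adjustment factors must be positive")
    # With no factors, prod() returns 1 and the estimate equals last_cost.
    return last_cost * prod(factors)


# Hypothetical example: the last project cost 200 staff-months; the new
# one is judged 1.5x more complex but reuses half of the existing code.
estimate = delta_estimate(200.0, [1.5, 0.5])
print(estimate)
```

The sketch also makes the failure mode easy to see: the estimate is only as good as the factors, so if the factors learned from old projects vary wildly across sub-samples, the product varies with them.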

References

[1]

B. Boehm. Software Engineering Economics. Prentice Hall, 1981.

[2]

B. Boehm. Safe and simple software cost analysis. IEEE Software, pages 14-17, September/October 2000. Available from http://www.computer.org/certification/beta/Boehm_Safe.pdf.

[3]

B. Boehm. Personal communication, 2003.

[4]

B. Boehm, E. Horowitz, R. Madachy, D. Reifer, B. K. Clark, B. Steece, A. W. Brown, S. Chulani, and C. Abts. Software Cost Estimation with COCOMO II. Prentice Hall, 2000.

[5]

A. Griesel, J. Hihn, K. Bruno, and R. Tausworthe. Software Forecasting: As it is Really Done: A Study of JPL Software Engineers. In Proceedings of the Eighteenth Annual Software Engineering Workshop, Goddard Space Flight Center, December 1993.

[6]

M. Hall and G. Holmes. Benchmarking attribute selection techniques for discrete class data mining. IEEE Transactions on Knowledge and Data Engineering, 15(6):1437-1447, 2003.

[7]

J. Hihn and H. Habib-agahi. Cost estimation of software intensive projects: A survey of current practices. In Proceedings of the Thirteenth IEEE International Conference of Software Engineering, May 1991.

[8]

C. Kirsopp and M. Shepperd. Case and feature subset selection in case-based software project effort prediction. In Proc. of 22nd SGAI International Conference on Knowledge-Based Systems and Applied Artificial Intelligence, Cambridge, UK, 2002.

[9]

R. Kohavi and G. H. John. Wrappers for feature subset selection. Artificial Intelligence, 97(1-2):273-324, 1997.

[10]

T. Menzies, Z. Chen, D. Port, and J. Hihn. Simple software cost estimation: Safe or unsafe? In Proceedings, PROMISE workshop, ICSE 2005, 2005. Available from http://menzies.us/pdf/05safewhen.pdf.

[11]

A. Miller. Subset Selection in Regression (second edition). Chapman & Hall, 2002.

[12]

P. S. L. M. L. NJ. Your guide to price-s: Estimating cost and schedule of software development and support, 1998.

[13]

D. of USA. Parametric cost estimating handbook, second edition, 1999.

[14]

L. H. Putnam. Software Cost Estimating and Life-Cycle Control: Getting the Software Numbers, New York. The Institute of Electrical and Electronics Engineers, Inc., 1980.

[15]

J. R. Quinlan. Learning with Continuous Classes. In 5th Australian Joint Conference on Artificial Intelligence, pages 343-348, 1992. Available from http://citeseer.nj.nec.com/quinlan92learning.html.

[16]

I. H. Witten and E. Frank. Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann, 1999.

Cited By

Idri A, Cherradi S (2016). Improving effort estimation of Fuzzy Analogy using feature subset selection. 2016 IEEE Symposium Series on Computational Intelligence (SSCI), pages 1-8. Online publication date: Dec-2016.
https://doi.org/10.1109/SSCI.2016.7849928
Turhan B, Bener A, Menzies T (2010). Regularities in learning defect predictors. Proceedings of the 11th International Conference on Product-Focused Software Process Improvement, pages 116-130. Online publication date: 21-Jun-2010.
https://dl.acm.org/doi/10.1007/978-3-642-13792-1_11
Lum K, Baker D, Hihn J (2008). The effects of data mining techniques on software cost estimation. 2008 IEEE International Engineering Management Conference, pages 1-5. Online publication date: Jun-2008.
https://doi.org/10.1109/IEMCE.2008.4617949

Published In

ASE '05: Proceedings of the 20th IEEE/ACM International Conference on Automated Software Engineering

November 2005

482 pages

ISBN:1581139934

DOI:10.1145/1101908

General Chair: David Redmiles (University of California, Irvine, CA)

Program Chairs: Tom Ellman (Vassar College), Andrea Zisman (City University, UK)

Copyright © 2005 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Article

Conference

ASE05

ASE05: International Conference on Automated Software Engineering 2005

November 7 - 11, 2005

Long Beach, CA, USA

Acceptance Rates

Overall Acceptance Rate 82 of 337 submissions, 24%
