skip to main content
10.1145/1159733.1159746acmconferencesArticle/Chapter ViewAbstractPublication PagesesemConference Proceedingsconference-collections
Article

A comparative study of attribute weighting heuristics for effort estimation by analogy

Published: 21 September 2006 Publication History

Abstract

Five heuristics for attribute weighting in analogy-based effort estimation are evaluated in this paper. The baseline heuristic involves using all attributes with equal weights. We propose four additional heuristics that use rough set analysis for attribute weighting. These five heuristics are evaluated over five data sets related to software projects. Three of the data sets are publicly available, hence allowing comparison with other methods. The results indicate that three of the rough set analysis based heuristics perform better than the equal weights heuristic. This evaluation is based on an integrated measure of accuracy.

References

[1]
Cartwright, M., Shepperd, M., and Song, Q. Dealing with Missing Software Project Data, Proc. of the 9th International Symposium on Software Metrics, September 2003, 154--165.
[2]
Chen, Z., Boehm, B., Menzies, T., and Port, D. Finding the Right Data for Software Cost Modeling, IEEE Software, 22(2005) 38--46.
[3]
Chmielewski, M. R., and Grzymala-Busse, J. W. Global Discretization of Continuous Attributes As Preprocessing for Machine Learning, Third International Workshop on Rough Sets and Soft Computing, 1994, 294--301.
[4]
Conte, S. D., Dunsmore, H., and Shen, V.Y. Software engineering metrics and models, Benjamin-Cummings Publishing Co. Inc., 1986.
[5]
Dougherty, J., Kohavi, R., and Sahami, M. Supervised and Unsupervised Discretization of Continuous Features, Proc. of International Conference on Machine Learning, 1995, 194--202.
[6]
Efron, B., and Gong, G. A Leisurely Look at the Bootstrap, the Jackknife, and Cross-Validation, The American Statistician. 37(1983): 36--48.
[7]
IDSS, ROSE2, Institute of Computing Science, PoznaN University of Technology, http://idss.cs.put.poznan.pl/site/rose.html, January 2006.
[8]
ISBSG, International Software Benchmark and Standards Group, Data R8, www.isbsg.org, January 2006.
[9]
Kadoda, G., Michelle, C., Chen, L., and Shepperd, M. Experiences Using Case-Based Reasoning to Predict Software Project Effort, Proc. of EASE'2000 - Fourth International Conference on Empirical Assessment and Evaluation in Software Engineering, January 2000.
[10]
Kemerer, C. F. An Empirical Validation of Software Cost Estimation Models, Communication of the ACM, 30(1987) 436--445.
[11]
Kirsopp, C., and Shepperd, M. Case and Feature Subset Selection in Case-Based Software Project Effort Prediction, Proc. 22nd SGAI Int'l Conf. Knowledge-Based Systems and Applied Artificial Intelligence, December 2002.
[12]
Kohavi, R., and John, G. H. Wrappers for Feature Subset Selection, Artificial Intelligence 97(1997) 273--324.
[13]
Laplante, P. A., and Neil, C. J. Modeling Uncertainty in Software Engineering Using Rough Sets, Innovations in Systems and Software Engineering, 1(2005) 71--78.
[14]
Leung, H. K. N. Estimating Maintenance Effort by Analogy, Empirical Software Engineering, 7(2002) 157--175.
[15]
Li, J., Ruhe, G., Al-Emran, A., and Richter, M. M. A Flexible Method for Effort Estimation by Analogy, Empirical Software Engineering, (2006)
[16]
Li, J., and Ruhe, G. Attribute Selection and Weighting Using Rough Sets for Effort Estimation by Analogy--Initial Results, Technical Report SEDS-TR-05102, Software Engineering Decision Support Laboratory at the University of Calgary, Canada, October 2005.
[17]
Mendes, E., Watson, I., Chris T., Nile M., and Steve, C. A Comparative Study of Cost Estimation Models for Web Hypermedia Applications, Empirical Software Engineering, 8(2003) 163--196.
[18]
Mukhopadhyay, T., Vicinanza, S., and Prietula, M. J. Examining the Feasibility of a Case-based Reasoning Model for Software Effort Estimation. MIS Quarterly, 16, 2 (1992) 155--171.
[19]
Myrtveit, I., Stensrud, E., and Olsson, U. H. Analyzing Data Sets with Missing Data: An Empirical Evaluation of Imputation Methods and Likelihood-Based Methods, IEEE Transactions on Software Engineering, 27, 11(2001) 999--1013.
[20]
Pawlak, Z. Rough Sets: Theoretical Aspects of Reasoning about Data, Kluwer Acdemic Publishers, 1991.
[21]
Ruhe, G. Rough sets Based Data Analysis in Goal Oriented Software Measurement, Proc. of the third International Symposium on Software Metrics, March 1996, 10--19.
[22]
Shepperd, M., and Schofield,C. Estimating Software Project Effort Using Analogies, IEEE Transactions on Software Engineering, 23(1997) 736--743.
[23]
Song, Q., Shepperd, M., and Mair, C. Using Grey Relational Analysis to Predict Software Effort with Small Data Sets, METRICS'05: Proceedings of the 11th IEEE International Software Metrics Symposium. Como, Italy, 2005, 35--45.
[24]
Strike, K. et al. Software Cost Estimation with Incomplete Data, IEEE Transactions on Software Engineering, 27, 10 (October 2001) 890--908.
[25]
Zhang, M., and Yao, J. A Rough Sets Based Approach to Feature Selection, Proc. of the 23rd International Conference of NAFIPS, June 2004, 434--439.
[26]
Zhong, N., and Dong, J. Using Rough Sets with Heuristics for Feature Selection, Journal of Intelligent Information Systems 16(2001) 199--214.

Cited By

View all
  • (2019)A Generic Data Mining Model for Software Cost Estimation Based on Novel Input Selection ProcedureInternational Journal of Information Retrieval Research10.4018/IJIRR.20190101029:1(16-32)Online publication date: 1-Jan-2019
  • (2017)Negative results for software effort estimationEmpirical Software Engineering10.1007/s10664-016-9472-222:5(2658-2683)Online publication date: 1-Oct-2017
  • (2014)A Short Introduction to Computational Trends in Analogical ReasoningComputational Approaches to Analogical Reasoning: Current Trends10.1007/978-3-642-54516-0_1(1-22)Online publication date: 23-Mar-2014
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ISESE '06: Proceedings of the 2006 ACM/IEEE international symposium on Empirical software engineering
September 2006
388 pages
ISBN:1595932186
DOI:10.1145/1159733
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 September 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. attribute
  2. estimation by analogy
  3. missing values
  4. non-quantitative attributes
  5. rough set analysis
  6. selection and weighting
  7. software effort estimation

Qualifiers

  • Article

Conference

ISESE06
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)3
  • Downloads (Last 6 weeks)0
Reflects downloads up to 15 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2019)A Generic Data Mining Model for Software Cost Estimation Based on Novel Input Selection ProcedureInternational Journal of Information Retrieval Research10.4018/IJIRR.20190101029:1(16-32)Online publication date: 1-Jan-2019
  • (2017)Negative results for software effort estimationEmpirical Software Engineering10.1007/s10664-016-9472-222:5(2658-2683)Online publication date: 1-Oct-2017
  • (2014)A Short Introduction to Computational Trends in Analogical ReasoningComputational Approaches to Analogical Reasoning: Current Trends10.1007/978-3-642-54516-0_1(1-22)Online publication date: 23-Mar-2014
  • (2013)Learning Project Management DecisionsIEEE Transactions on Software Engineering10.1109/TSE.2013.4339:12(1698-1713)Online publication date: 1-Dec-2013
  • (2013)Probabilistic size proxy for software effort predictionInformation and Software Technology10.1016/j.infsof.2012.08.00155:2(241-251)Online publication date: 1-Feb-2013
  • (2013)Case-Based ReasoningSoftware Project Effort Estimation10.1007/978-3-319-03629-8_11(305-313)Online publication date: 12-Dec-2013
  • (2012)Functional Link Artificial Neural Networks for Software Cost EstimationInternational Journal of Applied Evolutionary Computation10.4018/jaec.20120401043:2(62-82)Online publication date: 1-Apr-2012
  • (2012)Data Mining Techniques for Software Effort EstimationIEEE Transactions on Software Engineering10.1109/TSE.2011.5538:2(375-397)Online publication date: 1-Mar-2012
  • (2012)Exploiting the Essential Assumptions of Analogy-Based Effort EstimationIEEE Transactions on Software Engineering10.1109/TSE.2011.2738:2(425-438)Online publication date: 1-Mar-2012
  • (2011)Comparison of weighted grey relational analysis for software effort estimationSoftware Quality Journal10.1007/s11219-010-9110-y19:1(165-200)Online publication date: 1-Mar-2011
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media