| A proposal for using continuous attributes in classification trees |
| Full text |
Pdf
(155 KB)
|
| Source
|
ACM International Conference Proceeding Series; Vol. 27
archive
Proceedings of the 14th international conference on Software engineering and knowledge engineering
table of contents
Ischia, Italy
SESSION: Measurement and empirical software engineering
table of contents
Pages: 417 - 424
Year of Publication: 2002
ISBN:1-58113-556-4
|
|
Author
|
|
Sandro Morasca
|
Universitàdegli Studi dell'Insubria, Via Valleggio 11, I-22100 Como, Italy
|
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 11, Downloads (12 Months): 38, Citation Count: 0
|
|
|
ABSTRACT
Classification trees have been successfully used in several application fields. However, continuous attributes cannot be used directly when building classification trees, but they must be first discretized with clustering techniques, which require some degree of subjectivity. We propose an approach to build classification trees that does not require the discretization of the continuous attributes. The approach is an extension of existing methods for building classification trees and is based on the information gain yielded by discrete and continuous attributes. Data from a software development case study are analyzed with both the proposed approach and C4.5 to show the approach's applicability and benefits over C4.5.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
L. Breiman, J. H. Friedman, R. A. Olshen, and C. J. Stone, "Classification and Regression Trees," Monterey, U.S.A.: Wadsworth, Inc., 1984.
|
| |
2
|
|
| |
3
|
|
| |
4
|
B. Everitt, "Cluster Analysis - 3rd Edition," E. Arnold Press, London., 1993.
|
| |
5
|
D. Hosmer and S. Lemeshow, "Applied Logistic Regression," Wiley-Interscience Publication, 1989.
|
| |
6
|
L. Kaufman and P. J. Rousseeuw, "Finding Groups in Data: An Introduction to Cluster Analysis," Wiley-Interscience Publication, 1990.
|
| |
7
|
M. Kendall, A. Stuart. The Advanced Theory of Statistics (4th ed). Vol. 1, Charles Griffin & Co., London, UK, 1977.
|
| |
8
|
|
| |
9
|
|
| |
10
|
|
Peer to Peer - Readers of this Article have also read:
-
Data structures for quadtree approximation and compression
Communications of the ACM
28, 9
Hanan Samet
-
A hierarchical single-key-lock access control using the Chinese remainder theorem
Proceedings of the 1992 ACM/SIGAPP Symposium on Applied computing
Kim S. Lee
, Huizhu Lu
, D. D. Fisher
-
The GemStone object database management system
Communications of the ACM
34, 10
Paul Butterworth
, Allen Otis
, Jacob Stein
-
Putting innovation to work: adoption strategies for multimedia communication systems
Communications of the ACM
34, 12
Ellen Francik
, Susan Ehrlich Rudman
, Donna Cooper
, Stephen Levine
-
An intelligent component database for behavioral synthesis
Proceedings of the 27th ACM/IEEE conference on Design automation
Gwo-Dong Chen
, Daniel D. Gajski
|