ACM Home Page
Please provide us with feedback. Feedback
CLOPE: a fast and effective clustering algorithm for transactional data
Full text PdfPdf (622 KB)
Source International Conference on Knowledge Discovery and Data Mining archive
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Edmonton, Alberta, Canada
POSTER SESSION: Poster papers table of contents
Pages: 682 - 687  
Year of Publication: 2002
ISBN:1-58113-567-X
Authors
Yiling Yang  Shanghai Jiao Tong University, Shanghai, P.R.China
Xudong Guan  Shanghai Jiao Tong University, Shanghai, P.R.China
Jinyuan You  Shanghai Jiao Tong University, Shanghai, P.R.China
Sponsors
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
: AAAI
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 7,   Downloads (12 Months): 48,   Citation Count: 6
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues   peer to peer  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/775047.775149
What is a DOI?

ABSTRACT

This paper studies the problem of categorical data clustering, especially for transactional data characterized by high dimensionality and large volume. Starting from a heuristic method of increasing the height-to-width ratio of the cluster histogram, we develop a novel algorithm -- CLOPE, which is very fast and scalable, while being quite effective. We demonstrate the performance of our algorithm on two real world datasets, and compare CLOPE with the state-of-art algorithms.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

1
2
 
3
Cooley, R., Mohasher, B., and Srivastava, J. Data preparation for mining world wide web browsing patterns. Knowledge and Information Systems, 1(1):5--32, 1999.
 
4
Ester, M., Kriegel, H.-P., Sander, J., and Xu, X. A densitybased algorithm for discovering clusters in large spatial databases with noise. In Proc. KDD'96, Portland, Oregon, 1996.
5
 
6
 
7
 
8
Han. E.H., Karypis G., Kumar, V., and Mobashad, B. Clustering based on association rule hypergraphs. In Proc. SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, 1997.
9
 
10
Huang Z. A fast clustering algorithm to cluster very large categorical data sets in data mining. In Proc. SIGMOD Workshop on Research Issues on Data Mining and Knowledge Discovery, 1997.
 
11
MacQueen, J.B. Some methods for classification and analysis of multivariate observations. In Proc. 5th Berkeley Symposium on Math. Stat. and Prob., 1967.
 
12
13
14
 
15
Zhao, Y. and Karypis, G. Criterion functions for document clustering: experiments and analysis. Tech. Report #01--40, Department of Comp. Sci. & Eng., U. Minnesota, 2001. Avaliable as: http://www-users.itlabs.umn.edu/~karypis/publications/Papers/Postscript/vscluster.ps


Collaborative Colleagues:
Yiling Yang: colleagues
Xudong Guan: colleagues
Jinyuan You: colleagues

Peer to Peer - Readers of this Article have also read: