ACM Home Page
Please provide us with feedback. Feedback
Clustering pair-wise dissimilarity data into partially ordered sets
Full text PdfPdf (312 KB)
Source Conference on Knowledge Discovery in Data archive
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining table of contents
Philadelphia, PA, USA
POSTER SESSION: Research track posters table of contents
Pages: 637 - 642  
Year of Publication: 2006
ISBN:1-59593-339-5
Authors
Jinze Liu  University of North Carolina, Chapel Hill, NC
Qi Zhang  University of North Carolina, Chapel Hill, NC
Wei Wang  University of North Carolina, Chapel Hill, NC
Leonard McMillan  University of North Carolina, Chapel Hill, NC
Jan Prins  University of North Carolina, Chapel Hill, NC
Sponsors
ACM: Association for Computing Machinery
SIGKDD: ACM Special Interest Group on Knowledge Discovery in Data
SIGMOD: ACM Special Interest Group on Management of Data
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 9,   Downloads (12 Months): 95,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1150402.1150480
What is a DOI?

ABSTRACT

Ontologies represent data relationships as hierarchies of possibly overlapping classes. Ontologies are closely related to clustering hierarchies, and in this article we explore this relationship in depth. In particular, we examine the space of ontologies that can be generated by pairwise dissimilarity matrices. We demonstrate that classical clustering algorithms, which take dissimilarity matrices as inputs, do not incorporate all available information. In fact, only special types of dissimilarity matrices can be exactly preserved by previous clustering methods. We model ontologies as a partially ordered set (poset) over the subset relation. In this paper, we propose a new clustering algorithm, that generates a partially ordered set of clusters from a dissimilarity matrix.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
M. Ashburner, C. A. Ball, J. A. Blake, D. Botstein, H. Butler, J. M. Cherry, A. P. Davis, K. Dolinski, S. S. Dwight, J. T. Eppig, M. A. Harris, D. P. Hill, L. Issel-Tarver, A. Kasarskis, S. Lewis, J. C. Matese, J. E. Richardson, M. Ringwald, G. M. Rubin, G. Sherlock: Gene Ontology: tool for the unification of biology. Nat Genet 2000, 25:25--29.
 
2
Applications of the pyramidal clustering method to biological objects. Comput Chem, 23(3-4):303--15, Jun 15, 1999.
 
3
P. Berkhin. Survey of clustering data mining techniques https://umdrive.memphis.edu/vphan/public/berkhin-survey.pdf, Accrue Software, 2002.
 
4
5
 
6
Budanitsky, A., and G. Hirst, "Semantic Distance in WordNet: An Experimental, Application-oriented Evaluation of Five Measures", Workshop on WordNet and Other Lexical Resources, in the North American Chapter of the Association for Computational Linguistics (NAACL-2001), Pittsburgh, PA, June 2001.
 
7
E. Diday, Orders and overlapping clusters in pyramids. In: J. De Leeuw et al. Multidimensional Data Analysis, DSWO Press, Leiden (1986), pp. 201--234.
 
8
 
9
L. K. Hua, Introduction to Number Theory. Springer-Verlag, New York, 1982.
 
10
 
11
 
12
 
13
R. M. Karp Reducibility among combinatorial problems. Complexity of computer computations, Plenum Press, New York, pp.85--103, 1972.
 
14
L. Kaufman and P. Rousseeuw, Finding groups in data: an introduction to cluster analysis. New York: John Wiley and Sons, 1990.
 
15
 
16
P. W. Lord, R. Stevens, A. Brass, and C. A.Goble. Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation. Bioinformatics, 19(10):1275--83, 2003.
 
17
W. T. McCormick, P. J. Schweitzer, and T. W. White. Problem decomposition and data reorganization by a clustering technique. Operations Research, 20:993--1009, 1972.
 
18
P. T. Spellman, G. Sherlock, M. Q.Zhang, V. R. Lyer, K. Anders, M. B. Eisen, P. O. Brown, D. Botstein, and Futcher. Comprehensive identification of cell cycle-regulated genes of the yeast sacccharomyces cerevisiae by microaray hybidization. Molecular Biology of the Cell, 9:3273--2297, 1998.
 
19
S. Tavazoie, J. D. Hughes, M. J. Campbell, R. J. Cho and G. Church. Systematic determination of genetic network architecture. Nature Genetics 22: 281--285, 1999.
 
20

Collaborative Colleagues:
Jinze Liu: colleagues
Qi Zhang: colleagues
Wei Wang: colleagues
Leonard McMillan: colleagues
Jan Prins: colleagues