| Discovering frequent topological structures from graph datasets |
| Full text |
Pdf
(1.41 MB)
|
| Source
|
Conference on Knowledge Discovery in Data
archive
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
table of contents
Chicago, Illinois, USA
POSTER SESSION: Research track poster
table of contents
Pages: 606 - 611
Year of Publication: 2005
ISBN:1-59593-135-X
|
|
Authors
|
|
R. Jin
|
Ohio State University, Columbus, OH
|
|
C. Wang
|
Ohio State University, Columbus, OH
|
|
D. Polshakov
|
Ohio State University, Columbus, OH
|
|
S. Parthasarathy
|
Ohio State University, Columbus, OH
|
|
G. Agrawal
|
Ohio State University, Columbus, OH
|
|
| Sponsors |
|
| Publisher |
|
| Bibliometrics |
Downloads (6 Weeks): 9, Downloads (12 Months): 96, Citation Count: 1
|
|
|
ABSTRACT
The problem of finding frequent patterns from graph-based datasets is an important one that finds applications in drug discovery, protein structure analysis, XML querying, and social network analysis among others. In this paper we propose a framework to mine frequent large-scale structures, formally defined as frequent topological structures, from graph datasets. Key elements of our framework include, fast algorithms for discovering frequent topological patterns based on the well known notion of a topological minor, algorithms for specifying and pushing constraints deep into the mining process for discovering constrained topological patterns, and mechanisms for specifying approximate matches when discovering frequent topological patterns in noisy datasets. We demonstrate the viability and scalability of the proposed algorithms on real and synthetic datasets and also discuss the use of the framework to discover meaningful topological structures from protein structure data.
REFERENCES
Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.
| |
1
|
Reinhard Diestel. Graph Theory. Springer-Verlag, 2000.
|
| |
2
|
H. Hofer, C. Borgelt, and M. R. Berthold. Large scale mining of molecular fragments with wildcards. In Advances in Intelligent Data Analysis V, pages 380--389, 2003.
|
 |
3
|
Jun Huan , Wei Wang , Jan Prins , Jiong Yang, SPIN: mining maximal frequent subgraphs from graph databases, Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining, August 22-25, 2004, Seattle, WA, USA
[doi> 10.1145/1014052.1014123]
|
| |
4
|
|
| |
5
|
Ruoming Jin, Chao Wang, Dimitrii Polshakov, Srinivasan Parthasarathy, and Gagan Agrawal. Discovering frequent topological structures from graph datasets. Technical report, CSE, Ohio State University.
|
| |
6
|
|
| |
7
|
Thorsen Meinl, Christian Borgelt, Michael R. Berthold, and Michael Philippsen. Mining fragments with fuzzy chains in molecular databases. In Second International Workshop on Mining Graphs, Trees and Sequences (MGTS2004), 2004.
|
 |
8
|
|
| |
9
|
H. Palsdottir and C. Hunte. Lipids in membrane protein structures. BBA, 1666:2--18, 2004.
|
| |
10
|
|
| |
11
|
D. Polshakov, K. Marsolo, and S. Parthasarathy. Mining 3d-motifs using phisical-chemical constraints: application to cardiolipin binding sites. In ISMB, 2005.
|
| |
12
|
A. Srinivasan, R.D. King, S.H. Muggleton, and M. Sternberg. The predictive toxicology evaluation challenge. In IJCAI, pages 1--6, 1997.
|
| |
13
|
|
CITED BY
|
Hanghang Tong , Christos Faloutsos , Brian Gallagher , Tina Eliassi-Rad, Fast best-effort pattern matching in large attributed graphs, Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining, August 12-15, 2007, San Jose, California, USA
|
|