ACM Home Page
Please provide us with feedback. Feedback
Top-k subgraph matching query in a large graph
Full text PdfPdf (900 KB)
Source
Conference on Information and Knowledge Management archive
Proceedings of the ACM first Ph.D. workshop in CIKM table of contents
Lisbon, Portugal
SESSION: Session 4 table of contents
Pages 139-146  
Year of Publication: 2007
ISBN:978-1-59593-832-9
Authors
Lei Zou  Huazhong University of Science and Technology, Wuhan, China
Lei Chen  Hong Kong University of Science and Technology, Hong Kong, Hong Kong
Yansheng Lu  Huazhong University of Science and Technology, Wuhan, China
Sponsors
SIGWEB: ACM Special Interest Group on Hypertext, Hypermedia, and Web
SIGIR: ACM Special Interest Group on Information Retrieval
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 176,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1316874.1316897
What is a DOI?

ABSTRACT

Recently, due to its wide applications, subgraph search has attracted a lot of attention from database and data mining community. Sub-graph search is defined as follows: given a query graph Q, we report all data graphs containing Q in the database. However, there is little work about sub-graph search in a single large graph, which has been used in many applications, such as biological network and social network.

In this paper, we address top-k sub-graph matching query problem, which is defined as follows: given a query graph Q, we locate top-k matchings of Q in a large data graph G according to a score function. The score function is defined as the sum of the pairwise similarity between a vertex in Q and its matching vertex in G. Specifically, we first design a balanced tree (that is G-Tree) to index the large data graph. Then, based on G-Tree, we propose an efficient query algorithm (that is Ranked Matching algorithm). Our extensive experiment results show that, due to efficiency of pruning strategy, given a query with up to 20 vertices, we can locate the top-100 matchings in less than 10 seconds in a large data graph with 100K vertices. Furthermore, our approach outperforms the alternative method by orders of magnitude.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
D. Cai, Z. Shao, X. He, X. Yan, and J. Han. Community mining from multi-relational networks. In PKDD, 2005.
3
 
4
D. Conte, P. Foggia, C. Sansone, and M. Vento. Thirty years of graph matching in pattern recognition. IJPRAI, 18(3), 2004.
 
5
 
6
J. H. D. W. Williams and W. Wang. Graph database indexing using structured graph decomposition. In ICDE, 2007.
7
 
8
S. Fortin. The graph isomorphism problem. Department of Computing Science, University of Alberta, 1996.
 
9
P. Y. H. Jiang, H. Wang and S. Zhou. Gstring: A novel approach for efficient search in graph databases. In ICDE, 2007.
 
10
T. R. Hagadone. Molecular substructure similarity searching: Efficient retrieval in two-dimensional structure databases. J.Chem. Inf. Comput. Sci, 32, 1992.
 
11
 
12
 
13
 
14
 
15
A. Ng, M. Jordan, and Y. Weiss. On spectral clustering: Analysis and an algorithm. In NIPS, 2001.
 
16
E. Ravasz and A.-L. Barabasi. Hierarchical organization in complex networks. Physical Review E, 67, 2003.
 
17
A. SF, G. W, M. W, M. EW, and L. DJ. Basic local alignment search tool. J Mol Biol., 215(3), 1990.
18
19
 
20
D. J. Watts. Six Degrees:The Science of a Connected Age. W. W. Norton & Company, 2004.
 
21
P. Willett. Chemical similarity searching. J. Chem. Inf. Comput. Sci, 38(6), 1998.
22
 
23
Q. Yang and S. hoi Sze. Path matching and graph matching in biological networks. J Comput Biol., 14(1), 2007.
 
24
S. Zhang, M. Hu, and J. Yang. Treepi: A novel graph indexing method. In ICDE, 2007.
 
25

Collaborative Colleagues:
Lei Zou: colleagues
Lei Chen: colleagues
Yansheng Lu: colleagues