short-paper

User guided entity similarity search using meta-path selection in heterogeneous information networks

Published: 29 October 2012

Abstract

With the emergence of web-based social and information applications, entity similarity search in information networks, which aims to find entities with high similarity to a given query entity, has gained wide attention. However, because heterogeneous information networks contain multi-typed entities and relationships and therefore carry diverse semantic meanings, similarity measurement can be ambiguous without context. In this paper, we investigate entity similarity search and the resulting ambiguity problems in heterogeneous information networks. We propose to use a meta-path-based ranking model ensemble to represent the semantic meanings of similarity queries, and we exploit the possibility of using user guidance to understand the user's query. Experiments on real-world datasets show that our framework significantly outperforms competitor methods.
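The abstract does not spell out the ranking model, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a PathSim-style score for each symmetric meta-path and a plain weighted sum as the ensemble. The toy author/paper/venue network, the two meta-paths (author-paper-author and author-paper-venue-paper-author), and the hand-set weight vectors are all hypothetical; in the paper's setting the weighting would presumably be informed by user guidance.

```python
# Illustrative sketch only: the network, meta-paths, and weights are assumptions,
# not data or parameters taken from the paper.

def matmul(a, b):
    """Multiply two dense matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    """Transpose a dense matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def pathsim(commuting):
    """PathSim-style score for a symmetric meta-path with commuting matrix M:
    s(x, y) = 2 * M[x][y] / (M[x][x] + M[y][y])."""
    n = len(commuting)
    scores = [[0.0] * n for _ in range(n)]
    for x in range(n):
        for y in range(n):
            denom = commuting[x][x] + commuting[y][y]
            scores[x][y] = 2 * commuting[x][y] / denom if denom else 0.0
    return scores

authors = ["Ann", "Bob", "Cat"]  # hypothetical entities

# Author-paper adjacency: rows are authors, columns are papers p0..p3.
AP = [
    [1, 1, 0, 0],  # Ann wrote p0 and p1
    [1, 0, 1, 0],  # Bob wrote p0 and p2
    [0, 0, 0, 1],  # Cat wrote p3
]
# Paper-venue adjacency: rows are papers, columns are venues v0..v2.
PV = [
    [1, 0, 0],  # p0 appeared in v0
    [0, 1, 0],  # p1 appeared in v1
    [0, 0, 1],  # p2 appeared in v2
    [0, 1, 0],  # p3 appeared in v1
]

# Commuting matrices count path instances between author pairs.
APA = matmul(AP, transpose(AP))    # co-authorship meta-path
AV = matmul(AP, PV)
APVPA = matmul(AV, transpose(AV))  # shared-venue meta-path

per_path_scores = [pathsim(APA), pathsim(APVPA)]

def rank(query, weights):
    """Rank the other authors by a weighted sum of per-meta-path scores."""
    q = authors.index(query)
    combined = {
        name: sum(w * s[q][i] for w, s in zip(weights, per_path_scores))
        for i, name in enumerate(authors)
        if i != q
    }
    return sorted(combined, key=combined.get, reverse=True)

print(rank("Ann", [1.0, 0.0]))  # weight only the co-authorship meta-path
print(rank("Ann", [0.0, 1.0]))  # weight only the shared-venue meta-path
```

The two calls rank the same candidates differently for the same query entity, which is the kind of ambiguity the abstract describes: "similar to Ann" depends on which meta-path the user has in mind.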


    Published In

    CIKM '12: Proceedings of the 21st ACM international conference on Information and knowledge management
    October 2012
    2840 pages
    ISBN:9781450311564
    DOI:10.1145/2396761

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. entity similarity search
    2. heterogeneous information network
    3. user guided

    Qualifiers

    • Short-paper

    Conference

    CIKM'12

    Acceptance Rates

    Overall Acceptance Rate 1,861 of 8,427 submissions, 22%
