skip to main content
10.1145/2588555.2612662acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
abstract

Efficient top-K SimRank-based similarity join

Published:18 June 2014Publication History

ABSTRACT

SimRank is an effective and widely adopted measure to quantify the structural similarity between pairs of nodes in a graph. In this paper we study the problem of top-k SimRank-based similarity join, which finds k pairs of nodes with the largest SimRank values. To the best of our knowledge, this is the first attempt to address this problem. We propose a random-walk-based method to efficiently identify top-k pairs. Experiment results on real datasets show that our method significantly outperforms baseline approaches.

References

  1. W. Zheng, L. Zou, Y. Feng, L. Chen, and D. Zhao, "Efficient simrank-based similarity join over large graphs," PVLDB, vol. 6, no. 7, pp. 493--504, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. D. Lizorkin, P. Velikhov, M. N. Grinev, and D. Turdakov, "Accuracy estimate and optimization techniques for simrank computation," VLDB J., vol. 19, no. 1, pp. 45--66, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. W. Yu, X. Lin, and W. Zhang, "Towards efficient simrank computation on large networks," in ICDE, pp. 601--612, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. G. Jeh and J. Widom, "Simrank: a measure of structural-context similarity," in KDD, pp. 538--543, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y. Low and A. X. Zheng, "Fast top-k similarity queries via matrix compression," in CIKM, pp. 2070--2074, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Efficient top-K SimRank-based similarity join

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
      June 2014
      1645 pages
      ISBN:9781450323765
      DOI:10.1145/2588555

      Copyright © 2014 Owner/Author

      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 June 2014

      Check for updates

      Qualifiers

      • abstract

      Acceptance Rates

      SIGMOD '14 Paper Acceptance Rate107of421submissions,25%Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader