skip to main content
10.1145/1076034.1076105acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

A study of relevance propagation for web search

Published:15 August 2005Publication History

ABSTRACT

Different from traditional information retrieval, both content and structure are critical to the success of Web information retrieval. In recent years, many relevance propagation techniques have been proposed to propagate content information between web pages through web structure to improve the performance of web search. In this paper, we first propose a generic relevance propagation framework, and then provide a comparison study on the effectiveness and efficiency of various representative propagation models that can be derived from this generic framework. We come to many conclusions that are useful for selecting a propagation model in real-world search applications, including 1) sitemap-based propagation models outperform hyperlink-based models in sense of both effectiveness and efficiency, and 2) sitemap-based term propagation is easier to be integrated into real-world search engines because of its parallel offline implementation and acceptable complexity. Some other more detailed study results are also reported in the paper.

References

  1. Amento, B., Terveen, L., and Hill, W. Does "Authority" Mean Quality? Predicting Expert Quality Ratings of Web Pages. In Proc. ACM SIGIR 2000, pages 296--303. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Amitay, E., Carmel, D., Darlow, A., Lempel, R., and Soffer, A. Topic Distillation with Knowledge Agents, in the 11th TREC, 2002.Google ScholarGoogle Scholar
  3. Baeza-Yates, R., Ribeiro-Neto, B. Modern Information Retrieval, Addison Wesley, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Bharat, K., and Henzinger, M. R. Improved Algorithms for Topic Distillation in a Hyperlinked Environment. In Proceedings of the ACM-SIGIR, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Bharat, K., and Mihaila, G. A. When Experts Agree: Using Non-affiliated Experts to Rank Popular Topics. In 10th WWW, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Brin, S., and Page, L. The Anatomy of a Large Scale Hypertextual Web Search Engine, Proc. 7th WWW, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Broder, A. A Taxonomy of Web Search. SIGIR Forum 36(2), 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Chakrabarti, S. Integrating the Page Object Model with hyperlinks for enhanced topic distillation and information extraction, In the 10th WWW, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Chakrabarti, S., Joshi, M., and Tawde, V. Enhanced Topic Distillation Using Text, Markup Tags, and Hyperlinks, In Proceedings of the 24th ACM SIGIR, 2001, pp. 208--216. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Craswell, N., Hawking, D. Overview of the TREC 2003 Web Track, in the 12th TREC, 2003.Google ScholarGoogle Scholar
  11. Craswell, N., Hawking, D. Overview of the TREC 2004 Web Track, in the 13th TREC, 2004.Google ScholarGoogle Scholar
  12. Feng, G., Liu, T. Y., Zhang, X. D., Qin. T., Gao, B., Ma, W. Y. Level-Based Link Analysis, in the 7th APWeb, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Haveliwala, T.H. Topic-Sensitive Pagerank. In Proc. of the 11th WWW, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Hawking, D. Overview of the TREC-9 Web Track, in the 9th TREC, 2000.Google ScholarGoogle Scholar
  15. Ingongngam, P., and Rungsawang, A. Report on the TREC 2003 Experiments Using Web Topic-Centric Link Analysis, in the 12th TREC, 2003.Google ScholarGoogle Scholar
  16. Kamvar, S. D., Haveliwala, T. H., Manning, C. D., Golub, G. H. Exploiting the Block Structure of the Web for Computing PageRank, In Proc. of the 13th WWW, 2003.Google ScholarGoogle Scholar
  17. Kleinberg, J. Authoritative Sources in a Hyperlinked Environment, Journal of the ACM, Vol. 46, No. 5, pp. 604--622, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Mcbryan, O. GENVL and WWWW: Tools for Taming the Web. In Proceedings of the 1st WWW, 1994.Google ScholarGoogle Scholar
  19. Page, L., Brin, S., Motwani, R., and Winograd, T. The PageRank Citation Ranking: Bringing Order to the Web, Technical report, Stanford University, Stanford, CA, 1998.Google ScholarGoogle Scholar
  20. Robertson, S. E. Overview of the Okapi Projects, Journal of Documentation, Vol. 53, No. 1, 1997, pp. 3--7.Google ScholarGoogle ScholarCross RefCross Ref
  21. Robertson, S. E., and Sparck Jones, K. Relevance Weighting of Search Terms, Journal of the American Society of Information Science, Vol. 27, No. May-June, 1976, pp. 129--146.Google ScholarGoogle ScholarCross RefCross Ref
  22. Shakery, A., Zhai, C. X. Relevance Propagation for Topic Distillation UIUC TREC 2003 Web Track Experiments, in the 12th TREC, 2003.Google ScholarGoogle Scholar
  23. Song, R., Wen, J. R., Shi, S. M., Xin, G. M., Liu, T. Y., Qin, T., Zheng, X., Zhang, J. Y., Xue, G. R., and Ma, W. Y. Microsoft Research Asia at Web Track and Terabyte Track of TREC 2004, in the 13th TREC, 2004.Google ScholarGoogle Scholar

Index Terms

  1. A study of relevance propagation for web search

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
          August 2005
          708 pages
          ISBN:1595930345
          DOI:10.1145/1076034

          Copyright © 2005 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 15 August 2005

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate792of3,983submissions,20%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader