Article (DOI: 10.1145/1242572.1242724)

A cautious surfer for PageRank

Published: 08 May 2007

Abstract

This work proposes a novel cautious surfer to incorporate trust into the process of calculating authority for web pages. We evaluate a total of sixty queries over two large, real-world datasets to demonstrate that incorporating trust can improve PageRank's performance.
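
The abstract does not spell out the model, so the following is only a minimal sketch of one way trust could enter a PageRank-style random walk, not the authors' formulation. It assumes a per-page trust score in [0, 1], a surfer who follows a link out of page u with probability alpha * trust[u] and otherwise jumps, and jump targets chosen in proportion to trust. The function name `cautious_pagerank`, its parameters, and the toy graph are all hypothetical.

```python
def cautious_pagerank(links, trust, alpha=0.85, iters=100):
    """Trust-modulated PageRank sketch (illustrative assumption, not the paper's exact model).

    links: dict mapping each page to a list of pages it links to
           (every link target must also be a key of `links`).
    trust: dict mapping each page to a trust score in [0, 1].
    """
    nodes = sorted(links)
    n = len(nodes)
    total_trust = sum(trust[v] for v in nodes)
    # Jump targets are chosen in proportion to trust (uniform if all trust is zero).
    jump = {v: (trust[v] / total_trust if total_trust > 0 else 1.0 / n) for v in nodes}
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iters):
        new = {v: 0.0 for v in nodes}
        jump_mass = 0.0
        for u in nodes:
            out = links[u]
            # The less a page is trusted, the less likely the surfer follows its links.
            follow = alpha * trust[u] if out else 0.0
            for v in out:
                new[v] += rank[u] * follow / len(out)
            jump_mass += rank[u] * (1.0 - follow)
        for v in nodes:
            new[v] += jump_mass * jump[v]
        rank = new
    return rank


# Toy graph: two trusted pages and a two-page link farm that only links to itself.
links = {"a": ["b"], "b": ["a"], "s1": ["s2"], "s2": ["s1"]}
trust = {"a": 1.0, "b": 1.0, "s1": 0.1, "s2": 0.1}
scores = cautious_pagerank(links, trust)
baseline = cautious_pagerank(links, {v: 1.0 for v in links})
```

With every trust score set to 1 the walk reduces to ordinary PageRank with uniform jumps, so `baseline` gives a point of comparison: on this toy graph the link-farm pages score lower under the trust-modulated walk than under the baseline.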

Cited By

  • (2012) Detecting Fake Medical Web Sites Using Recursive Trust Labeling. ACM Transactions on Information Systems, 30(4):1--36. DOI: 10.1145/2382438.2382441. Online publication date: 1-Nov-2012
  • (2009) A Parameterized Approach to Spam-Resilient Link Analysis of the Web. IEEE Transactions on Parallel and Distributed Systems, 20(10):1422--1438. DOI: 10.1109/TPDS.2008.227. Online publication date: 1-Oct-2009
  • (2007) Winnowing wheat from the chaff. Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 869--870. DOI: 10.1145/1277741.1277950. Online publication date: 23-Jul-2007

    Published In

    WWW '07: Proceedings of the 16th international conference on World Wide Web
    May 2007
    1382 pages
    ISBN: 9781595936547
    DOI: 10.1145/1242572
    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Author Tags

    1. authority
    2. ranking performance
    3. spam
    4. trust
    5. web search engine

    Qualifiers

    • Article

    Conference

    WWW'07
    WWW'07: 16th International World Wide Web Conference
    May 8 - 12, 2007
    Banff, Alberta, Canada

    Acceptance Rates

    Overall Acceptance Rate 1,899 of 8,196 submissions, 23%
