ACM Home Page
Please provide us with feedback. Feedback
Combining link and content analysis to estimate semantic similarity
Full text PdfPdf (260 KB)
Source International World Wide Web Conference archive
Proceedings of the 13th international World Wide Web conference on Alternate track papers & posters table of contents
New York, NY, USA
POSTER SESSION: Posters table of contents
Pages: 452 - 453  
Year of Publication: 2004
ISBN:1-58113-912-8
Author
Filippo Menczer  Indiana University, Bloomington, IN
Sponsor
ACM: Association for Computing Machinery
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 39,   Citation Count: 3
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1013367.1013521
What is a DOI?

ABSTRACT

Search engines use content and link information to crawl, index, retrieve, and rank Web pages. The correlations between similarity measures based on these cues and on semantic associations between pages therefore crucially affects the performance of any search tool. Here I begin to quantitatively analyze the relationship between content, link, and semantic similarity measures across a massive number of Web page pairs. Maps of semantic similarity across textual and link similarity highlight the potential and limitations of lexical and link analysis for relevance approximation, and provide us with a way to study whether and how text and link based measures should be combined.