ABSTRACT
Maximizing only the relevance between queries and documents will not satisfy users if they want the top search results to present a wide coverage of topics by a few representative documents. In this paper, we propose two new metrics to evaluate the performance of information retrieval: diversity, which measures the topic coverage of a group of documents, and information richness, which measures the amount of information contained in a document. Then we present a novel ranking scheme, Affinity Rank, which utilizes these two metrics to improve search results. We demonstrate how Affinity Rank works by a toy data set, and verify our method by experiments on real-world data sets.
- Page, L., Brin, S., Motwani, R. and Windograd, T. The pagerank citation ranking: Bring order to the web, Stanford Digital Library Technologies Project, 1998.Google Scholar
- Kleinberg, J. M. Authoritative sources in a hyperlinked environment. Journal of the ACM (JACM), 46 (5). 604--632. Google ScholarDigital Library
- Meila, M. and Shi, J., A random walks view of spectral segmentation. In Proceedings of the International Workshop on AI and Statistics(AISTATS), (Florida, 2001), 177--182.Google Scholar
- He, X., Ma, W.-Y. and Zhang, H.-J., Spectral Techniques for Structural Analysis of Image Database. In Proceedings of the 2003 International Conference on Multimedia and Expo, (Baltimore, 2003), 25--28. Google ScholarDigital Library
Index Terms
- Affinity rank: a new scheme for efficient web search
Recommendations
Improving web search results using affinity graph
SIGIR '05: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrievalIn this paper, we propose a novel ranking scheme named Affinity Ranking (AR) to re-rank search results by optimizing two metrics: (1) diversity -- which indicates the variance of topics in a group of documents; (2) information richness -- which measures ...
Improving product review search experiences on general search engines
ICEC '09: Proceedings of the 11th International Conference on Electronic CommerceIn the Web 2.0 era, internet users contribute a large amount of online content. Product review is a good example. Since these phenomena are distributed all over shopping sites, weblogs, forums etc., most people have to rely on general search engines to ...
Rank-Stability and Rank-Similarity of Link-Based Web Ranking Algorithms in Authority-Connected Graphs
AbstractWeb search algorithms that rank Web pages by examining the link structure of the Web are attractive from both theoretical and practical aspects. Today’s prevailing link-based ranking algorithms rank Web pages by using the dominant eigenvector of ...
Comments