ABSTRACT
In recent years, researchers have investigated search result diversification through a variety of approaches. In such situations, information retrieval systems need to consider both aspects of relevance and diversity for those retrieved documents. On the other hand, previous research has demonstrated that data fusion is useful for improving performance when we are only concerned with relevance. However, it is not clear if it helps when both relevance and diversity are both taken into consideration. In this short paper, we propose a few data fusion methods to try to improve performance when both relevance and diversity are concerned. Experiments are carried out with 3 groups of top-ranked results submitted to the TREC web diversity task. We find that data fusion is still a useful approach to performance improvement for diversity as for relevance previously.
- E. Aktolga and J. Allan. Sentiment diversification with different biases. In Proceedings of the 36th Annual International ACM SIGIR Conference, pages 593--602, Dublin, Ireland, July 2013. Google ScholarDigital Library
- G. V. Cormack, C. L. A. Clarke, and S. B$\ddotu$ttcher. Reciprocal rank fusion outperforms condorcet and individual rank learning mthods. In Proceedings of the 32nd Annual International ACM SIGIR Conference, pages 758--759, Boston, MA, USA, July 2009. Google ScholarDigital Library
- V. Dang and W. B. Croft. Term level search result diversification. In Proceedings of the 36th Annual International ACM SIGIR Conference, pages 603--612, Dublin, Ireland, July 2013. Google ScholarDigital Library
- Z. Dou, K. Chen, R. Song, Y. Ma, S. Shi, and J. Wen. Microsoft Research Asia at the web track of TREC 2009. In Proceedings of The Eighteenth Text REtrieval Conference, Gaithersburg, Maryland, USA, November 2009.Google Scholar
- R. Kohavi. A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence (Volumn 2), pages 1137--1145, Montreal, Canada, August 1995. Google ScholarDigital Library
- J. H. Lee. Analysis of multiple evidence combination. In Proceedings of the 20th Annual International ACM SIGIR Conference, pages 267--275, Philadelphia, Pennsylvania, USA, July 1997. Google ScholarDigital Library
- R. McCreadie, C. Macdonald, R. Santos, and I. Ounis. University of Glasgow at TREC 2011: Experiments with terrier in crowdsourcing, microblog, and web tracks. In Proceedings of The Twentieth Text REtrieval Conference, Gaithersburg, Maryland, USA, November 2011.Google Scholar
- S. Wu and S. McClean. Performance prediction of data fusion for information retrieval. Information Processing & Management, 42(4):899--915, July 2006. Google ScholarDigital Library
Index Terms
Search result diversification via data fusion
Recommendations
Fusion helps diversification
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrievalA popular strategy for search result diversification is to first retrieve a set of documents utilizing a standard retrieval method and then rerank the results. We adopt a different perspective on the problem, based on data fusion. Starting from the ...
Applying the data fusion technique to blog opinion retrieval
In recent years, blogs have been very popular on the Web as a grassroots publishing platform. Some research has been conducted on them and blog opinion retrieval is one of the key issues. In this paper, we investigate if data fusion can be useful for ...
Probability-based fusion of information retrieval result sets
Information Retrieval (IR) forms the basis of many information management tasks. Information management itself has become an extremely important area as the amount of electronically available information increases dramatically. There are numerous ...
Comments