ABSTRACT
Result merging is an important step in federated search to merge the documents returned from multiple source-specific ranked lists for a user query. Previous result merging methods such as Semi-Supervised Learning (SSL) and Sample- Agglomerate Fitting Estimate (SAFE) use regression methods to estimate global document scores from document ranks in individual ranked lists. SSL relies on overlapping documents that exist in both individual ranked lists and a centralized sample database. SAFE goes a step further by using both overlapping documents with accurate rank information and documents with estimated rank information for regression. However, existing methods do not distinguish the accurate rank information from the estimated information. Furthermore, all documents are assigned equal weights in regression while intuitively, documents in the top should carry higher weights. This paper proposes a weighted curve fitting method for result merging in federated search. The new method explicitly models the importance of information from overlapping documents over non-overlapping ones. It also weights documents at different positions differently. Empirically results on two datasets clearly demonstrate the advantage of the proposed algorithm.
- J. Callan. Distributed information retrieval. Advances in Information Retrieval, pages 127--150, 2000.Google Scholar
- J. Callan, W. B. Croft, and S. M. Harding. The inquery retrieval system. In Proceedings of the Third International Conference on Database and Expert Systems Applications, 1992.Google ScholarCross Ref
- S. Kirsch. Document retrieval over networks wherein ranking and relevance scores are computed at the client for multiple database documents, Aug. 19 2003.Google Scholar
- M. Shokouhi and J. Zobel. Robust result merging using sample-based score estimates. ACM Transactions on Information Systems, 27(3):1--29, 2009. Google ScholarDigital Library
- L. Si and J. Callan. A semi-supervised learning method to merge search engine results. ACM Transactions on Information Systems, 21(4):457--491, 2003. Google ScholarDigital Library
Index Terms
- A weighted curve fitting method for result merging in federated search
Recommendations
Mixture model with multiple centralized retrieval algorithms for result merging in federated search
SIGIR '12: Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrievalResult merging is an important research problem in federated search for merging documents retrieved from multiple ranked lists of selected information sources into a single list. The state-of-the-art result merging algorithms such as Semi-Supervised ...
A personalized result merging method for metasearch engine
ICSCA '17: Proceedings of the 6th International Conference on Software and Computer ApplicationsMetasearch engine integrates the search results from multiple sources, and improves recall in the big data environment. Result merging is a key component which will greatly affect the effectiveness of a metasearch engine. Great progress has been made in ...
Exploration of the tradeoff between effectiveness and efficiency for results merging in federated search
SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrievalFederated search is the task of retrieving relevant documents from different information resources. One of the main research problems in federated search is to combine the results from different sources into a single ranked list. Recent work proposed a ...
Comments