ACM Home Page
Please provide us with feedback. Feedback
Updating collection representations for federated search
Full text PdfPdf (210 KB)
Source
Annual ACM Conference on Research and Development in Information Retrieval archive
Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval table of contents
Amsterdam, The Netherlands
SESSION: Collection representation in distributed IR table of contents
Pages: 511 - 518  
Year of Publication: 2007
ISBN:978-1-59593-597-7
Authors
Milad Shokouhi  RMIT University, Melbourne, Australia
Mark Baillie  University of Strathclyde, Glasgow, Scotland, UK
Leif Azzopardi  University of Glasgow, Glasgow, Scotland, UK
Sponsors
ACM: Association for Computing Machinery
SIGIR: ACM Special Interest Group on Information Retrieval
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 11,   Downloads (12 Months): 192,   Citation Count: 0
Additional Information:

abstract   references   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1277741.1277829
What is a DOI?

ABSTRACT

To facilitate the search for relevant information across a setof online distributed collections, a federated information retrieval system typically represents each collection, centrally, by a set of vocabularies or sampled documents. Accurate retrieval is therefore related to how precise each representation reflects the underlying content stored in that collection. As collections evolve over time, collection representations should also be updated to reflect any change, however, a current solution has not yet been proposed. In this study we examine both the implications of out-of-date representation sets on retrieval accuracy, as well as proposing three different policies for managing necessary updates. Each policyis evaluated on a testbed of forty-four dynamic collections over an eight-week period. Our findings show that out-of-date representations significantly degrade performance overtime, however, adopting a suitable update policy can minimise this problem.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
 
2
Baillie, M., Azzopardi, L., and Crestani, F. (2006). Adaptive query-based sampling of distributed collections. In Proc. SPIRE Conf., Glasgow, UK pages 316--328.
 
3
Callan, J. (2000). Advances in information retrieval Chapter 5, Distributed information retrieval, pages 127--150. Kluwer.
4
5
6
7
 
8
9
10
11
12
 
13
 
14
Kleinberg, J. (2006). Temporal dynamics of on-line information systems. Data Stream Management: Processing High-Speed Data Streams.
 
15
S. Kullback. Information theoery and statistics. Wiley, New York, NY 1959.
16
 
17
Paepcke, A., Brandriff, R., Janee, G., Larson, R.,Ludaescher, B., Melnik, S., and Raghavan, S. (2000). Search middleware and the simple digital library interoperability protocol. D-Lib Magazine 6(3).
 
18
 
19
Robertson, S., Walker, S., Hancock-Beaulieu, M., Gull ,A., and Lau, M. (1992). Okapi at TREC. In Proceedings of TREC-1992, Gaithersburg, MA pages 21--30.
20
21
22
23
 
24
Shokouhi, M. (2007). Central-Rank-Based Collection Selection in uncooperative distributed information retrieval. Proc. ECIR Conf., Rome, Italy pages 160--172.
 
25
26
27
28

Collaborative Colleagues:
Milad Shokouhi: colleagues
Mark Baillie: colleagues
Leif Azzopardi: colleagues