ACM Home Page
Please provide us with feedback. Feedback
Indexing and searching tera-scale Grid-Based Digital Libraries
Full text PdfPdf (138 KB)
Source ACM International Conference Proceeding Series; Vol. 152 archive
Proceedings of the 1st international conference on Scalable information systems table of contents
Hong Kong
Article No. 3  
Year of Publication: 2006
ISBN:1-59593-428-6
Authors
Robert Sanderson  University of Liverpool, Liverpool, U.K.
Ray R. Larson  University of California, Berkeley, California
Publisher
ACM  New York, NY, USA
Bibliometrics
Downloads (6 Weeks): 5,   Downloads (12 Months): 69,   Citation Count: 2
Additional Information:

abstract   references   cited by   index terms   collaborative colleagues  

Tools and Actions: Review this Article  
Save this Article to a Binder    Display Formats: BibTex  EndNote ACM Ref   
DOI Bookmark: Use this link to bookmark this Article: http://doi.acm.org/10.1145/1146847.1146850
What is a DOI?

ABSTRACT

The University of California, Berkeley and the University of Liverpool in conjunction with the San Diego Supercomputer Center are developing a framework for Grid-Based Digital Library systems and Information Retrieval Services (Cheshire3) that operates in both single-processor and distributed computing environments. In this paper we discuss some results of testing Grid-based parallel approaches in indexing and retrieval for a variety of information resources, ranging from small test collections like the TREC and INEX collections, to medium-scale metadata collections like Medline and a test version of University of California Online Union Catalog, MELVYL (with 15 million and 16.5 million records respectively) ranging up to large-scale collections like the US National Records and Archives Administration (NARA) Preservation Prototype. This paper examines our approaches to indexing and retrieving from these collections and the architecture of the system that supports them.


REFERENCES

Note: OCR errors may be found in this Reference List extracted from the full text article. ACM has opted to expose the complete List rather than only correct and linked references.

 
1
A. Rajasekar, M. Wan, R. Moore, W. Schroeder, G. Kremenek, A. Jagatheesan, C. Cowart, B. Zhu, S.-Y. Chen, and R. Olschanowsky, "Storage resource broker - managing distributed data in a grid," Computer Society of India Journal, Vol. 33, no. 4, pp. 42--54, 2003.
 
2
 
3
 
4
C. Lagoze, H. Van de Sompel, M. Nelson, S. Warner, "The Open Archives Initiative Protocol for Metadata Harvesting, Version 2.0". June 2002. http://www.openarchives.org/OAI/2.0/openarchivesprotocol.htm
 
5
R. Denenberg, R. Sanderson, M. Dovey et al. (eds), "SRW - Search/Retrieve Webservice", February 2004. http://www.loc.gov/srw
 
6
OASIS WSBPEL Technical Committee, "Web Services Business Process Execution Language". 2005. http://www.oasis-open.org/committees/wsbpel/charter.php
 
7
8
 
9
A. Geist, A. Beguelin, J. Dongarra, W. Jiang, R. Manchek, V. Sunderam, "PVM: Parallel Virtual Machine" MIT Press, 1995
 
10
Message Passing Interface Forum. "MPI: A Message-Passing Interface standard (version 1.1)", Technical report, 1995. http://www.mpiforum.org.
 
11
W3C, "SOAP Specifications" June 2003. http://www.w3.org/TR/soap/
 
12
N. Nassar, G. Newby, K. Gamiel, M. Dovey, J. Morris, "Grid Information Retrieval Architecture", 2003, urlhttp://www.gir-wg.org/
 
13
DILIGENT Project, "Architectural Overview", April 2005, http://diligentproject.org/content/view/71/99
 
14
SleepyCat Software, "Berkeley DB", 2005, http://sleepycat.com/products/db.shtml
 
15
L. Declerck, C. Frymann, "DSpace/SRB Integration", CNI Fall Task Force Meeting, http://libnet.ucsd.edu/nara/
 
16
"About the TeraGrid", 2005. http://www.teragrid.org/about/index.html
 
17
"National Centre for Text Mining", 2005. http://www.nactem.ac.uk
 
18
National Library of Medicine, "Medline Factsheet". http://www.nlm.nih.gov/pubs/factsheets/medline.html
 
19
Tsujii, Jun'ichi, "Tsujii Laboratory", 2005. http://www-tsujii.is.s.u-tokyo.ac.jp/
 
20
"PyMPI", 2005, http://pympi.sourceforge.net/
 
21
The National Archives, "National Archives Electronic Records Archives Option Award Announcement" 2005. http://www.archives.gov/era/acquisition/option-award.html
 
22
T. Phelps, P. Watry, "A No-Compromises Architecture for Digital Document Preservation" in Research and Advanced Technology for Digital Libraries 9th European Conference, ECDL2005, Proceedings 2005, pp. 266--277
 
23
P Tooby, "Building a 'Memory' for the National Science Digital Library", in NPACI Online Vol. 7, issue 2, January 2003. http://www.npaci.edu/online/v7.2/nsdl.html


Collaborative Colleagues:
Robert Sanderson: colleagues
Ray R. Larson: colleagues