ABSTRACT
The fundamental difference between standard information retrieval and XML retrieval is the unit of retrieval. In traditional IR, the unit of retrieval is fixed: it is the complete document. In XML retrieval, every XML element in a document is a retrievable unit. This makes XML retrieval more difficult: besides being relevant, a retrieved unit should be neither too large nor too small. The research presented here, a comparative analysis of two approaches to XML retrieval, aims to shed light on which XML elements should be retrieved. The experimental evaluation uses data from the Initiative for the Evaluation of XML Retrieval (INEX 2002).
Index Terms
- XML retrieval: what to retrieve?