ABSTRACT
Most cross language information retrieval research concentrates on language pairs for which direct, rich, and often multiple translation resources already exist. However, for most language pairs, translation via an intermediate language is necessary. Two distinct methods for dealing with the additional ambiguity introduced by the extra translation step have been proposed and individually, shown to improve retrieval effectiveness. Two previous works indicated that in combination, the methods were ineffective. This paper provides strong empirical evidence that the methods can be combined to produce consistent and often significant improvements in retrieval effectiveness. The improvement is shown across a number of different intermediate languages and test collections.
- Ballesteros, L. Cross-Language Retrieval via Transitive Translation. Advances in Information Retrieval: Recent Research from the CIIR, pp. 203-234. W. Bruce Croft, Ed. Kluwer Academic Publishers, 2000.]]Google Scholar
- Ballesteros, L. and Croft, W. B. Resolving Ambiguity for Cross-Language Retrieval. Proceedings of the 21st ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 64--71, August 1998.]] Google ScholarDigital Library
- Ballesteros, L. and Croft, W. B. Phrasal Translation and Query Expansion Techniques for Cross-Language Information Retrieval. Proceedings of the 20th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 84--91, July 1997.]] Google ScholarDigital Library
- Broglio, J., Callan, J., and Croft, W. B. Inquery system overview. In Proceedings of the TIPSTER Text Program (Phase I), pp 47--67, 1994.]] Google ScholarDigital Library
- Gollins, T and Sanderson, M., Improving Cross Language Retrieval with Triangulated Translation. In Proceedings of the 24th annual international ACM SIGIR conference on research and development in information retrieval, pp 90-95, 2001]] Google ScholarDigital Library
- Gollins, T., Dictionary Based Transitive Cross-Language Information Retrieval Using Lexical Triangulation. MSc Dissertation, University of Sheffield. 2000.]]Google Scholar
- Lehtokangas, R. and Airio, E., Translation via a Pivot Language Challenges Direct Translation in CLIR. Working Notes of the SIGIR 2002 Workshop on Cross-language Information Retrieval, pp 19--24.]]Google Scholar
- Lehtokangas, R. and Airio, E., and Jarvelin, K. Transitive dictionary translation challenges direct dictionary translation in CLIR. University of Tampere Technical Report, 2003. http://www.info.uta.fi/tutkimus/fire/archive/RL_200503.pdf.]]Google Scholar
- Peters, C., Cross-Language Information Retrieval and Evaluation, Workshop of Cross-Language Evaluation Forum, CLEF 2000, Lisbon, Portugal, September 21-22, 2000, Revised Papers. Editor: Carol Peters.]] Google ScholarDigital Library
- Piepenbrock, R. CELEX, The Dutch Center for Lexical Information, CELEX, 2000. http://www.kun.nl/celex/.]]Google Scholar
- Pirkola, A., Hedlund, T., Keskusalo, H., and Jarvelin, K., Cross-language Information Retrieval Problems, Methods, and Research Findings. In Information Retrieval, vol. 4, 2001. pp. 209--230.]] Google ScholarDigital Library
- Pirkola, A. The effects of query structure and dictionary setups in dictionary-based cross-language information retrieval. In Proceedings of the 21st ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 55--63, August 1998]] Google ScholarDigital Library
- Voorhees, E. M., and Harman, D. K., NIST Special Publication 500-246: The Eighth Text REtrieval Conference (TREC-8), 1999. Editors: E. M. Voorhees and D. K. Harman]]Google Scholar
- Sperer, R. and Oard, D. W. Structured Translation for Cross-Language Information Retrieval. In Proceedings of the 23rd annual international ACM SIGIR conference on research and development in information retrieval, pp 120--127, 2000.]] Google ScholarDigital Library
- University of Neuchatel multilingual retrieval page. http://www.unine.ch/info/clef/]]Google Scholar
- Kielikone multilingual dictionaries. www.kielikone.fi.]]Google Scholar
- Darwish, K. and Oard, D. W. Probabilistic structured query methods. In Proceedings of the 26th ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 338--344, July 2003.]] Google ScholarDigital Library
Index Terms
- Addressing the lack of direct translation resources for cross-language retrieval
Recommendations
Transitive dictionary translation challenges direct dictionary translation in CLIR
The paper reports on experiments carried out in transitive translation, a branch of cross-language information retrieval (CLIR). By transitive translation we mean translation of search queries into the language of the document collection through an ...
Cross-language spoken document retrieval using HMM-based retrieval model with multi-scale fusion
Cross-language spoken document retrieval (CL-SDR) is the technology that facilitates automatic retrieval of relevant information from a collection of spoken documents in a language that is different from that used in the queries. Information sources ...
Translation techniques in cross-language information retrieval
Cross-language information retrieval (CLIR) is an active sub-domain of information retrieval (IR). Like IR, CLIR is centered on the search for documents and for information contained within those documents. Unlike IR, CLIR must reconcile queries and ...
Comments