Enhancing cross-language information retrieval by an automatic acquisition of bilingual terminology from comparable corpora
Source
Annual ACM Conference on Research and Development in Information Retrieval
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval
Toronto, Canada
POSTER SESSION: Posters
Pages: 397 - 398
Year of Publication: 2003
ISBN: 1-58113-646-3
Authors
Fatiha Sadat, Nara Institute of Science and Technology, Ikoma, Nara, Japan
Masatoshi Yoshikawa, Nagoya University, Furo-cho, Chikusa-ku, Nagoya, Japan
Shunsuke Uemura, Nara Institute of Science and Technology, Ikoma, Nara, Japan
ABSTRACT
This paper presents an approach to bilingual lexicon extraction from comparable corpora and evaluates it on Cross-Language Information Retrieval. We explore bi-directional extraction of bilingual terminology, primarily from comparable corpora. We propose a combined statistics-based and linguistics-based model for selecting the best translation candidates for phrasal translation. Evaluations on a large Japanese-English test collection showed that the proposed combination of bi-directional comparable corpora, bilingual dictionaries, and transliteration, augmented with linguistics-based pruning, is highly effective in Cross-Language Information Retrieval.
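The abstract does not spell out how candidates are scored, so the following is only a minimal sketch of one common way to realize bi-directional extraction from comparable corpora, not the authors' model. It assumes a context-vector approach: each term is represented by its co-occurring words, the vector is mapped into the other language through a seed dictionary, and a pair is kept only when each term is the other's best match. All function names, the window size, and the romanized toy data are hypothetical, and transliteration and linguistics-based pruning are not covered.

```python
from collections import Counter
from math import sqrt


def context_vector(word, sentences, window=2):
    """Count the words that co-occur with `word` within `window` tokens."""
    counts = Counter()
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok == word:
                lo, hi = max(0, i - window), i + window + 1
                counts.update(t for j, t in enumerate(tokens[lo:hi], lo) if j != i)
    return counts


def translate_vector(vec, seed_dict):
    """Map a context vector into the other language via a seed dictionary."""
    out = Counter()
    for w, c in vec.items():
        for t in seed_dict.get(w, ()):
            out[t] += c
    return out


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def best_match(term, src_sents, tgt_sents, candidates, seed_dict):
    """Return the candidate whose context is most similar to the term's context."""
    vec = translate_vector(context_vector(term, src_sents), seed_dict)
    scored = [(cosine(vec, context_vector(c, tgt_sents)), c) for c in candidates]
    return max(scored)[1]


def bidirectional_pairs(src_terms, tgt_terms, src_sents, tgt_sents, seed_fwd, seed_bwd):
    """Keep only pairs that are each other's best match in both directions."""
    forward = {s: best_match(s, src_sents, tgt_sents, tgt_terms, seed_fwd)
               for s in src_terms}
    backward = {t: best_match(t, tgt_sents, src_sents, src_terms, seed_bwd)
                for t in tgt_terms}
    return sorted((s, t) for s, t in forward.items() if backward.get(t) == s)


# Toy comparable "corpora" (invented, romanized for readability).
ja_sents = [["ginko", "kane", "yokin"], ["kawa", "mizu", "sakana"]]
en_sents = [["bank", "money", "deposit"], ["river", "water", "fish"]]

# Hypothetical seed dictionaries covering only the context words.
ja_en = {"kane": ["money"], "yokin": ["deposit"], "mizu": ["water"], "sakana": ["fish"]}
en_ja = {"money": ["kane"], "deposit": ["yokin"], "water": ["mizu"], "fish": ["sakana"]}

pairs = bidirectional_pairs(["ginko", "kawa"], ["bank", "river"],
                            ja_sents, en_sents, ja_en, en_ja)
print(pairs)
```

Requiring agreement in both directions trades recall for precision: a candidate that looks plausible from the source side is discarded unless the target side independently selects the same source term.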
CITED BY 2
G. Craig Murray, Bonnie J. Dorr, Jimmy Lin, Jan Hajič, Pavel Pecina, Leveraging reusability: cost-effective lexical acquisition for large-scale ontology translation, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, p.945-952, July 17-18, 2006, Sydney, Australia