skip to main content
10.1145/1571941.1572065acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
poster

Cross language name matching

Published: 19 July 2009 Publication History

Abstract

Cross language information retrieval methods are used to determine which segments of Arabic language documents match name-based English queries. We investigate and contrast a word-based translation model with a character-based transliteration model in order to handle spelling variation and previously unseen names. We measure performance by making a novel use of the training data from the 2007 ACE Entity Translation

References

[1]
P. Brown, J. Cocke, S. Della Pietra, V. Della Pietra, F. Jelinek, J. Lafferty, R. Mercer, and P. Roossin. A statistical approach to machine translation. Computational Linguistics, 16(2), June 1990.
[2]
R. Florian, H. Hassan, A. Ittycheriah, H. Jing, N. Kambhatla, X. Luo, N. Nicolov, and S. Roukos. A statistical model for multilingual entity detection and tracking. In HLT-NAACL, pages 1--8, 2004.
[3]
A. Ittycheriah and S. Roukos. A maximum entropy word aligner for arabic-english machine translation. In HLT-EMNLP, pages 89--96, 2005.
[4]
Z. Song and S. Strassel. Entity translation and alignment in the ACE-07 ET task. In E. L. R. A. (ELRA), editor, Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), 2008.
[5]
S. Vogel, H. Ney, and C. Tillmann. HMM-based word alignment in statistical translation. In Proceedings of the 16th conference on Computational linguistics, pages 836--841, 1996.
[6]
J. Xu, R. M. Weischedel, and C. Nguyen. Evaluating a probabilistic model for cross-lingual information retrieval. In SIGIR, pages 105--110, 2001.

Cited By

View all
  • (2012)Entity clustering across languagesProceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies10.5555/2382029.2382039(60-69)Online publication date: 3-Jun-2012

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
July 2009
896 pages
ISBN:9781605584836
DOI:10.1145/1571941

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 July 2009

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. algorithms
  2. named entities
  3. sentence retrieval

Qualifiers

  • Poster

Conference

SIGIR '09
Sponsor:

Acceptance Rates

Overall Acceptance Rate 792 of 3,983 submissions, 20%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2012)Entity clustering across languagesProceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies10.5555/2382029.2382039(60-69)Online publication date: 3-Jun-2012

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media