research-article

Improved Monolingual Hypothesis Alignment for Machine Translation System Combination

Authors:
Xiaodong He

Microsoft Research

Microsoft Research
View Profile

,
Mei Yang

University of Washington

University of Washington
View Profile

,
Jianfeng Gao

Microsoft Research

Microsoft Research
View Profile

,
Patrick Nguyen

Microsoft Research

Microsoft Research
View Profile

,
Robert Moore

Microsoft Research

Microsoft Research
View Profile

ACM Transactions on Asian Language Information Processing Volume 8 Issue 2Article No.: 6pp 1–19https://doi.org/10.1145/1526252.1526254

Published:01 May 2009Publication History

ACM Transactions on Asian Language Information Processing

Abstract

This article presents a new hypothesis alignment method for combining outputs of multiple machine translation (MT) systems. An indirect hidden Markov model (IHMM) is proposed to address the synonym matching and word ordering issues in hypothesis alignment. Unlike traditional HMMs whose parameters are trained via maximum likelihood estimation (MLE), the parameters of the IHMM are estimated indirectly from a variety of sources including word semantic similarity, word surface similarity, and a distance-based distortion penalty. The IHMM-based method significantly outperforms the state-of-the-art, TER-based alignment model in our experiments on NIST benchmark datasets. Our combined SMT system using the proposed method achieved the best Chinese-to-English translation result in the constrained training track of the 2008 NIST Open MT Evaluation.

References

Andrew, G. 2006. A hybrid Markov/semi-Markov conditional random field for sequence segmentation. In Proceedings of the Joint Meeting of the Conference on Empirical Methods in Natural Language Proceedings (EMNLP’06). Google ScholarDigital Library
Bangalore, S., Bordel, G., and Riccardi, G. 2001. Computing consensus translation from multiple machine translation systems. In Proceedings of the IEEE Conference on Automatic Speech Recognition and Understanding (ASRU’01). 351--354.Google Scholar
Brent, R. 1973. Algorithms for minimization without derivatives. Prentice-Hall, Chapter 7.Google Scholar
Chiang, D. 2007. Hierarchical phrase-based translation. Comput. Linguist. 33, 2, 201--228. Google ScholarDigital Library
Corston-Oliver, S., Aue, A., Duh, K., and Ringger, E. 2006. Multilingual dependency parsing using Bayes point machine. In Proceedings of the North American Chapter of the Association for Computational Linguistic (NAACL’06). Google ScholarDigital Library
Foster, G. and Kuhn, R. 2007. Mixture-model adaptation for SMT. In Proceedings of the 2nd ACL Workshop on Statistical Machine Translation (SML’07). 128--136. Google ScholarDigital Library
Galley, M., Graehl, J., Knight, K., Marcu, D. DeNeefe, S., Wang, W., and Thayer, I. 2006. Scalable inference and training of context-rich syntactic translation models. In Proceedings of the International Conference on Computer Linguistics (COLING’06). 961--968. Google ScholarDigital Library
Gao, J., Li, M., Wu, A., and Huang, C.-N. 2005. Chinese word segmentation and named entity recognition: A pragmatic approach. Comput. Linguist. 31, 4. Google ScholarDigital Library
He, X. 2007. Using word-dependent transition models in HMM based word alignment for statistical machine translation. In Proceedings of the 2nd ACL Workshop on Statistical Machine Translation (SMT’07). Google ScholarDigital Library
Huang, L. and Chiang, D. 2007. Forest rescoring: Faster decoding with integrated language models. In Proceedings of the Association on Computer Linguistics (ACL’07).Google Scholar
Jayaraman, S. and Lavie, A. 2005. Multi-engine machine translation guided by explicit word matching. In Proceedings of the European Association for Machine Translation (EAMT’05). 143--152.Google Scholar
Karakos, D., Eisner, J., Khudanpur, S., and Dreyer, M. 2008. Machine translation system combination using ITG-based alignments. In Proceedings of the Human Language Technology Conference (ACL-HLT’08). 81--84. Google ScholarDigital Library
Koehn, P., Och, F. J., and Marcu, D. 2003. Statistical phrase based translation. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 48--54. Google ScholarDigital Library
Levy, R. and Manning, C. 2003. Is it harder to parse Chinese, or the Chinese treebank? In Proceedings of the Association on Computer Linguistics (ACL’03). Google ScholarDigital Library
Li, C.-H., Li, M., Zhang, D., Li, M., Zhou, M., and Guan, Y. 2007. A probabilistic approach to syntax-based reordering for statistical machine translation. In Proceedings of the Association on Computer Linguistics (ACL’07). 720--727.Google Scholar
Liang, P., Taskar, B., and Klein, D. 2006. Alignment by agreement. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’06). 104--111. Google ScholarDigital Library
Matusov, E., Ueffing, N., and Ney, H. 2006. Computing consensus translation from multiple machine translation systems using enhanced hypotheses alignment. In Proceedings of the European Association of Computer Linguistics (EACL’06). 33--40.Google Scholar
Moore, R. and Quirk, C. 2007. Faster beam-search decoding for phrasal statistical machine translation. In Proceedings of the Machine Translation Summit XI (MT’07).Google Scholar
Nguyen, P., Gao J., and Mahajan, M. 2007. MSRLM: A scalable language modeling toolkit. Microsoft Research Tech. Rep. MSR-TR-2007-144.Google Scholar
NIST. 2008. The 2008 NIST open machine translation evaluation. http://www.nist.gov/speech/tests/mt/2008/doc/.Google Scholar
Och, F. J. and Ney, H. 2003. A systematic comparison of various statistical alignment models. Comput. Linguist. 29, 1, 19--51. Google ScholarDigital Library
Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the Association for Computational Linguistics (ACL’02). 311--318. Google ScholarDigital Library
Philipp, K., Och, F. J., and Marcu, D. 2003. Statistical phrase based translation. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 48--54. Google ScholarDigital Library
Quirk, C., Menezes, A., and Cherry, C. 2005. Dependency treelet translation: Syntactically informed phrasal SMT. In Proceedings of the Association for Computer Linguistics (ACL’05). 271--279. Google ScholarDigital Library
Rosti, A.-V. I., Xiang, B., Matsoukas, S., Schwartz, R., Ayan, N. F., and Dorr, B. J. 2007a. Combining outputs from multiple machine translation systems. In Proceedings of Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07). 228--235.Google Scholar
Rosti, A.-V. I., Matsoukas, S., and Schwartz, R. 2007b. Improved word-level system combination for machine translation. In Proceedings of the Association for Computer Linguistics (ACL’07). 312--319.Google Scholar
Rosti, A.-V.I., Zhang, B., Matsoukas, S., and Schwartz, R. 2008. Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. In Proceedings of the 3rd ACL Workshop on Statistical Machine Translation (SMT’08). 183--186. Google ScholarDigital Library
Shen, L., Xu, J., and Weischedel, R. 2008. A new string-to-dependency machine translation algorithm with a target dependency language model. In Proceedings of the Association for Computer Linguistics/Human Language Translation (ACL-HLT’08). 577--585.Google Scholar
Sim, K. C., Byrne, W. J., Gales, M. J. F., Sahbi, H., and Woodland, P. C. 2007. Consensus network decoding for statistical machine translation system combination. In Proceedings of the International on Acoustics, Speech, and Signal Processing (ICASSP’07). 4, 105--108.Google Scholar
Snover, M., Dorr, B., Schwartz, R., Micciulla, L., and Makhoul, J. 2006. A study of translation edit rate with targeted human annotation. In Proceedings of the Association for Machine Translation in the Americas (AMTA’06).Google Scholar
Toutanova, K., Suzuki, H., and Ruopp, A. 2008. Applying morphology generation models to machine translation. In Proceedings of the Association of Computer Linguistics (ACL’08). 514--522.Google Scholar
Vogel, S., Ney, H., and Tillmann, C. 1996. HMM-based word alignment in statistical translation. In Proceedings of the International Conference on Computer Linguistics (COLING’96). 836--841. Google ScholarDigital Library
Wang, C., Collins, M., and Koehn, P. 2007a. Chinese syntactic reordering for statistical machine translation. In Proceedings of the Joint Meeting of the Conference on Empirical Methods in Natural Language Processing (EMNLP-CoNLL’07). 737--745.Google Scholar
Wang, W., Stolcke, A., and Zheng, J. 2007b. Reranking machine translation hypotheses with structured and Web-based language models. In Proceedings of the IEEE Conference on Automatic Speech Recognition and Understanding (ASRU’07). 159--164.Google Scholar
Xiong, D., Liu, Q., and Lin, S. 2006. Maximum entropy based phrase reordering model for statistical machine translation. In Proceedings of the Association for Computer Linguistics (ACL’06). 521--528. Google ScholarDigital Library

Index Terms

Improved Monolingual Hypothesis Alignment for Machine Translation System Combination
1. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
      1. Machine translation

Recommendations

English to Tamil statistical machine translation and alignment using HMM
ICNVS'10: Proceedings of the 12th international conference on Networking, VLSI and signal processing

This paper describes English to Tamil statistical machine translation and its alignment using Hidden Markov Model (HMM).Statistical machine translation is a part of natural language processing and is based on probability distribution. Machine ...
Read More
HMM Word and Phrase Alignment for Statistical Machine Translation

Estimation and alignment procedures for word and phrase alignment hidden Markov models (HMMs) are developed for the alignment of parallel text. The development of these models is motivated by an analysis of the desirable features of IBM Model 4, one of ...
Read More
Integrating source-language context into phrase-based statistical machine translation

The translation features typically used in Phrase-Based Statistical Machine Translation (PB-SMT) model dependencies between the source and target phrases, but not among the phrases in the source language themselves. A swathe of research has demonstrated ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Asian Language Information Processing Volume 8, Issue 2
May 2009
89 pages
ISSN:1530-0226
EISSN:1558-3430
DOI:10.1145/1526252
Issue’s Table of Contents

Copyright © 2009 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 1 May 2009
- Accepted: 1 March 2009
- Revised: 1 February 2009
- Received: 1 November 2008
Published in talip Volume 8, Issue 2

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Statistical machine translation
hidden Markov model
system combination
word alignment
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 1
  Total Citations
  View Citations
- 245
  Total Downloads
- Downloads (Last 12 months)2
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Improved Monolingual Hypothesis Alignment for Machine Translation System Combination

ACM Transactions on Asian Language Information Processing

Abstract

References

Cited By

Index Terms

Recommendations

English to Tamil statistical machine translation and alignment using HMM

HMM Word and Phrase Alignment for Statistical Machine Translation

Integrating source-language context into phrase-based statistical machine translation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Improved Monolingual Hypothesis Alignment for Machine Translation System Combination

ACM Transactions on Asian Language Information Processing

Abstract

References

Cited By

Index Terms

Recommendations

English to Tamil statistical machine translation and alignment using HMM

HMM Word and Phrase Alignment for Statistical Machine Translation

Integrating source-language context into phrase-based statistical machine translation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media