Abstract
This article presents a new hypothesis alignment method for combining outputs of multiple machine translation (MT) systems. An indirect hidden Markov model (IHMM) is proposed to address the synonym matching and word ordering issues in hypothesis alignment. Unlike traditional HMMs whose parameters are trained via maximum likelihood estimation (MLE), the parameters of the IHMM are estimated indirectly from a variety of sources including word semantic similarity, word surface similarity, and a distance-based distortion penalty. The IHMM-based method significantly outperforms the state-of-the-art, TER-based alignment model in our experiments on NIST benchmark datasets. Our combined SMT system using the proposed method achieved the best Chinese-to-English translation result in the constrained training track of the 2008 NIST Open MT Evaluation.
- Andrew, G. 2006. A hybrid Markov/semi-Markov conditional random field for sequence segmentation. In Proceedings of the Joint Meeting of the Conference on Empirical Methods in Natural Language Proceedings (EMNLP’06). Google ScholarDigital Library
- Bangalore, S., Bordel, G., and Riccardi, G. 2001. Computing consensus translation from multiple machine translation systems. In Proceedings of the IEEE Conference on Automatic Speech Recognition and Understanding (ASRU’01). 351--354.Google Scholar
- Brent, R. 1973. Algorithms for minimization without derivatives. Prentice-Hall, Chapter 7.Google Scholar
- Chiang, D. 2007. Hierarchical phrase-based translation. Comput. Linguist. 33, 2, 201--228. Google ScholarDigital Library
- Corston-Oliver, S., Aue, A., Duh, K., and Ringger, E. 2006. Multilingual dependency parsing using Bayes point machine. In Proceedings of the North American Chapter of the Association for Computational Linguistic (NAACL’06). Google ScholarDigital Library
- Foster, G. and Kuhn, R. 2007. Mixture-model adaptation for SMT. In Proceedings of the 2nd ACL Workshop on Statistical Machine Translation (SML’07). 128--136. Google ScholarDigital Library
- Galley, M., Graehl, J., Knight, K., Marcu, D. DeNeefe, S., Wang, W., and Thayer, I. 2006. Scalable inference and training of context-rich syntactic translation models. In Proceedings of the International Conference on Computer Linguistics (COLING’06). 961--968. Google ScholarDigital Library
- Gao, J., Li, M., Wu, A., and Huang, C.-N. 2005. Chinese word segmentation and named entity recognition: A pragmatic approach. Comput. Linguist. 31, 4. Google ScholarDigital Library
- He, X. 2007. Using word-dependent transition models in HMM based word alignment for statistical machine translation. In Proceedings of the 2nd ACL Workshop on Statistical Machine Translation (SMT’07). Google ScholarDigital Library
- Huang, L. and Chiang, D. 2007. Forest rescoring: Faster decoding with integrated language models. In Proceedings of the Association on Computer Linguistics (ACL’07).Google Scholar
- Jayaraman, S. and Lavie, A. 2005. Multi-engine machine translation guided by explicit word matching. In Proceedings of the European Association for Machine Translation (EAMT’05). 143--152.Google Scholar
- Karakos, D., Eisner, J., Khudanpur, S., and Dreyer, M. 2008. Machine translation system combination using ITG-based alignments. In Proceedings of the Human Language Technology Conference (ACL-HLT’08). 81--84. Google ScholarDigital Library
- Koehn, P., Och, F. J., and Marcu, D. 2003. Statistical phrase based translation. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 48--54. Google ScholarDigital Library
- Levy, R. and Manning, C. 2003. Is it harder to parse Chinese, or the Chinese treebank? In Proceedings of the Association on Computer Linguistics (ACL’03). Google ScholarDigital Library
- Li, C.-H., Li, M., Zhang, D., Li, M., Zhou, M., and Guan, Y. 2007. A probabilistic approach to syntax-based reordering for statistical machine translation. In Proceedings of the Association on Computer Linguistics (ACL’07). 720--727.Google Scholar
- Liang, P., Taskar, B., and Klein, D. 2006. Alignment by agreement. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’06). 104--111. Google ScholarDigital Library
- Matusov, E., Ueffing, N., and Ney, H. 2006. Computing consensus translation from multiple machine translation systems using enhanced hypotheses alignment. In Proceedings of the European Association of Computer Linguistics (EACL’06). 33--40.Google Scholar
- Moore, R. and Quirk, C. 2007. Faster beam-search decoding for phrasal statistical machine translation. In Proceedings of the Machine Translation Summit XI (MT’07).Google Scholar
- Nguyen, P., Gao J., and Mahajan, M. 2007. MSRLM: A scalable language modeling toolkit. Microsoft Research Tech. Rep. MSR-TR-2007-144.Google Scholar
- NIST. 2008. The 2008 NIST open machine translation evaluation. http://www.nist.gov/speech/tests/mt/2008/doc/.Google Scholar
- Och, F. J. and Ney, H. 2003. A systematic comparison of various statistical alignment models. Comput. Linguist. 29, 1, 19--51. Google ScholarDigital Library
- Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the Association for Computational Linguistics (ACL’02). 311--318. Google ScholarDigital Library
- Philipp, K., Och, F. J., and Marcu, D. 2003. Statistical phrase based translation. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 48--54. Google ScholarDigital Library
- Quirk, C., Menezes, A., and Cherry, C. 2005. Dependency treelet translation: Syntactically informed phrasal SMT. In Proceedings of the Association for Computer Linguistics (ACL’05). 271--279. Google ScholarDigital Library
- Rosti, A.-V. I., Xiang, B., Matsoukas, S., Schwartz, R., Ayan, N. F., and Dorr, B. J. 2007a. Combining outputs from multiple machine translation systems. In Proceedings of Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07). 228--235.Google Scholar
- Rosti, A.-V. I., Matsoukas, S., and Schwartz, R. 2007b. Improved word-level system combination for machine translation. In Proceedings of the Association for Computer Linguistics (ACL’07). 312--319.Google Scholar
- Rosti, A.-V.I., Zhang, B., Matsoukas, S., and Schwartz, R. 2008. Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. In Proceedings of the 3rd ACL Workshop on Statistical Machine Translation (SMT’08). 183--186. Google ScholarDigital Library
- Shen, L., Xu, J., and Weischedel, R. 2008. A new string-to-dependency machine translation algorithm with a target dependency language model. In Proceedings of the Association for Computer Linguistics/Human Language Translation (ACL-HLT’08). 577--585.Google Scholar
- Sim, K. C., Byrne, W. J., Gales, M. J. F., Sahbi, H., and Woodland, P. C. 2007. Consensus network decoding for statistical machine translation system combination. In Proceedings of the International on Acoustics, Speech, and Signal Processing (ICASSP’07). 4, 105--108.Google Scholar
- Snover, M., Dorr, B., Schwartz, R., Micciulla, L., and Makhoul, J. 2006. A study of translation edit rate with targeted human annotation. In Proceedings of the Association for Machine Translation in the Americas (AMTA’06).Google Scholar
- Toutanova, K., Suzuki, H., and Ruopp, A. 2008. Applying morphology generation models to machine translation. In Proceedings of the Association of Computer Linguistics (ACL’08). 514--522.Google Scholar
- Vogel, S., Ney, H., and Tillmann, C. 1996. HMM-based word alignment in statistical translation. In Proceedings of the International Conference on Computer Linguistics (COLING’96). 836--841. Google ScholarDigital Library
- Wang, C., Collins, M., and Koehn, P. 2007a. Chinese syntactic reordering for statistical machine translation. In Proceedings of the Joint Meeting of the Conference on Empirical Methods in Natural Language Processing (EMNLP-CoNLL’07). 737--745.Google Scholar
- Wang, W., Stolcke, A., and Zheng, J. 2007b. Reranking machine translation hypotheses with structured and Web-based language models. In Proceedings of the IEEE Conference on Automatic Speech Recognition and Understanding (ASRU’07). 159--164.Google Scholar
- Xiong, D., Liu, Q., and Lin, S. 2006. Maximum entropy based phrase reordering model for statistical machine translation. In Proceedings of the Association for Computer Linguistics (ACL’06). 521--528. Google ScholarDigital Library
Index Terms
- Improved Monolingual Hypothesis Alignment for Machine Translation System Combination
Recommendations
English to Tamil statistical machine translation and alignment using HMM
ICNVS'10: Proceedings of the 12th international conference on Networking, VLSI and signal processingThis paper describes English to Tamil statistical machine translation and its alignment using Hidden Markov Model (HMM).Statistical machine translation is a part of natural language processing and is based on probability distribution. Machine ...
HMM Word and Phrase Alignment for Statistical Machine Translation
Estimation and alignment procedures for word and phrase alignment hidden Markov models (HMMs) are developed for the alignment of parallel text. The development of these models is motivated by an analysis of the desirable features of IBM Model 4, one of ...
Integrating source-language context into phrase-based statistical machine translation
The translation features typically used in Phrase-Based Statistical Machine Translation (PB-SMT) model dependencies between the source and target phrases, but not among the phrases in the source language themselves. A swathe of research has demonstrated ...
Comments