skip to main content
research-article

Improved Monolingual Hypothesis Alignment for Machine Translation System Combination

Published:01 May 2009Publication History
Skip Abstract Section

Abstract

This article presents a new hypothesis alignment method for combining outputs of multiple machine translation (MT) systems. An indirect hidden Markov model (IHMM) is proposed to address the synonym matching and word ordering issues in hypothesis alignment. Unlike traditional HMMs whose parameters are trained via maximum likelihood estimation (MLE), the parameters of the IHMM are estimated indirectly from a variety of sources including word semantic similarity, word surface similarity, and a distance-based distortion penalty. The IHMM-based method significantly outperforms the state-of-the-art, TER-based alignment model in our experiments on NIST benchmark datasets. Our combined SMT system using the proposed method achieved the best Chinese-to-English translation result in the constrained training track of the 2008 NIST Open MT Evaluation.

References

  1. Andrew, G. 2006. A hybrid Markov/semi-Markov conditional random field for sequence segmentation. In Proceedings of the Joint Meeting of the Conference on Empirical Methods in Natural Language Proceedings (EMNLP’06). Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Bangalore, S., Bordel, G., and Riccardi, G. 2001. Computing consensus translation from multiple machine translation systems. In Proceedings of the IEEE Conference on Automatic Speech Recognition and Understanding (ASRU’01). 351--354.Google ScholarGoogle Scholar
  3. Brent, R. 1973. Algorithms for minimization without derivatives. Prentice-Hall, Chapter 7.Google ScholarGoogle Scholar
  4. Chiang, D. 2007. Hierarchical phrase-based translation. Comput. Linguist. 33, 2, 201--228. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Corston-Oliver, S., Aue, A., Duh, K., and Ringger, E. 2006. Multilingual dependency parsing using Bayes point machine. In Proceedings of the North American Chapter of the Association for Computational Linguistic (NAACL’06). Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Foster, G. and Kuhn, R. 2007. Mixture-model adaptation for SMT. In Proceedings of the 2nd ACL Workshop on Statistical Machine Translation (SML’07). 128--136. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Galley, M., Graehl, J., Knight, K., Marcu, D. DeNeefe, S., Wang, W., and Thayer, I. 2006. Scalable inference and training of context-rich syntactic translation models. In Proceedings of the International Conference on Computer Linguistics (COLING’06). 961--968. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Gao, J., Li, M., Wu, A., and Huang, C.-N. 2005. Chinese word segmentation and named entity recognition: A pragmatic approach. Comput. Linguist. 31, 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. He, X. 2007. Using word-dependent transition models in HMM based word alignment for statistical machine translation. In Proceedings of the 2nd ACL Workshop on Statistical Machine Translation (SMT’07). Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Huang, L. and Chiang, D. 2007. Forest rescoring: Faster decoding with integrated language models. In Proceedings of the Association on Computer Linguistics (ACL’07).Google ScholarGoogle Scholar
  11. Jayaraman, S. and Lavie, A. 2005. Multi-engine machine translation guided by explicit word matching. In Proceedings of the European Association for Machine Translation (EAMT’05). 143--152.Google ScholarGoogle Scholar
  12. Karakos, D., Eisner, J., Khudanpur, S., and Dreyer, M. 2008. Machine translation system combination using ITG-based alignments. In Proceedings of the Human Language Technology Conference (ACL-HLT’08). 81--84. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Koehn, P., Och, F. J., and Marcu, D. 2003. Statistical phrase based translation. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 48--54. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Levy, R. and Manning, C. 2003. Is it harder to parse Chinese, or the Chinese treebank? In Proceedings of the Association on Computer Linguistics (ACL’03). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Li, C.-H., Li, M., Zhang, D., Li, M., Zhou, M., and Guan, Y. 2007. A probabilistic approach to syntax-based reordering for statistical machine translation. In Proceedings of the Association on Computer Linguistics (ACL’07). 720--727.Google ScholarGoogle Scholar
  16. Liang, P., Taskar, B., and Klein, D. 2006. Alignment by agreement. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’06). 104--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Matusov, E., Ueffing, N., and Ney, H. 2006. Computing consensus translation from multiple machine translation systems using enhanced hypotheses alignment. In Proceedings of the European Association of Computer Linguistics (EACL’06). 33--40.Google ScholarGoogle Scholar
  18. Moore, R. and Quirk, C. 2007. Faster beam-search decoding for phrasal statistical machine translation. In Proceedings of the Machine Translation Summit XI (MT’07).Google ScholarGoogle Scholar
  19. Nguyen, P., Gao J., and Mahajan, M. 2007. MSRLM: A scalable language modeling toolkit. Microsoft Research Tech. Rep. MSR-TR-2007-144.Google ScholarGoogle Scholar
  20. NIST. 2008. The 2008 NIST open machine translation evaluation. http://www.nist.gov/speech/tests/mt/2008/doc/.Google ScholarGoogle Scholar
  21. Och, F. J. and Ney, H. 2003. A systematic comparison of various statistical alignment models. Comput. Linguist. 29, 1, 19--51. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J. 2002. BLEU: A method for automatic evaluation of machine translation. In Proceedings of the Association for Computational Linguistics (ACL’02). 311--318. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Philipp, K., Och, F. J., and Marcu, D. 2003. Statistical phrase based translation. In Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL’03). 48--54. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Quirk, C., Menezes, A., and Cherry, C. 2005. Dependency treelet translation: Syntactically informed phrasal SMT. In Proceedings of the Association for Computer Linguistics (ACL’05). 271--279. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Rosti, A.-V. I., Xiang, B., Matsoukas, S., Schwartz, R., Ayan, N. F., and Dorr, B. J. 2007a. Combining outputs from multiple machine translation systems. In Proceedings of Human Language Technology Conference/North American Chapter of the Association for Computational Linguistics (HLT-NAACL’07). 228--235.Google ScholarGoogle Scholar
  26. Rosti, A.-V. I., Matsoukas, S., and Schwartz, R. 2007b. Improved word-level system combination for machine translation. In Proceedings of the Association for Computer Linguistics (ACL’07). 312--319.Google ScholarGoogle Scholar
  27. Rosti, A.-V.I., Zhang, B., Matsoukas, S., and Schwartz, R. 2008. Incremental hypothesis alignment for building confusion networks with application to machine translation system combination. In Proceedings of the 3rd ACL Workshop on Statistical Machine Translation (SMT’08). 183--186. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Shen, L., Xu, J., and Weischedel, R. 2008. A new string-to-dependency machine translation algorithm with a target dependency language model. In Proceedings of the Association for Computer Linguistics/Human Language Translation (ACL-HLT’08). 577--585.Google ScholarGoogle Scholar
  29. Sim, K. C., Byrne, W. J., Gales, M. J. F., Sahbi, H., and Woodland, P. C. 2007. Consensus network decoding for statistical machine translation system combination. In Proceedings of the International on Acoustics, Speech, and Signal Processing (ICASSP’07). 4, 105--108.Google ScholarGoogle Scholar
  30. Snover, M., Dorr, B., Schwartz, R., Micciulla, L., and Makhoul, J. 2006. A study of translation edit rate with targeted human annotation. In Proceedings of the Association for Machine Translation in the Americas (AMTA’06).Google ScholarGoogle Scholar
  31. Toutanova, K., Suzuki, H., and Ruopp, A. 2008. Applying morphology generation models to machine translation. In Proceedings of the Association of Computer Linguistics (ACL’08). 514--522.Google ScholarGoogle Scholar
  32. Vogel, S., Ney, H., and Tillmann, C. 1996. HMM-based word alignment in statistical translation. In Proceedings of the International Conference on Computer Linguistics (COLING’96). 836--841. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Wang, C., Collins, M., and Koehn, P. 2007a. Chinese syntactic reordering for statistical machine translation. In Proceedings of the Joint Meeting of the Conference on Empirical Methods in Natural Language Processing (EMNLP-CoNLL’07). 737--745.Google ScholarGoogle Scholar
  34. Wang, W., Stolcke, A., and Zheng, J. 2007b. Reranking machine translation hypotheses with structured and Web-based language models. In Proceedings of the IEEE Conference on Automatic Speech Recognition and Understanding (ASRU’07). 159--164.Google ScholarGoogle Scholar
  35. Xiong, D., Liu, Q., and Lin, S. 2006. Maximum entropy based phrase reordering model for statistical machine translation. In Proceedings of the Association for Computer Linguistics (ACL’06). 521--528. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Improved Monolingual Hypothesis Alignment for Machine Translation System Combination

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Asian Language Information Processing
      ACM Transactions on Asian Language Information Processing  Volume 8, Issue 2
      May 2009
      89 pages
      ISSN:1530-0226
      EISSN:1558-3430
      DOI:10.1145/1526252
      Issue’s Table of Contents

      Copyright © 2009 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 1 May 2009
      • Accepted: 1 March 2009
      • Revised: 1 February 2009
      • Received: 1 November 2008
      Published in talip Volume 8, Issue 2

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Research
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader