Back to Search Start Over

Investigations on Translation Model Adaptation Using Monolingual Data

Authors :
Lambert, Patrik
Schwenk, Holger
Servan, Christophe
Abdul-Rauf, Sadaf
Lambert, Patrik
Bringing Machine Translation for European Languages to the User - EUROMATRIXPLUS - - EC:FP7:ICT2009-03-01 - 2012-02-29 - 231720 - VALID
Laboratoire d'Informatique de l'Université du Mans (LIUM)
Le Mans Université (UM)
European Project: 231720,EC:FP7:ICT,FP7-ICT-2007-3,EUROMATRIXPLUS(2009)
Source :
Proceedings of the Sixth Workshop on Statistical Machine Translation, Sixth Workshop on Statistical Machine Translation, Sixth Workshop on Statistical Machine Translation, Jul 2011, Edinburgh, United Kingdom. pp.284-293
Publication Year :
2011
Publisher :
HAL CCSD, 2011.

Abstract

International audience; Most of the freely available parallel data to train the translation model of a statistical machine translation system comes from very specific sources (European parliament, United Nations, etc). Therefore, there is increasing interest in methods to perform an adaptation of the translation model. A popular approach is based on unsupervised training, also called self-enhancing. Both only use monolingual data to adapt the translation model. In this paper we extend the previous work and provide new insight in the existing methods. We report results on the translation between French and English. Improvements of up to 0.5 BLEU were observed with respect to a very competitive baseline trained on more than 280M words of human translated parallel data.

Details

Language :
English
Database :
OpenAIRE
Journal :
Proceedings of the Sixth Workshop on Statistical Machine Translation, Sixth Workshop on Statistical Machine Translation, Sixth Workshop on Statistical Machine Translation, Jul 2011, Edinburgh, United Kingdom. pp.284-293
Accession number :
edsair.dedup.wf.001..698d19e331fd174d503f3a61b75dd433