Back to Search Start Over

The TALP-UPC phrase-based translation systems for WMT12: morphology simplification and domain adaptation

Authors :
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Formiga Fanals, Lluís
Henríquez Quintana, Carlos Alberto
Hernández Huerta, Adolfo
Mariño Acebal, José Bernardo
Monte Moreno, Enrique
Rodríguez Fonollosa, José Adrián
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Formiga Fanals, Lluís
Henríquez Quintana, Carlos Alberto
Hernández Huerta, Adolfo
Mariño Acebal, José Bernardo
Monte Moreno, Enrique
Rodríguez Fonollosa, José Adrián
Publication Year :
2012

Abstract

This paper describes the UPC participation in the WMT 12 evaluation campaign. All sys- tems presented are based on standard phrase- based Moses systems. Variations adopted sev- eral improvement techniques such as mor- phology simplification and generation and do- main adaptation. The morphology simpli- fication overcomes the data sparsity prob- lem when translating into morphologically- rich languages such as Spanish by translat- ing first to a morphology-simplified language and secondly leave the morphology gener- ation to an independent classification task. The domain adaptation approach improves the SMT system by adding new translation units learned from MT-output and reference align- ment. Results depict an improvement on TER, METEOR, NIST and BLEU scores compared to our baseline system, obtaining on the of- ficial test set more benefits from the domain adaptation approach than from the morpho- logical generalization method.<br />Peer Reviewed<br />Postprint (published version)

Details

Database :
OAIster
Notes :
8 p., application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1132966760
Document Type :
Electronic Resource