Back to Search Start Over

The TALP & I2R SMT Systems for IWSLT 2008

Authors :
Li, H.
Aw, A.
Zhang, Ming
Khalilov, Maxim
Ruiz Costa-Jussà, Marta
Henríquez Quintana, Carlos Alberto
Rodríguez Fonollosa, José Adrián
Hernández, A.
Mariño Acebal, José Bernardo
Banchs Martínez, Rafael Enrique
Chen, B.
Universitat Politècnica de Catalunya. Departament de Teoria del Senyal i Comunicacions
Universitat Politècnica de Catalunya. VEU - Grup de Tractament de la Parla
Source :
UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), Recercat. Dipósit de la Recerca de Catalunya, instname
Publication Year :
2008
Publisher :
NICT/ATR, 2008.

Abstract

This paper gives a description of the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Polit`ecnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the 2008 systems’ architecture and outlines translation schemes we have used, mainly focusing on the new techniques that are challenged to improve speech-to-speech translation quality. The novelties we have introduced are: improved reordering method, linear combination of translation and reordering models and new technique dealing with punctuation marks insertion for a phrase-based SMT system. This year we focus on the Arabic-English, Chinese-Spanish and pivot Chinese-(English)-Spanish translation tasks.

Details

Language :
English
Database :
OpenAIRE
Journal :
UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), Recercat. Dipósit de la Recerca de Catalunya, instname
Accession number :
edsair.dedup.wf.001..1ece9dc125f4302932783ac588883147