Back to Search
Start Over
Cognate-aware morphological segmentation for multilingual neural translation
- Source :
- University of Helsinki, Proceedings of the Third Conference on Machine Translation: Shared Task Papers, WMT (shared task)
-
Abstract
- This article describes the Aalto University entry to the WMT18 News Translation Shared Task. We participate in the multilingual subtrack with a system trained under the constrained condition to translate from English to both Finnish and Estonian. The system is based on the Transformer model. We focus on improving the consistency of morphological segmentation for words that are similar orthographically, semantically, and distributionally; such words include etymological cognates, loan words, and proper names. For this, we introduce Cognate Morfessor, a multilingual variant of the Morfessor method. We show that our approach improves the translation quality particularly for Estonian, which has less resources for training the translation model.<br />To appear in WMT18
- Subjects :
- FOS: Computer and information sciences
Computer science
02 engineering and technology
010501 environmental sciences
computer.software_genre
01 natural sciences
cognate
morphology
0202 electrical engineering, electronic engineering, information engineering
Proper noun
Cognate
0105 earth and related environmental sciences
Transformer (machine learning model)
Computer Science - Computation and Language
business.industry
Estonian
language.human_language
neural machine translation
language
020201 artificial intelligence & image processing
Artificial intelligence
multilingual
business
Computation and Language (cs.CL)
computer
Morphological segmentation
Natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- University of Helsinki, Proceedings of the Third Conference on Machine Translation: Shared Task Papers, WMT (shared task)
- Accession number :
- edsair.doi.dedup.....b5806cbb8cce22c5787d42a6e429af18