Morfeus+: Word parsing in Basque beyond morphological segmentation

Authors :
Koldo Gojenola
Xabier Artola
Itziar Aduriz
Zuhaitz Beloki
Jose Maria Arriola
Nerea Ezeiza
Source :
Word Structure. 13:283-315
Publication Year :
2020
Publisher :
Edinburgh University Press, 2020.

Abstract

This work describes the formalization of a word structure grammar that represents the complex morphological and morphosyntactic information embedded within the word forms of an agglutinative language (Basque), giving a comprehensive linguistic description of the main morphological phenomena, such as affixation, derivation, and composition, and also taking into account the modeling of both standard and non-standard words. We have identified the relevant issues to be addressed in the representation of such a grammar. We also present the development of Morfeus+, a tool for the analysis of unrestricted texts, testing its applicability and showing that its coverage is wide and robust, allowing the efficient processing of large volumes of data. This paper describes a mature system that has required several person-years of work and that tries to integrate a rigorous linguistic specification with more practical implementation matters, such as the appropriate treatment of unknown words in unrestricted texts.

Details

ISSN :
1755-2036 and 1750-1245
Volume :
13
Database :
OpenAIRE
Journal :
Word Structure
Accession number :
edsair.doi...........37fcdeec89546423f24edc6742f651ab
Full Text :
https://doi.org/10.3366/word.2020.0172