Back to Search Start Over

Bi-LSTM Model for Morpheme Segmentation of Russian Words

Authors :
Elena I. Bolshakova
Alexander S. Sapin
Source :
Communications in Computer and Information Science ISBN: 9783030345174
Publication Year :
2019
Publisher :
Springer International Publishing, 2019.

Abstract

The paper addresses the task of automatic morpheme segmentation involving both splitting words into morphs and classification of resulted morphs. For segmentation of Russian words, a new model based on Bi-LSTM neural network is proposed and experimentally evaluated on several training data sets differing in labeling. The proposed model has comparable quality with the best supervised machine learning models for morpheme segmentation with classification, slightly outperforming them in word-level classification accuracy with score 89%.

Details

ISBN :
978-3-030-34517-4
ISBNs :
9783030345174
Database :
OpenAIRE
Journal :
Communications in Computer and Information Science ISBN: 9783030345174
Accession number :
edsair.doi...........d19ef91cd6994e07c71534e6c557bf2e
Full Text :
https://doi.org/10.1007/978-3-030-34518-1_11