Point the Point: Uyghur Morphological Segmentation Using PointerNetwork with GRU

Authors :
Huaping Zhang
Shuqin Li
Yaofei Yang
Yangsen Zhang
Source :
Lecture Notes in Computer Science ISBN: 9783030323806, CCL
Publication Year :
2019
Publisher :
Springer International Publishing, 2019.

Abstract

Uyghur is an agglutinative language with many morphemes. Processing Uyghur therefore requires segmenting words into morphemes, a task called morphological segmentation. Previous work treats morphological segmentation as a tagging task and classifies each character into one of four classes, \(\{b,m,e,s\}\). However, these labels are not independent of each other, which makes the models prone to overfitting. We propose a new method for the segmentation task: instead of using these labels, we model only the segmentation points. The model used in our method is more robust and easier to train than previous methods. Applied to Uyghur morphological segmentation, our model achieves high accuracy, and higher recall and F1 score than previous models.
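
The contrast between the two target representations mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' model or code; it only shows how one segmented word maps to per-character \(\{b,m,e,s\}\) labels versus a list of segmentation points. The function names, the reading of the labels as begin/middle/end/single, and the Latin-script example word "kitablar" (split as kitab + lar) are assumptions made for illustration.

```python
def bmes_tags(morphemes):
    """Label every character as b (begin), m (middle), e (end) or s (single).

    Assumption: this is the usual reading of the four tagging classes.
    """
    tags = []
    for morpheme in morphemes:
        if len(morpheme) == 1:
            tags.append("s")
        else:
            tags.extend(["b"] + ["m"] * (len(morpheme) - 2) + ["e"])
    return tags


def segmentation_points(morphemes):
    """Return the character offsets at which the word is cut."""
    points = []
    offset = 0
    for morpheme in morphemes[:-1]:  # no cut after the last morpheme
        offset += len(morpheme)
        points.append(offset)
    return points


def split_at(word, points):
    """Rebuild the morpheme list from a word and its segmentation points."""
    bounds = [0] + list(points) + [len(word)]
    return [word[start:end] for start, end in zip(bounds, bounds[1:])]


# Illustrative example (assumed transliteration, not taken from the paper).
morphemes = ["kitab", "lar"]
word = "".join(morphemes)

tags = bmes_tags(morphemes)              # one label per character
points = segmentation_points(morphemes)  # one offset per boundary
restored = split_at(word, points)        # round trip back to morphemes
```

In the tagging view, a valid output must satisfy constraints between neighboring labels (for example, a b must eventually be followed by an e), whereas the point view only needs a set of offsets, and any such set yields a well-formed segmentation.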

Details

ISBN :
978-3-030-32380-6
Database :
OpenAIRE
Accession number :
edsair.doi...........25cdd34a00b07157712afdfce7090a01
Full Text :
https://doi.org/10.1007/978-3-030-32381-3_30