Back to Search
Start Over
Point the Point: Uyghur Morphological Segmentation Using PointerNetwork with GRU
- Source :
- Lecture Notes in Computer Science ISBN: 9783030323806, CCL
- Publication Year :
- 2019
- Publisher :
- Springer International Publishing, 2019.
-
Abstract
- Uyghur is an agglutinative language that has many morphemes. It is necessary for processing Uyghur to segment words into morphemes. This work is called morphological segmentation. Previous works treat morphological segmentation as a tagging task and classify each character as one of four classes, which are \(\{b,m,e,s\}\). However, these labels are not independent from each other, which makes the models easily overfitted. We propose a new method for the segmentation task. Instead of using these labels, we use only segmentation points for modeling. The model used in our method is more robust and easier to train than previous methods. Applying our model to Uyghur morphological segmentation, it achieves high accuracy and higher recall and f1 score than previous models.
- Subjects :
- Agglutinative language
Point (typography)
Computer science
business.industry
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Pattern recognition
Task (project management)
Character (mathematics)
Morpheme
Segmentation
Artificial intelligence
business
F1 score
Morphological segmentation
Subjects
Details
- ISBN :
- 978-3-030-32380-6
- ISBNs :
- 9783030323806
- Database :
- OpenAIRE
- Journal :
- Lecture Notes in Computer Science ISBN: 9783030323806, CCL
- Accession number :
- edsair.doi...........25cdd34a00b07157712afdfce7090a01
- Full Text :
- https://doi.org/10.1007/978-3-030-32381-3_30