Uyghur Word Stemming Based on Stem and Affix Features
2017
Uyghur is an agglutinative language with complex morphology, and word stemming is one of the essentials in Uyghur information processing. However, the performance of Uyghur word-stem segmentation still leaves much room for improvement. In this study, stemming was performed on Uyghur words using an affix-occurred probability feature, which provided the stemming accuracy of 88.59% for a baseline system. The performance of this stemmer was further improved by using parameter ‘α’ in combination with the proposed method.
Keywords:
- Correction
- Source
- Cite
- Save
- Machine Reading By IdeaReader
12
References
1
Citations
NaN
KQI