Detection and Correction of Non-Words in Arabic: A Hybrid Approach

Authors :
Mustafa Yaseen
Bassam Haddad
Source :
International Journal of Computer Processing of Languages. 20:237-257
Publication Year :
2007
Publisher :
World Scientific Pub Co Pte Lt, 2007.

Abstract

Because Arabic has a highly inflectional morphology, the hybrid approach presented here uses morphological knowledge, in the form of consistent root-pattern relationships, together with morpho-syntactic knowledge based on affixation and morphographemic rules to guide word recognition and non-word correction. The paper also proposes novel probabilistic measures that complete the correction task by locating, reducing, and ranking the most probable correction candidates for Arabic derivative words. Based on frequency-of-occurrence analysis, two such measures are introduced: the Root-Pattern Predictive Value (RPV) and the Pattern-Root Predictive Value (PPV). In addition, keyboard effect, letter sound and similarity, and some lexical features are considered as supplementary aids to error detection and correction.
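The abstract does not define RPV or PPV, so the following is only a minimal sketch of how frequency-based measures of this kind might rank correction candidates. It assumes, purely for illustration, that RPV is the relative frequency of a pattern given a root and PPV the relative frequency of a root given a pattern. The counts, the transliterated roots and patterns, and the function names are all hypothetical and are not taken from the paper.

```python
from collections import Counter

# Hypothetical toy counts of (root, pattern) co-occurrences.
# These are illustrative values, not data from the paper.
PAIR_COUNTS = Counter({
    ("ktb", "fa3il"): 6,
    ("ktb", "maf3ul"): 2,
    ("drs", "fa3il"): 3,
    ("drs", "maf3ul"): 1,
})


def root_total(root):
    """Total occurrences of a root across all patterns."""
    return sum(c for (r, _), c in PAIR_COUNTS.items() if r == root)


def pattern_total(pattern):
    """Total occurrences of a pattern across all roots."""
    return sum(c for (_, p), c in PAIR_COUNTS.items() if p == pattern)


def rpv(root, pattern):
    """Assumed RPV: share of the root's occurrences that use this pattern."""
    total = root_total(root)
    return PAIR_COUNTS[(root, pattern)] / total if total else 0.0


def ppv(root, pattern):
    """Assumed PPV: share of the pattern's occurrences that use this root."""
    total = pattern_total(pattern)
    return PAIR_COUNTS[(root, pattern)] / total if total else 0.0


def rank_candidates(candidates):
    """Order (root, pattern) candidates by the product of both scores.

    Combining the two scores by multiplication is an assumption of this
    sketch, not a rule stated in the abstract.
    """
    return sorted(candidates, key=lambda rp: rpv(*rp) * ppv(*rp), reverse=True)


candidates = [("drs", "maf3ul"), ("ktb", "maf3ul"), ("ktb", "fa3il")]
best = rank_candidates(candidates)[0]
```

With these toy counts, the candidate whose root and pattern co-occur most consistently is ranked first. A real system would estimate the counts from a corpus and would first generate candidates through morphological analysis of the misspelled word.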

Details

ISSN :
2010-0205 and 1793-8406
Volume :
20
Database :
OpenAIRE
Journal :
International Journal of Computer Processing of Languages
Accession number :
edsair.doi...........a3b46ea259c3e48d19aaa92d8e0950c5
Full Text :
https://doi.org/10.1142/s0219427907001706