Back to Search
Start Over
Detection and Correction of Non-Words in Arabic: A Hybrid Approach
- Source :
- International Journal of Computer Processing of Languages. 20:237-257
- Publication Year :
- 2007
- Publisher :
- World Scientific Pub Co Pte Lt, 2007.
-
Abstract
- As Arabic is known for its highly inflectional morphological structure, this hybrid approach is utilizing morphological knowledge in form of consistent root-pattern relationships, and some morpho-syntactical knowledge based on affixation and morphographemic rules to specify the word recognition and non-word correction process. Furthermore this paper is proposing novel probabilistic measures for completing the task of the correction by locating, reducing and ranking of the most probable correction candidates in Arabic derivative words. In this context based on frequency of occurrence analysis, two probabilistic measures are introduced, Root-Pattern Predictive Value, RPV, and Pattern-Root Predictive Value, PPV. Moreover, keyboard effect, letter sound and similarity are considered in addition to some lexical features as a supplementary aid to improve the process of error detection and correction.
- Subjects :
- Structure (mathematical logic)
Similarity (geometry)
Computer science
business.industry
Process (computing)
Probabilistic logic
Pattern recognition
computer.software_genre
Ranking (information retrieval)
Task (project management)
Word recognition
Artificial intelligence
Error detection and correction
business
computer
Natural language processing
Subjects
Details
- ISSN :
- 20100205 and 17938406
- Volume :
- 20
- Database :
- OpenAIRE
- Journal :
- International Journal of Computer Processing of Languages
- Accession number :
- edsair.doi...........a3b46ea259c3e48d19aaa92d8e0950c5
- Full Text :
- https://doi.org/10.1142/s0219427907001706