Back to Search Start Over

A HYBRID LANGUAGE MODEL BASED ON STATISTICS AND LINGUISTIC RULES.

Authors :
Xiaolong Wang
Yeung, Daniel S.
Liu, James N. K.
Robert Luk
Xuan Wang
Source :
International Journal of Pattern Recognition & Artificial Intelligence. Feb2005, Vol. 19 Issue 1, p109-128. 20p.
Publication Year :
2005

Abstract

Language modeling is a current research topic in many domains including speech recognition, optical character recognition, handwriting recognition, machine translation and spelling correction. There are two main types of language models, the mathematical and the linguistic. The most widely used mathematical language model is the n-gram model inferred from statistics. This model has three problems: long distance restriction, recursive nature and partial language understanding. Language models based on linguistics present many difficulties when applied to large scale real texts. We present here a new hybrid language model that combines the advantages of the n-gram statistical language model with those of a linguistic language model which makes use of grammatical or semantic rules. Using suitable rules, this hybrid model can solve problems such as long distance restriction, recursive nature and partial language understanding. The new language model has been effective in experiments and has been incorporated in Chinese sentence input products for Windows and Macintosh OS. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02180014
Volume :
19
Issue :
1
Database :
Academic Search Index
Journal :
International Journal of Pattern Recognition & Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
15910916
Full Text :
https://doi.org/10.1142/S0218001405003934