Back to Search Start Over

Constructing and validating readability models: the method of integrating multilevel linguistic features with machine learning.

Authors :
Sung, Yao-Ting
Chen, Ju-Ling
Cha, Ji-Her
Tseng, Hou-Chiang
Chang, Tao-Hsing
Chang, Kuo-En
Source :
Behavior Research Methods. Jun2015, Vol. 47 Issue 2, p340-354. 15p.
Publication Year :
2015

Abstract

Multilevel linguistic features have been proposed for discourse analysis, but there have been few applications of multilevel linguistic features to readability models and also few validations of such models. Most traditional readability formulae are based on generalized linear models (GLMs; e.g., discriminant analysis and multiple regression), but these models have to comply with certain statistical assumptions about data properties and include all of the data in formulae construction without pruning the outliers in advance. The use of such readability formulae tends to produce a low text classification accuracy, while using a support vector machine (SVM) in machine learning can enhance the classification outcome. The present study constructed readability models by integrating multilevel linguistic features with SVM, which is more appropriate for text classification. Taking the Chinese language as an example, this study developed 31 linguistic features as the predicting variables at the word, semantic, syntax, and cohesion levels, with grade levels of texts as the criterion variable. The study compared four types of readability models by integrating unilevel and multilevel linguistic features with GLMs and an SVM. The results indicate that adopting a multilevel approach in readability analysis provides a better representation of the complexities of both texts and the reading comprehension process. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1554351X
Volume :
47
Issue :
2
Database :
Academic Search Index
Journal :
Behavior Research Methods
Publication Type :
Academic Journal
Accession number :
102601524
Full Text :
https://doi.org/10.3758/s13428-014-0459-x