Back to Search
Start Over
Constructing a Novel Chinese Readability Classification Model Using Principal Component Analysis and Genetic Programming
- Source :
- ICALT
- Publication Year :
- 2012
- Publisher :
- IEEE, 2012.
-
Abstract
- The studies of readability aim to measure the level of text difficulty. Although traditional formulae such as the Flesch-Kincaid formula can properly predict text readability, they are only effective for English text. Other formulae with very few features may result in inaccurate text classification. The study takes into account multiple linguistic features, and attempts to increase the level of accuracy in text classification by adopting a new model which integrates Principal Component Analysis (PCA) with Genetic Programming (GP). Empirical data are utilized to demonstrate the performance of the proposed model.
- Subjects :
- Measure (data warehouse)
Empirical data
business.industry
Computer science
Genetic programming
computer.software_genre
Readability
Support vector machine
Text mining
Principal component analysis
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
Artificial intelligence
business
computer
Natural language processing
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2012 IEEE 12th International Conference on Advanced Learning Technologies
- Accession number :
- edsair.doi...........2bfecc9cf81effe381a165c8c08d0f55
- Full Text :
- https://doi.org/10.1109/icalt.2012.134