Back to Search
Start Over
An Effective Dimension Reduction Approach to Chinese Document Classification Using Genetic Algorithm
- Source :
- Advances in Neural Networks – ISNN 2009 ISBN: 9783642015090, ISNN (2)
- Publication Year :
- 2009
- Publisher :
- Springer Berlin Heidelberg, 2009.
-
Abstract
- Different kinds of methods have been proposed in Chinese document classification, while high dimension of feature vector is one of the most significant limits in these methods. In this paper, an important difference is pointed out between Chinese document classification and English document classification. Then an efficient approach is proposed to reduce the dimension of feature vector in Chinese document classification using Genetic Algorithm. Through merely choosing the set of much more "important" features, the proposed method significantly reduces the number of Chinese feature words. Experiments combining with several relative studies show that the proposed method has great effect on dimension reduction with little loss in correctly classified rate.
- Subjects :
- business.industry
Dimensionality reduction
Feature vector
Document classification
Linear classifier
Pattern recognition
computer.software_genre
Effective dimension
Reduction (complexity)
Dimension (vector space)
Feature (machine learning)
Data mining
Artificial intelligence
business
computer
Mathematics
Subjects
Details
- ISBN :
- 978-3-642-01509-0
- ISBNs :
- 9783642015090
- Database :
- OpenAIRE
- Journal :
- Advances in Neural Networks – ISNN 2009 ISBN: 9783642015090, ISNN (2)
- Accession number :
- edsair.doi...........e1ddabeb98e1c2982a91d1d568656120
- Full Text :
- https://doi.org/10.1007/978-3-642-01510-6_55