Back to Search Start Over

Combination of Rough Sets and Genetic Algorithms for Text Classification.

Authors :
Carbonell, Jaime G.
Siekmann, Jörg
Gorodetsky, Vladimir
Zhang, Chengqi
Skormin, Victor A.
Longbing Cao
Rujiang Bai
Xiaoyue Wang
Junhua Liao
Source :
Autonomous Intelligent Systems: Multi-Agents & Data Mining; 2007, p256-268, 13p
Publication Year :
2007

Abstract

Automatic categorization of documents into pre-defined taxonomies is a crucial step in data mining and knowledge discovery. Standard machine learning techniques like support vector machines(SVM) and related large margin methods have been successfully applied for this task. Unfortunately, the high dimensionality of input feature vectors impacts on the classification speed. The kernel parameters setting for SVM in a training process impacts on the classification accuracy. Feature selection is another factor that impacts classification accuracy. The objective of this work is to reduce the dimension of feature vectors, optimizing the parameters to improve the SVM classification accuracy and speed. In order to improve classification speed we spent rough sets theory to reduce the feature vector space. We present a genetic algorithm approach for feature selection and parameters optimization to improve classification accuracy. Experimental results indicate our method is more effective than traditional SVM methods and other traditional methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540728382
Database :
Supplemental Index
Journal :
Autonomous Intelligent Systems: Multi-Agents & Data Mining
Publication Type :
Book
Accession number :
33213886
Full Text :
https://doi.org/10.1007/978-3-540-72839-9_21