1. A multi-objective genetic algorithm for text feature selection using the relative discriminative criterion
- Author
-
Parham Moradi, Mahdi Jalili, and Mahdieh Labani
- Subjects
0209 industrial biotechnology ,Computer science ,business.industry ,General Engineering ,Pattern recognition ,Feature selection ,02 engineering and technology ,Filter (signal processing) ,Class (biology) ,Computer Science Applications ,020901 industrial engineering & automation ,Discriminative model ,Artificial Intelligence ,Genetic algorithm ,0202 electrical engineering, electronic engineering, information engineering ,020201 artificial intelligence & image processing ,Relevance (information retrieval) ,Artificial intelligence ,business - Abstract
With exponentially increasing the number of digital documents, text classification has become a major task in data science applications. Selecting discriminative features highly relevant to class labels while having low levels of redundancy is essential to improve the performance of text classification methods. In this paper, we propose a novel multi-objective algorithm for text feature selection, called Multi-Objective Relative Discriminative Criterion (MORDC), which balances minimal redundant features against those maximally relevant to the target class. The proposed method employs a multi-objective evolutionary framework to search through the solution space. The first objective function measures the relevance of the text features to the target class, whereas the second one evaluates the correlation between the features. None of these objectives use learning to evaluate the goodness of the selected features; thus, the proposed method can be classified as a multivariate filter method. In order to assess the effectiveness of the proposed method, several experiments are performed on three real-world datasets. Comparisons with state-of-the-art feature selection methods show that in most cases MORDC results in better classification performance.
- Published
- 2020