Back to Search Start Over

Hybrid Feature Selection Method Based on Feature Subset and Factor Analysis

Authors :
Lizeng Gong
Shanshan Xie
Yan Zhang
Mengyao Wang
Xiaoyan Wang
Source :
IEEE Access, Vol 10, Pp 120792-120803 (2022)
Publication Year :
2022
Publisher :
IEEE, 2022.

Abstract

With the advent of big data era and the rapid improvement of raw data scale, feature selection, as the basis and critical technologies for data mining, plays an increasingly important role. However, most studies on feature selection methods, mainly directed to treat the single feature or overall feature subset, while the influence of the correlation and redundancy of features in the feature subset on the classification results is ignored. In this paper, a hybrid feature selection method based on feature subsets generated by factor analysis (FAFS_HFS) is proposed. Firstly, this method generates feature subsets from the maximum load (maximum explanatory power) of each feature through factor analysis. Then, minimal redundancy and maximal relevance (mRMR) and sequential forward selection (SFS) are used to remove the redundancy of each feature subset. Finally, fisher score based on feature subset (FSF-score) is utilized to evaluate and obtain the optimal feature subsets. Experiments are conducted on 14 datasets, the results show that FAFS_HFS method has higher classification accuracy and lower dimension on almost all datasets, especially in high-dimensional datasets, and it has competitive efficiency and classification performance compared with other methods.

Details

Language :
English
ISSN :
21693536
Volume :
10
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.b717ee46f866411c86a4a71cf238edc2
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2022.3222812