Back to Search
Start Over
Hybrid Feature Selection Method Based on Feature Subset and Factor Analysis
- Source :
- IEEE Access, Vol 10, Pp 120792-120803 (2022)
- Publication Year :
- 2022
- Publisher :
- IEEE, 2022.
-
Abstract
- With the advent of big data era and the rapid improvement of raw data scale, feature selection, as the basis and critical technologies for data mining, plays an increasingly important role. However, most studies on feature selection methods, mainly directed to treat the single feature or overall feature subset, while the influence of the correlation and redundancy of features in the feature subset on the classification results is ignored. In this paper, a hybrid feature selection method based on feature subsets generated by factor analysis (FAFS_HFS) is proposed. Firstly, this method generates feature subsets from the maximum load (maximum explanatory power) of each feature through factor analysis. Then, minimal redundancy and maximal relevance (mRMR) and sequential forward selection (SFS) are used to remove the redundancy of each feature subset. Finally, fisher score based on feature subset (FSF-score) is utilized to evaluate and obtain the optimal feature subsets. Experiments are conducted on 14 datasets, the results show that FAFS_HFS method has higher classification accuracy and lower dimension on almost all datasets, especially in high-dimensional datasets, and it has competitive efficiency and classification performance compared with other methods.
Details
- Language :
- English
- ISSN :
- 21693536
- Volume :
- 10
- Database :
- Directory of Open Access Journals
- Journal :
- IEEE Access
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.b717ee46f866411c86a4a71cf238edc2
- Document Type :
- article
- Full Text :
- https://doi.org/10.1109/ACCESS.2022.3222812