Feature Selection Based on the Discriminative Significance for Sparse Binary-Valued and Imbalanced Dataset.
- Source :
- International Journal of Pattern Recognition & Artificial Intelligence; Mar 2023, Vol. 37, Issue 3, pp. 1-29 (29 pages)
- Publication Year :
- 2023
Abstract
- Identifying the significant, or dominant, features is important for revealing cause-and-effect relations in many pattern recognition applications, such as medical diagnosis, gene analysis, cyber security, and finance and insurance fraud detection. Sparsely populated, binary-valued samples in highly imbalanced datasets make these features hard to identify. This paper explores an approach based on confusion matrix measurements of the feature values with respect to their potential classification outcomes. The approach computes the Discriminative Significance of each feature and ranks the features without bias with respect to the imbalance ratio of the dataset. Experimental results on real-world and experimental datasets show that the approach evaluated the features consistently and identified the most significant ones on the sparse, binary-valued samples of the class-imbalanced datasets. [ABSTRACT FROM AUTHOR]
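
The abstract does not give the formula for Discriminative Significance, so the following is only a rough sketch of the general idea it describes: build a confusion matrix for each binary feature against the class label, then rank features by a score that is normalised per class. The score used here, |TPR - FPR| (Youden's J), is a stand-in chosen for illustration and is not the paper's measure. The function names, feature names, and toy data are hypothetical.

```python
from typing import Dict, List, Tuple


def feature_confusion(feature: List[int], labels: List[int]) -> Tuple[int, int, int, int]:
    """Treat feature value 1 as a 'positive' prediction and tally TP, FP, FN, TN."""
    tp = fp = fn = tn = 0
    for x, y in zip(feature, labels):
        if x == 1 and y == 1:
            tp += 1
        elif x == 1 and y == 0:
            fp += 1
        elif x == 0 and y == 1:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def stand_in_score(feature: List[int], labels: List[int]) -> float:
    """Stand-in score |TPR - FPR| (NOT the paper's Discriminative Significance).

    Both rates are computed within a single class, so duplicating
    samples of either class leaves the score unchanged.
    """
    tp, fp, fn, tn = feature_confusion(feature, labels)
    tpr = tp / (tp + fn) if (tp + fn) else 0.0
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return abs(tpr - fpr)


def rank_features(columns: Dict[str, List[int]], labels: List[int]) -> List[Tuple[str, float]]:
    """Return (feature name, score) pairs sorted from most to least discriminative."""
    scores = {name: stand_in_score(col, labels) for name, col in columns.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: 2 positive and 8 negative samples, sparse binary features.
labels = [1, 1, 0, 0, 0, 0, 0, 0, 0, 0]
columns = {
    "f_a": [1, 1, 0, 0, 0, 0, 0, 0, 0, 0],  # set only on the positive samples
    "f_b": [1, 0, 1, 0, 0, 0, 0, 0, 0, 0],  # set on one positive and one negative
    "f_c": [0, 0, 0, 0, 0, 0, 0, 0, 0, 1],  # set on one negative only
}
ranking = rank_features(columns, labels)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Normalising within each class is one simple way to keep a ranking from being dominated by the majority class. Whether the paper's measure works the same way cannot be determined from the abstract alone.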
Details
- Language :
- English
- ISSN :
- 02180014
- Volume :
- 37
- Issue :
- 3
- Database :
- Complementary Index
- Journal :
- International Journal of Pattern Recognition & Artificial Intelligence
- Publication Type :
- Academic Journal
- Accession number :
- 162594936
- Full Text :
- https://doi.org/10.1142/S0218001423500088