Back to Search Start Over

Feature Selection Based on the Discriminative Significance for Sparse Binary-Valued and Imbalanced Dataset.

Authors :
Zhu, Qiuming
Source :
International Journal of Pattern Recognition & Artificial Intelligence; Mar2023, Vol. 37 Issue 3, p1-29, 29p
Publication Year :
2023

Abstract

Identifying the significant, or dominant, features is important to reveal the cause-and-effect relations in many pattern recognition applications, such as medical diagnosis, gene analysis, cyber security, finance and insurance fraud detection, etc. Samples that are sparsely populated and binary-valued in highly imbalanced datasets pose a challenge to the identification of these features. This paper explores an approach based on the confusion matrix measurement of the feature values with respect to their potential classification outcomes. The approach is able to compute the Discriminative Significances of the features and rank the features unbiasedly with respect to the imbalance ratios of the datasets. Experiment results on real-world and experimental datasets show that the approach made consistent evaluations of the features and identified the most significant ones accordingly on the sparse and binary-valued samples of the class-imbalanced datasets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02180014
Volume :
37
Issue :
3
Database :
Complementary Index
Journal :
International Journal of Pattern Recognition & Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
162594936
Full Text :
https://doi.org/10.1142/S0218001423500088