Back to Search Start Over

Distance based resampling of imbalanced classes: With an application example of speech quality assessment

Authors :
Silvana Punišić
Slobodan T. Jovicic
Miško Subotić
Srdjan S. Stankovic
Drasko Furundzic
Source :
Engineering Applications of Artificial Intelligence. 64:440-461
Publication Year :
2017
Publisher :
Elsevier BV, 2017.

Abstract

This paper presents a new general methodological approach to imbalanced learning as one of the challenging problems in pattern classification. The presented method is founded on maximization of the sample entropy. The method involves detection of distributive properties of ideally balanced regular lattice sample and acceptable transfer of these properties to an arbitrary imbalanced sample increasing its representativeness. The proposed procedure assumes undersampling applied on areas of high probability density in the sample space combined with oversampling in the areas of low density. The main achievement of this method is the increased sample class entropy which reduces the inductive learner’s tendency to favor prominent class, or cluster. In addition to class balancing, this method can be useful for function approximation, clustering, and sample dimension reduction. The high degree of generality of the method implies its applicability on data of various complexity and imbalance. The presented theoretical foundation of the method was verified on a set of proper synthetic samples. The method’s practical usability is confirmed by a comparative classification of a large set of databases including speech signal samples.

Details

ISSN :
09521976
Volume :
64
Database :
OpenAIRE
Journal :
Engineering Applications of Artificial Intelligence
Accession number :
edsair.doi...........69dc98f86b6b6335fc3dd544bf5594e4
Full Text :
https://doi.org/10.1016/j.engappai.2017.07.001