Back to Search
Start Over
Distance based resampling of imbalanced classes: With an application example of speech quality assessment
- Source :
- Engineering Applications of Artificial Intelligence. 64:440-461
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- This paper presents a new general methodological approach to imbalanced learning as one of the challenging problems in pattern classification. The presented method is founded on maximization of the sample entropy. The method involves detection of distributive properties of ideally balanced regular lattice sample and acceptable transfer of these properties to an arbitrary imbalanced sample increasing its representativeness. The proposed procedure assumes undersampling applied on areas of high probability density in the sample space combined with oversampling in the areas of low density. The main achievement of this method is the increased sample class entropy which reduces the inductive learner’s tendency to favor prominent class, or cluster. In addition to class balancing, this method can be useful for function approximation, clustering, and sample dimension reduction. The high degree of generality of the method implies its applicability on data of various complexity and imbalance. The presented theoretical foundation of the method was verified on a set of proper synthetic samples. The method’s practical usability is confirmed by a comparative classification of a large set of databases including speech signal samples.
- Subjects :
- Computer science
02 engineering and technology
Entropy (classical thermodynamics)
Artificial Intelligence
020204 information systems
Resampling
0202 electrical engineering, electronic engineering, information engineering
Entropy (information theory)
Electrical and Electronic Engineering
Entropy (energy dispersal)
Cluster analysis
Entropy (arrow of time)
business.industry
Entropy (statistical thermodynamics)
Dimensionality reduction
Pattern recognition
Maximization
Binomial distribution
Sample entropy
Function approximation
Control and Systems Engineering
Undersampling
Sample space
020201 artificial intelligence & image processing
Artificial intelligence
business
Entropy (order and disorder)
Subjects
Details
- ISSN :
- 09521976
- Volume :
- 64
- Database :
- OpenAIRE
- Journal :
- Engineering Applications of Artificial Intelligence
- Accession number :
- edsair.doi...........69dc98f86b6b6335fc3dd544bf5594e4
- Full Text :
- https://doi.org/10.1016/j.engappai.2017.07.001