Effective data generation for imbalanced learning using conditional generative adversarial networks.

Authors :
Douzas, Georgios
Bacao, Fernando
Source :
Expert Systems with Applications. Jan2018, Vol. 91, p464-471. 8p.
Publication Year :
2018

Abstract

Learning from imbalanced datasets is a frequent but challenging task for standard classification algorithms. Although there are different strategies to address this problem, methods that generate artificial data for the minority class constitute a more general approach compared to algorithmic modifications. Standard oversampling methods are variations of the SMOTE algorithm, which generates synthetic samples along the line segment that joins minority class samples. Therefore, these approaches are based on local information, rather than on the overall minority class distribution. Contrary to these algorithms, in this paper the conditional version of Generative Adversarial Networks (cGAN) is used to approximate the true data distribution and generate data for the minority class of various imbalanced datasets. The performance of cGAN is compared against multiple standard oversampling algorithms. We present empirical results that show a significant improvement in the quality of the generated data when cGAN is used as an oversampling algorithm. [ABSTRACT FROM AUTHOR]
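
Illustrative note (not part of the author abstract): the line-segment interpolation that the abstract attributes to SMOTE-style oversampling can be sketched roughly as follows. This is a minimal NumPy sketch under stated assumptions, not the implementation evaluated in the paper; the function name `smote_like_samples` and the parameters `k` and `seed` are hypothetical, and the cGAN approach itself is not shown.

```python
import numpy as np


def smote_like_samples(minority, n_new, k=5, seed=0):
    """Sketch of SMOTE-style interpolation (hypothetical helper).

    Each synthetic point lies on the line segment between a randomly chosen
    minority sample and one of its k nearest minority-class neighbours.
    Requires at least two minority samples.
    """
    rng = np.random.default_rng(seed)
    minority = np.asarray(minority, dtype=float)
    n = len(minority)
    k = min(k, n - 1)  # cannot have more neighbours than other samples

    # Pairwise Euclidean distances between minority samples.
    diffs = minority[:, None, :] - minority[None, :, :]
    dists = np.linalg.norm(diffs, axis=2)
    np.fill_diagonal(dists, np.inf)  # exclude each point from its own neighbours

    # Indices of the k nearest neighbours of every minority sample.
    neighbours = np.argsort(dists, axis=1)[:, :k]

    # Pick a base sample and one of its neighbours for every new point.
    base = rng.integers(0, n, size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]

    # Interpolate at a random position along the joining segment.
    gap = rng.random((n_new, 1))
    return minority[base] + gap * (minority[picked] - minority[base])
```

Because every new point is a convex combination of two nearby minority samples, the sketch only uses local information, which is the limitation the abstract contrasts with modelling the overall minority class distribution.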

Details

Language :
English
ISSN :
09574174
Volume :
91
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
125488745
Full Text :
https://doi.org/10.1016/j.eswa.2017.09.030