Back to Search Start Over

Distribution-Sensitive Unbalanced Data Oversampling Method for Medical Diagnosis.

Authors :
Han, Weihong
Huang, Zizhong
Li, Shudong
Jia, Yan
Source :
Journal of Medical Systems; Feb2019, Vol. 43 Issue 2, p1-1, 1p, 2 Diagrams, 1 Chart, 4 Graphs
Publication Year :
2019

Abstract

Aiming at the problem of low accuracy of classification learning algorithm caused by serious imbalance of sample set in medical diagnostic application, this paper proposes a distribution-sensitive oversampling algorithm for imbalanced data. The algorithm accurately divides the minority samples into noise samples, unstable samples, boundary samples and stable samples according to the location of the minority samples. Different samples are processed differently to select the most suitable sample for the synthesis of new samples. In the case of sample synthesis, a distribution-sensitive sample synthesis method is adopted. Different sample synthesis methods are selected according to their different distance from the surrounding minority samples, so as to ensure that the newly synthesized samples have the same characteristics with the original minority samples. The real medical diagnostic data test shows that this algorithm improves the accuracy rate of classification learning algorithm compared with the existing sampling algorithms, especially for the accuracy rate and recall rate of minority classes. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01485598
Volume :
43
Issue :
2
Database :
Complementary Index
Journal :
Journal of Medical Systems
Publication Type :
Academic Journal
Accession number :
134561919
Full Text :
https://doi.org/10.1007/s10916-018-1154-8