1. 面向不平衡分类的IDP-SMOTE重采样算法.
- Author
-
盛 凯, 刘 忠, 周德超, and 冯成旭
- Subjects
- *
COMPUTER algorithms , *DENSITY , *AMBIGUITY , *ALGORITHMS , *CLASSIFICATION , *RESAMPLING (Statistics) - Abstract
In order to improve the classification accuracy of the minority samples, this paper proposed a novel resampling algorithm based on the improved density peaks clustering method, named IDP-SMOTE. First, it improved density peaks clustering algorithm by utilizing Box-Cox transformation and σ-rule for finding the clustering centers and outliers automatically. Second, it combined the improved density peaks clustering algorithm with SMOTE method. With removing the noisy data, the synthetic samples could be generated in the sub-class regions on the basis of the values of local density and nearest distance of the minority samples. The presented algorithm avoids the boundary ambiguity caused by over-sampling, improves the imbalance problem with-in class and reduces the learning difficulty of the boundary data. Meanwhile, it realizes automatic clustering and resampling, and avoids the interference of subjective factors. Through the contrast experiment, the proposed algorithm is effective and adaptive. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF