Back to Search Start Over

RUSBoost: A Hybrid Approach to Alleviating Class Imbalance.

Authors :
Seiffert, Chris
Khoshgoftaar, Taghi M.
van Hulse, Jason
Napolitano, Amri
Source :
IEEE Transactions on Systems, Man & Cybernetics: Part A; Jan2010, Vol. 40 Issue 1, p185-197, 13p, 1 Black and White Photograph, 1 Chart, 1 Graph
Publication Year :
2010

Abstract

Class imbalance is a problem that is common to many application domains. When examples of one class in a training data set vastly outnumber examples of the other class(es), traditional data mining algorithms tend to create suboptimal classification models. Several techniques have been used to alleviate the problem of class imbalance, including data sampling and boosting. In this paper, we present a new hybrid sampling/boosting algorithm, called RUSBoost, for learning from skewed training data. This algorithm provides a simpler and faster alternative to SMOTEBoost, which is another algorithm that combines boosting and data sampling. This paper evaluates the performances of RUSBoost and SMOTEBoost, as well as their individual components (random undersampling, synthetic minority oversampling technique, and AdaBoost). We conduct experiments using 15 data sets from various application domains, four base learners, and four evaluation metrics. RUSBoost and SMOTEBoost both outperform the other procedures, and RUSBoost performs comparably to (and often better than) SMOTEBoost while being a simpler and faster technique. Given these experimental results, we highly recommend RUSBoost as an attractive alternative for improving the classification performance of learners built using imbalanced data. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10834427
Volume :
40
Issue :
1
Database :
Complementary Index
Journal :
IEEE Transactions on Systems, Man & Cybernetics: Part A
Publication Type :
Academic Journal
Accession number :
47877964
Full Text :
https://doi.org/10.1109/TSMCA.2009.2029559