Back to Search Start Over

Decision tree classification algorithm for non-equilibrium data set based on random forests.

Authors :
Wang, Peng
Zhang, Ningchao
Patnaik, Srikanta
Source :
Journal of Intelligent & Fuzzy Systems. 2020, Vol. 39 Issue 2, p1639-1648. 10p.
Publication Year :
2020

Abstract

In order to overcome the problems of poor accuracy and high complexity of current classification algorithm for non-equilibrium data set, this paper proposes a decision tree classification algorithm for non-equilibrium data set based on random forest. Wavelet packet decomposition is used to denoise non-equilibrium data, and SNM algorithm and RFID are combined to remove redundant data from data sets. Based on the results of data processing, the non-equilibrium data sets are classified by random forest method. According to Bootstrap resampling method with certain constraints, the majority and minority samples of each sample subset are sampled, CART is used to train the data set, and a decision tree is constructed. Obtain the final classification results by voting on the CART decision tree classification. Experimental results show that the proposed algorithm has the characteristics of high classification accuracy and low complexity, and it is a feasible classification algorithm for non-equilibrium data set. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10641246
Volume :
39
Issue :
2
Database :
Academic Search Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
145429407
Full Text :
https://doi.org/10.3233/JIFS-179937