Back to Search
Start Over
Decision tree classification algorithm for non-equilibrium data set based on random forests.
- Source :
-
Journal of Intelligent & Fuzzy Systems . 2020, Vol. 39 Issue 2, p1639-1648. 10p. - Publication Year :
- 2020
-
Abstract
- In order to overcome the problems of poor accuracy and high complexity of current classification algorithm for non-equilibrium data set, this paper proposes a decision tree classification algorithm for non-equilibrium data set based on random forest. Wavelet packet decomposition is used to denoise non-equilibrium data, and SNM algorithm and RFID are combined to remove redundant data from data sets. Based on the results of data processing, the non-equilibrium data sets are classified by random forest method. According to Bootstrap resampling method with certain constraints, the majority and minority samples of each sample subset are sampled, CART is used to train the data set, and a decision tree is constructed. Obtain the final classification results by voting on the CART decision tree classification. Experimental results show that the proposed algorithm has the characteristics of high classification accuracy and low complexity, and it is a feasible classification algorithm for non-equilibrium data set. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 10641246
- Volume :
- 39
- Issue :
- 2
- Database :
- Academic Search Index
- Journal :
- Journal of Intelligent & Fuzzy Systems
- Publication Type :
- Academic Journal
- Accession number :
- 145429407
- Full Text :
- https://doi.org/10.3233/JIFS-179937