Maximum a posteriori pruning on decision trees and its application to bootstrap BUMPing
- Source :
- Computational Statistics & Data Analysis. 50:710-719
- Publication Year :
- 2006
- Publisher :
- Elsevier BV, 2006.
Abstract
- Cost-complexity pruning generates a sequence of nested subtrees and selects the best one. However, its computational cost is high because it relies on a holdout sample or cross-validation. Pruning algorithms based on posterior calculations, such as BIC (MDL) and MEP, are faster, but they sometimes produce trees that are too large or too small, which leads to poor generalization error. In this paper, we propose an alternative pruning procedure that combines the ideas of cost-complexity pruning and posterior calculation. The proposed algorithm uses only the training sample, so its computational cost is almost the same as that of the other posterior-based algorithms, while its accuracy is similar to that of cost-complexity pruning. Moreover, it can be used to compare non-nested trees, which is necessary for the BUMPing procedure. Empirical results show that the proposed algorithm performs similarly to cost-complexity pruning in standard situations and works better for BUMPing.
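To make the idea of comparing non-nested trees from training data alone more concrete, here is a minimal sketch in Python. It is not the MAP criterion proposed in the paper, whose details are not given in this record; it uses a generic BIC-style penalized likelihood as a stand-in. The tree representation (a list of per-leaf class counts), the function names, and the example counts are all hypothetical and chosen only for illustration.

```python
import math

def leaf_log_likelihood(counts):
    """Multinomial log-likelihood of one leaf at its maximum-likelihood class frequencies."""
    n = sum(counts)
    return sum(c * math.log(c / n) for c in counts if c > 0)

def bic_score(leaves):
    """BIC-style score (lower is better) for a tree given as a list of per-leaf class counts.

    Assumption: each leaf contributes (n_classes - 1) free parameters.
    This is a generic stand-in, not the criterion from the paper.
    """
    n_total = sum(sum(counts) for counts in leaves)
    n_classes = len(leaves[0])
    log_lik = sum(leaf_log_likelihood(counts) for counts in leaves)
    n_params = len(leaves) * (n_classes - 1)
    return -2.0 * log_lik + n_params * math.log(n_total)

def select_tree(candidates):
    """Return the name of the candidate tree with the lowest score."""
    return min(candidates, key=lambda name: bic_score(candidates[name]))

# Hypothetical candidates fitted to the same 100 training points (50 per class).
# The two split trees use different root splits, so neither is a subtree of the other.
candidates = {
    "root_only": [[50, 50]],
    "split_on_x1": [[45, 5], [5, 45]],
    "split_on_x2_then_x3": [[30, 10], [15, 5], [5, 35]],
}
best = select_tree(candidates)
print(best)
```

Because the score depends only on training-set leaf counts, any two trees can be ranked against each other without a holdout sample, which is the property a BUMPing-style procedure needs when it picks among trees grown on different bootstrap samples.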
- Subjects :
- Statistics and Probability
Generalization
Applied Mathematics
Posterior probability
Decision tree
Machine learning
Computational Mathematics
Computational Theory and Mathematics
Principal variation search
Maximum a posteriori estimation
Null-move heuristic
Pruning (decision trees)
Artificial intelligence
Algorithm
Mathematics
Killer heuristic
Details
- ISSN :
- 0167-9473
- Volume :
- 50
- Database :
- OpenAIRE
- Journal :
- Computational Statistics & Data Analysis
- Accession number :
- edsair.doi...........88a060bdce57228bed7c219d274d2bbc
- Full Text :
- https://doi.org/10.1016/j.csda.2004.09.010