Back to Search
Start Over
Classifying very-high-dimensional data with random forests of oblique decision trees
- Source :
- Advances in knowledge discovery and management, Advances in knowledge discovery and management, Springer, pp.39-55, 2010, vol. 292-Studies in computational intelligence, 978-3-642-00579-4
- Publication Year :
- 2010
- Publisher :
- HAL CCSD, 2010.
-
Abstract
- International audience; The random forests method is one of the most successful ensemble methods. However, random forests do not have high performance when dealing with very-high-dimensional data in presence of dependencies. In this case one can expect that there exist many combinations between the variables and unfortunately the usual random forests method does not effectively exploit this situation. We here investigate a new approach for supervised classification with a huge number of numerical attributes. We propose a random oblique decision trees method. It consists of randomly choosing a subset of predictive attributes and it uses SVM as a split function of these attributes.We compare, on 25 datasets, the effectiveness with classical measures (e.g. precision, recall, F1-measure and accuracy) of random forests of random oblique decision trees with SVMs and random forests of C4.5. Our proposal has significant better performance on very-high-dimensional datasets with slightly better results on lower dimensional datasets.
- Subjects :
- [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]
Oblique decision trees
[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]
[ INFO.INFO-DB ] Computer Science [cs]/Databases [cs.DB]
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
[INFO.INFO-DB] Computer Science [cs]/Databases [cs.DB]
[INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG]
Random forests
[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]
Very-high-dimensional data
[ INFO.INFO-LG ] Computer Science [cs]/Machine Learning [cs.LG]
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
Subjects
Details
- Language :
- English
- ISBN :
- 978-3-642-00579-4
- ISBNs :
- 9783642005794
- Database :
- OpenAIRE
- Journal :
- Advances in knowledge discovery and management, Advances in knowledge discovery and management, Springer, pp.39-55, 2010, vol. 292-Studies in computational intelligence, 978-3-642-00579-4
- Accession number :
- edsair.dedup.wf.001..83846bbe53b483086bb5e4d0b6b0f3a2