Back to Search
Start Over
Multivariate Discretization Based on Evolutionary Cut Points Selection for Classification.
- Source :
- IEEE Transactions on Cybernetics; Mar2016, Vol. 46 Issue 3, p595-608, 14p
- Publication Year :
- 2016
-
Abstract
- Discretization is one of the most relevant techniques for data preprocessing. The main goal of discretization is to transform numerical attributes into discrete ones to help the experts to understand the data more easily, and it also provides the possibility to use some learning algorithms which require discrete data as input, such as Bayesian or rule learning. We focus our attention on handling multivariate classification problems, where high interactions among multiple attributes exist. In this paper, we propose the use of evolutionary algorithms to select a subset of cut points that defines the best possible discretization scheme of a data set using a wrapper fitness function. We also incorporate a reduction mechanism to successfully manage the multivariate approach on large data sets. Our method has been compared with the best state-of-the-art discretizers on 45 real datasets. The experiments show that our proposed algorithm overcomes the rest of the methods producing competitive discretization schemes in terms of accuracy, for C4.5, Naive Bayes, PART, and PrUning and BuiLding Integrated in Classification classifiers; and obtained far simpler solutions. [ABSTRACT FROM PUBLISHER]
Details
- Language :
- English
- ISSN :
- 21682267
- Volume :
- 46
- Issue :
- 3
- Database :
- Complementary Index
- Journal :
- IEEE Transactions on Cybernetics
- Publication Type :
- Academic Journal
- Accession number :
- 113114840
- Full Text :
- https://doi.org/10.1109/TCYB.2015.2410143