Back to Search Start Over

Multivariate Discretization Based on Evolutionary Cut Points Selection for Classification.

Authors :
Ramirez-Gallego, Sergio
Garcia, Salvador
Benitez, Jose Manuel
Herrera, Francisco
Source :
IEEE Transactions on Cybernetics; Mar2016, Vol. 46 Issue 3, p595-608, 14p
Publication Year :
2016

Abstract

Discretization is one of the most relevant techniques for data preprocessing. The main goal of discretization is to transform numerical attributes into discrete ones to help the experts to understand the data more easily, and it also provides the possibility to use some learning algorithms which require discrete data as input, such as Bayesian or rule learning. We focus our attention on handling multivariate classification problems, where high interactions among multiple attributes exist. In this paper, we propose the use of evolutionary algorithms to select a subset of cut points that defines the best possible discretization scheme of a data set using a wrapper fitness function. We also incorporate a reduction mechanism to successfully manage the multivariate approach on large data sets. Our method has been compared with the best state-of-the-art discretizers on 45 real datasets. The experiments show that our proposed algorithm overcomes the rest of the methods producing competitive discretization schemes in terms of accuracy, for C4.5, Naive Bayes, PART, and PrUning and BuiLding Integrated in Classification classifiers; and obtained far simpler solutions. [ABSTRACT FROM PUBLISHER]

Details

Language :
English
ISSN :
21682267
Volume :
46
Issue :
3
Database :
Complementary Index
Journal :
IEEE Transactions on Cybernetics
Publication Type :
Academic Journal
Accession number :
113114840
Full Text :
https://doi.org/10.1109/TCYB.2015.2410143