Back to Search Start Over

A hybrid decision tree/genetic algorithm method for data mining.

Authors :
Carvalho, Debprah R.
Freitas, Alex A.
Source :
Information Sciences. Jun2004, Vol. 163 Issue 1-3, p13-35. 23p. 7 Diagrams, 3 Charts.
Publication Year :
2004

Abstract

This paper addresses the well-known classification task of data mining, where the objective is to predict the class which an example belongs to. Discovered knowledge is expressed in the form of high-level, easy-to-interpret classification rules. In order to discover classification rules, we propose a hybrid decision tree/genetic algorithm method. The central idea of this hybrid method involves the concept of small disjuncts in data mining, as follows. In essence, a set of classification rules can be regarded as a logical disjunction of rules, so that each rule can be regarded as a disjunct. A small disjunct is a rule covering a small number of examples. Due to their nature, small disjuncts are error prone. However, although each small disjunct covers just a few examples, the set of all small disjuncts can cover a large number of examples, so that it is important to develop new approaches to cope with the problem of small disjuncts. In our hybrid approach, we have developed two genetic algorithms (GA) specifically designed for discovering rules covering examples belonging to small disjuncts, whereas a conventional decision tree algorithm is used to produce rules covering examples belonging to large disjuncts. We present results evaluating the performance of the hybrid method in 22 real-world data sets. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00200255
Volume :
163
Issue :
1-3
Database :
Academic Search Index
Journal :
Information Sciences
Publication Type :
Periodical
Accession number :
13813033
Full Text :
https://doi.org/10.1016/j.ins.2003.03.013