A new machine learning classifier for high dimensional healthcare data.
- Source :
- Studies in health technology and informatics [Stud Health Technol Inform] 2007; Vol. 129 (Pt 1), pp. 664-8.
- Publication Year :
- 2007
Abstract
- Data sets with many discrete variables and relatively few cases arise in health care, commerce, information security, and many other domains. Learning effective and efficient prediction models from such data sets is a challenging task. In this paper, we propose a new approach that combines metaheuristic search and Bayesian networks to learn a graphical Markov Blanket-based classifier from data. The Tabu Search enhanced Markov Blanket (TS/MB) procedure is based on the use of restricted neighborhoods in a general Bayesian network constrained by the Markov condition, called Markov Blanket Neighborhoods. Computational results from two real-world healthcare data sets indicate that the TS/MB procedure converges quickly and is able to find a parsimonious model with substantially fewer predictor variables than in the full data set. Furthermore, it has comparable or better prediction performance when compared against several machine learning methods, and provides insight into possible causal relations among the variables.
Details
- Language :
- English
- ISSN :
- 0926-9630
- Volume :
- 129
- Issue :
- Pt 1
- Database :
- MEDLINE
- Journal :
- Studies in health technology and informatics
- Publication Type :
- Academic Journal
- Accession number :
- 17911800