Back to Search Start Over

A new machine learning classifier for high dimensional healthcare data.

Authors :
Padman R
Bai X
Airoldi EM
Source :
Studies in health technology and informatics [Stud Health Technol Inform] 2007; Vol. 129 (Pt 1), pp. 664-8.
Publication Year :
2007

Abstract

Data sets with many discrete variables and relatively few cases arise in health care, commerce, information security, and many other domains. Learning effective and efficient prediction models from such data sets is a challenging task. In this paper, we propose a new approach that combines Metaheuristic search and Bayesian Networks to learn a graphical Markov Blanket-based classifier from data. The Tabu Search enhanced Markov Blanket (TS/MB) procedure is based on the use of restricted neighborhoods in a general Bayesian Network constrained by the Markov condition, called Markov Blanket Neighborhoods. Computational results from two real world healthcare data sets indicate that the TS/MB procedure converges fast and is able to find a parsimonious model with substantially fewer predictor variables than in the full data set. Furthermore, it has comparable or better prediction performance when compared against several machine learning methods, and provides insight into possible causal relations among the variables.

Details

Language :
English
ISSN :
0926-9630
Volume :
129
Issue :
Pt 1
Database :
MEDLINE
Journal :
Studies in health technology and informatics
Publication Type :
Academic Journal
Accession number :
17911800