Prospective recruitment of patients with congestive heart failure using an ad-hoc binary classifier.

Authors :
Pakhomov, Serguei V.
Buntrock, James
Chute, Christopher G.
Source :
Journal of Biomedical Informatics; Apr 2005, Vol. 38 Issue 2, p145-153, 9p
Publication Year :
2005

Abstract

This paper addresses the specific problem of identifying patients diagnosed with a given condition for potential recruitment into a clinical trial or an epidemiological study. We present a simple machine learning method for identifying patients diagnosed with congestive heart failure and other related conditions by automatically classifying clinical notes dictated at Mayo Clinic. This method relies on an automatic classifier trained on comparable amounts of positive and negative samples of clinical notes previously categorized by human experts. The documents are represented as feature vectors, where features are a mix of demographic information as well as single words and concept mappings to the MeSH and HICDA classification systems. We compare two simple and efficient classification algorithms (Naïve Bayes and Perceptron) and a baseline term spotting method with respect to their accuracy and recall on positive samples. Depending on the test set, we find that Naïve Bayes yields better recall on positive samples than Perceptron (95 vs. 86%) but worse accuracy (57 vs. 65%). Both algorithms perform better than the baseline, which has recall on positive samples of 71% and accuracy of 54%. [Copyright Elsevier]
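
The kind of comparison the abstract describes can be illustrated with a minimal Python sketch. It is not the authors' implementation: the bag-of-words features, the trigger terms in `term_spotting`, the epoch count, and all function names are assumptions chosen for illustration, and the record does not describe the actual feature extraction (demographics, MeSH and HICDA mappings), which is omitted here. The sketch shows a term spotting baseline, a simple perceptron over word-count features, and the two reported metrics (recall on positive samples and accuracy).

```python
from collections import Counter


def featurize(note):
    """Hypothetical feature extractor: lowercase word -> count."""
    return Counter(note.lower().split())


def term_spotting(note, terms=("heart failure", "chf")):
    """Baseline: flag a note as positive if any trigger term occurs in it.

    The trigger terms are illustrative assumptions, not taken from the paper.
    """
    text = note.lower()
    return any(term in text for term in terms)


def train_perceptron(examples, epochs=10):
    """Train a binary perceptron.

    examples: list of (feature Counter, bool label) pairs.
    Returns a (weights, bias) tuple.
    """
    weights, bias = Counter(), 0.0
    for _ in range(epochs):
        for feats, label in examples:
            score = bias + sum(weights[f] * v for f, v in feats.items())
            if (score > 0) != label:
                # Mistake-driven update: move toward the correct label.
                step = 1.0 if label else -1.0
                for f, v in feats.items():
                    weights[f] += step * v
                bias += step
    return weights, bias


def predict_perceptron(model, feats):
    """Return True if the perceptron scores the features as positive."""
    weights, bias = model
    return bias + sum(weights[f] * v for f, v in feats.items()) > 0


def recall_and_accuracy(gold, predicted):
    """Return (recall on positive samples, overall accuracy)."""
    true_pos = sum(g and p for g, p in zip(gold, predicted))
    positives = sum(gold)
    correct = sum(g == p for g, p in zip(gold, predicted))
    recall = true_pos / positives if positives else 0.0
    return recall, correct / len(gold)


if __name__ == "__main__":
    # Toy notes invented for this sketch; they are not data from the study.
    train = [
        ("patient has congestive heart failure", True),
        ("routine visit no complaints", False),
        ("chf exacerbation admitted", True),
        ("knee pain follow up", False),
    ]
    model = train_perceptron([(featurize(n), y) for n, y in train])
    test_notes = ["admitted with heart failure", "no knee pain"]
    gold = [True, False]
    predicted = [predict_perceptron(model, featurize(n)) for n in test_notes]
    print(recall_and_accuracy(gold, predicted))
```

Reporting recall on positive samples separately from accuracy matters for recruitment: a classifier that misses eligible patients is costly even when its overall accuracy looks acceptable, which is the trade-off the abstract reports between Naïve Bayes and Perceptron.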

Details

Language :
English
ISSN :
1532-0464
Volume :
38
Issue :
2
Database :
Supplemental Index
Journal :
Journal of Biomedical Informatics
Publication Type :
Academic Journal
Accession number :
16837708
Full Text :
https://doi.org/10.1016/j.jbi.2004.11.016