1. Regression and Random Forest Machine Learning Have Limited Performance in Predicting Bowel Preparation in Veteran Population
- Author
-
Stacy B. Menees, Amy M. Cohn, Sameer D. Saini, Rachel Lipson, Alex N. Kokaly, Andrew J. Read, Karmel S. Shehadeh, Jacob E. Kurlander, and Akbar K. Waljee
- Subjects
education.field_of_study ,medicine.diagnostic_test ,Receiver operating characteristic ,Physiology ,business.industry ,Population ,Gastroenterology ,Colonoscopy ,Retrospective cohort study ,Machine learning ,computer.software_genre ,Logistic regression ,03 medical and health sciences ,0302 clinical medicine ,Brier score ,030220 oncology & carcinogenesis ,Cohort ,medicine ,030211 gastroenterology & hepatology ,Artificial intelligence ,education ,business ,computer ,Predictive modelling - Abstract
Inadequate bowel preparation undermines the quality of colonoscopy, but patients likely to be affected are difficult to identify beforehand. This study aimed to develop, validate, and compare prediction models for bowel preparation inadequacy using conventional logistic regression (LR) and random forest machine learning (RFML). We created a retrospective cohort of patients who underwent outpatient colonoscopy at a single VA medical center between January 2012 and October 2015. Candidate predictor variables were chosen after a literature review. We extracted all available predictor variables from the electronic medical record, and bowel preparation from the endoscopy database. The data were split into 70% training and 30% validation sets. Multivariable LR and RFML were used to predict preparation inadequacy as a dichotomous outcome. The cohort included 6,885 Veterans, of whom 964 (14%) had inadequate preparation. Using LR, the area under the receiver operating characteristic curve (AUC) for the validation cohort was 0.66 (95% CI 0.62, 0.69) and the Brier score, in which a lower score indicates better performance, was 0.11. Using RFML, the AUC for the validation cohort was 0.61 (95% CI 0.58, 0.65) and the Brier score was 0.12. LR and RFML had similar performance in predicting bowel preparation, which was modest and likely insufficient for use in practice. Future research is needed to identify additional predictor variables and to test other machine learning algorithms. At present, endoscopy units should focus on universal strategies to enhance preparation adequacy.
- Published
- 2021
- Full Text
- View/download PDF