1. Feature Ranking in Predictive Models for Hospital-Acquired Acute Kidney Injury
- Author
-
Lemuel R. Waitman, Xiangzhou Zhang, Mei Liu, Yong Hu, John A. Kellum, Xiaoxiao Liu, Weiqi Chen, Alan S.L. Yu, and Lijuan Wu
- Subjects
Adult ,Male ,0301 basic medicine ,Adolescent ,Stability (learning theory) ,lcsh:Medicine ,Feature selection ,urologic and male genital diseases ,Machine learning ,computer.software_genre ,Risk Assessment ,Severity of Illness Index ,Article ,Young Adult ,03 medical and health sciences ,0302 clinical medicine ,Predictive Value of Tests ,Electronic Health Records ,Humans ,Medicine ,030212 general & internal medicine ,lcsh:Science ,Selection (genetic algorithm) ,Retrospective Studies ,Models, Statistical ,Multidisciplinary ,business.industry ,lcsh:R ,Retrospective cohort study ,Acute Kidney Injury ,Middle Aged ,Hospitals ,female genital diseases and pregnancy complications ,3. Good health ,030104 developmental biology ,Feature (computer vision) ,Sample size determination ,Predictive value of tests ,lcsh:Q ,Female ,Artificial intelligence ,business ,computer ,Predictive modelling - Abstract
Acute Kidney Injury (AKI) is a common complication encountered among hospitalized patients, imposing significantly increased cost, morbidity, and mortality. Early prediction of AKI has profound clinical implications because currently no treatment exists for AKI once it develops. Feature selection (FS) is an essential process for building accurate and interpretable prediction models, but to our best knowledge no study has investigated the robustness and applicability of such selection process for AKI. In this study, we compared eight widely-applied FS methods for AKI prediction using nine-years of electronic medical records (EMR) and examined heterogeneity in feature rankings produced by the methods. FS methods were compared in terms of stability with respect to data sampling variation, similarity between selection results, and AKI prediction performance. Prediction accuracy did not intrinsically guarantee the feature ranking stability. Across different FS methods, the prediction performance did not change significantly, while the importance rankings of features were quite different. A positive correlation was observed between the complexity of suitable FS method and sample size. This study provides several practical implications, including recognizing the importance of feature stability as it is desirable for model reproducibility, identifying important AKI risk factors for further investigation, and facilitating early prediction of AKI.
- Published
- 2018
- Full Text
- View/download PDF