1. Data-Driven Identification of Long-Term Glycemia Clusters and Their Individualized Predictors in Finnish Patients with Type 2 Diabetes
- Author
-
Lavikainen P, Chandra G, Siirtola P, Tamminen S, Ihalapathirana AT, Röning J, Laatikainen T, and Martikainen J
- Subjects
type 2 diabetes ,cluster ,hba1c ,machine learning ,shap ,Infectious and parasitic diseases ,RC109-216 - Abstract
Piia Lavikainen,1,* Gunjan Chandra,2,* Pekka Siirtola,2 Satu Tamminen,2 Anusha T Ihalapathirana,2 Juha Röning,2 Tiina Laatikainen,3– 5 Janne Martikainen1 1School of Pharmacy, University of Eastern Finland, Kuopio, Finland; 2Biomimetics and Intelligent Systems Group, Faculty of ITEE, University of Oulu, Oulu, Finland; 3Joint Municipal Authority for North Karelia Social and Health Services (Siun Sote), Joensuu, Finland; 4Department of Public Health and Social Welfare, Finnish Institute for Health and Welfare, Helsinki, Finland; 5Institute of Public Health and Clinical Nutrition, University of Eastern Finland, Kuopio, Finland*These authors contributed equally to this workCorrespondence: Piia Lavikainen, School of Pharmacy C/O Faculty of Health Sciences, University of Eastern Finland, P.O. Box 1627, Kuopio, FI-70211, Finland, Tel +358 40 7024682, Email piia.lavikainen@uef.fiPurpose: To gain an understanding of the heterogeneous group of type 2 diabetes (T2D) patients, we aimed to identify patients with the homogenous long-term HbA1c trajectories and to predict the trajectory membership for each patient using explainable machine learning methods and different clinical-, treatment-, and socio-economic-related predictors.Patients and Methods: Electronic health records data covering primary and specialized healthcare on 9631 patients having T2D diagnosis were extracted from the North Karelia region, Finland. Six-year HbA1c trajectories were examined with growth mixture models. Linear discriminant analysis and neural networks were applied to predict the trajectory membership individually.Results: Three HbA1c trajectories were distinguished over six years: “stable, adequate” (86.5%), “improving, but inadequate” (7.3%), and “fluctuating, inadequate” (6.2%) glycemic control. Prior glucose levels, duration of T2D, use of insulin only, use of insulin together with some oral antidiabetic medications, and use of only metformin were the most important predictors for the long-term treatment balance. The prediction model had a balanced accuracy of 85% and a receiving operating characteristic area under the curve of 91%, indicating high performance. Moreover, the results based on SHAP (Shapley additive explanations) values show that it is possible to explain the outcomes of machine learning methods at the population and individual levels.Conclusion: Heterogeneity in long-term glycemic control can be predicted with confidence by utilizing information from previous HbA1c levels, fasting plasma glucose, duration of T2D, and use of antidiabetic medications. In future, the expected development of HbA1c could be predicted based on the patient’s unique risk factors offering a practical tool for clinicians to support treatment planning.Keywords: type 2 diabetes, cluster, HbA1c, machine learning, SHAP
- Published
- 2023