Back to Search
Start Over
Identifying Ventricular Arrhythmias and Their Predictors by Applying Machine Learning Methods to Electronic Health Records in Patients With Hypertrophic Cardiomyopathy(HCM-VAr-Risk Model)
- Publication Year :
- 2021
- Publisher :
- arXiv, 2021.
-
Abstract
- Clinical risk stratification for sudden cardiac death (SCD) in hypertrophic cardiomyopathy (HC) employs rules derived from American College of Cardiology Foundation/American Heart Association (ACCF/AHA) guidelines or the HCM Risk-SCD model (C-index ∼0.69), which utilize a few clinical variables. We assessed whether data-driven machine learning methods that consider a wider range of variables can effectively identify HC patients with ventricular arrhythmias (VAr) that lead to SCD. We scanned the electronic health records of 711 HC patients for sustained ventricular tachycardia or ventricular fibrillation. Patients with ventricular tachycardia or ventricular fibrillation (n = 61) were tagged as VAr cases and the remaining (n = 650) as non-VAr. The 2-sample ttest and information gain criterion were used to identify the most informative clinical variables that distinguish VAr from non-VAr; patient records were reduced to include only these variables. Data imbalance stemming from low number of VAr cases was addressed by applying a combination of over- and undersampling strategies. We trained and tested multiple classifiers under this sampling approach, showing effective classification. We evaluated 93 clinical variables, of which 22 proved predictive of VAr. The ensemble of logistic regression and naive Bayes classifiers, trained based on these 22 variables and corrected for data imbalance, was most effective in separating VAr from non-VAr cases (sensitivity = 0.73, specificity = 0.76, C-index = 0.83). Our method (HCM-VAr-Risk Model) identified 12 new predictors of VAr, in addition to 10 established SCD predictors. In conclusion, this is the first application of machine learning for identifying HC patients with VAr, using clinical attributes. Our model demonstrates good performance (C-index) compared with currently employed SCD prediction algorithms, while addressing imbalance inherent in clinical data.
- Subjects :
- Tachycardia
Male
FOS: Computer and information sciences
medicine.medical_specialty
Computer Science - Machine Learning
Cardiomyopathy
Magnetic Resonance Imaging, Cine
030204 cardiovascular system & hematology
Logistic regression
Machine learning
computer.software_genre
Ventricular tachycardia
Risk Assessment
Sudden cardiac death
Machine Learning (cs.LG)
Machine Learning
03 medical and health sciences
Electrocardiography
0302 clinical medicine
Predictive Value of Tests
Risk Factors
Internal medicine
medicine
Electronic Health Records
Humans
030212 general & internal medicine
Registries
cardiovascular diseases
Retrospective Studies
medicine.diagnostic_test
business.industry
fungi
Hypertrophic cardiomyopathy
Reproducibility of Results
Cardiomyopathy, Hypertrophic
Middle Aged
medicine.disease
Prognosis
Ventricular fibrillation
Cardiology
Tachycardia, Ventricular
Female
Artificial intelligence
medicine.symptom
Cardiology and Cardiovascular Medicine
business
computer
Echocardiography, Stress
Subjects
Details
- Database :
- OpenAIRE
- Accession number :
- edsair.doi.dedup.....123e6cae63f1ebc2cde37b59a8d0af13
- Full Text :
- https://doi.org/10.48550/arxiv.2109.09210