Back to Search Start Over

Natural language processing for identification of hypertrophic cardiomyopathy patients from cardiac magnetic resonance reports

Authors :
Nakeya Dewaswala
David Chen
Huzefa Bhopalwala
Vinod C. Kaggal
Sean P. Murphy
J. Martijn Bos
Jeffrey B. Geske
Bernard J. Gersh
Steve R. Ommen
Philip A. Araoz
Michael J. Ackerman
Adelaide M. Arruda-Olson
Source :
BMC Medical Informatics and Decision Making, Vol 22, Iss 1, Pp 1-9 (2022)
Publication Year :
2022
Publisher :
BMC, 2022.

Abstract

Abstract Background Cardiac magnetic resonance (CMR) imaging is important for diagnosis and risk stratification of hypertrophic cardiomyopathy (HCM) patients. However, collection of information from large numbers of CMR reports by manual review is time-consuming, error-prone and costly. Natural language processing (NLP) is an artificial intelligence method for automated extraction of information from narrative text including text in CMR reports in electronic health records (EHR). Our objective was to assess whether NLP can accurately extract diagnosis of HCM from CMR reports. Methods An NLP system with two tiers was developed for information extraction from narrative text in CMR reports; the first tier extracted information regarding HCM diagnosis while the second extracted categorical and numeric concepts for HCM classification. We randomly allocated 200 HCM patients with CMR reports from 2004 to 2018 into training (100 patients with 185 CMR reports) and testing sets (100 patients with 206 reports). Results NLP algorithms demonstrated very high performance compared to manual annotation. The algorithm to extract HCM diagnosis had accuracy of 0.99. The accuracy for categorical concepts included HCM morphologic subtype 0.99, systolic anterior motion of the mitral valve 0.96, mitral regurgitation 0.93, left ventricular (LV) obstruction 0.94, location of obstruction 0.92, apical pouch 0.98, LV delayed enhancement 0.93, left atrial enlargement 0.99 and right atrial enlargement 0.98. Accuracy for numeric concepts included maximal LV wall thickness 0.96, LV mass 0.99, LV mass index 0.98, LV ejection fraction 0.98 and right ventricular ejection fraction 0.99. Conclusions NLP identified and classified HCM from CMR narrative text reports with very high performance.

Details

Language :
English
ISSN :
14726947
Volume :
22
Issue :
1
Database :
Directory of Open Access Journals
Journal :
BMC Medical Informatics and Decision Making
Publication Type :
Academic Journal
Accession number :
edsdoj.9a6ce6d492547be984e9f1a91551f18
Document Type :
article
Full Text :
https://doi.org/10.1186/s12911-022-02017-y