Back to Search Start Over

Reducing diagnostic delays in acute hepatic porphyria using health records data and machine learning.

Authors :
Bhasuran B
Schmolly K
Kapoor Y
Jayakumar NL
Doan R
Amin J
Meninger S
Cheng N
Deering R
Anderson K
Beaven SW
Wang B
Rudrapatna VA
Source :
Journal of the American Medical Informatics Association : JAMIA [J Am Med Inform Assoc] 2025 Jan 01; Vol. 32 (1), pp. 63-70.
Publication Year :
2025

Abstract

Background: Acute hepatic porphyria (AHP) is a group of rare but treatable conditions associated with diagnostic delays of 15 years on average. The advent of electronic health records (EHR) data and machine learning (ML) may improve the timely recognition of rare diseases like AHP. However, prediction models can be difficult to train given the limited case numbers, unstructured EHR data, and selection biases intrinsic to healthcare delivery. We sought to train and characterize models for identifying patients with AHP.<br />Methods: This diagnostic study used structured and notes-based EHR data from 2 centers at the University of California, UCSF (2012-2022) and UCLA (2019-2022). The data were split into 2 cohorts (referral and diagnosis) and used to develop models that predict (1) who will be referred for testing of acute porphyria, among those who presented with abdominal pain (a cardinal symptom of AHP), and (2) who will test positive, among those referred. The referral cohort consisted of 747 patients referred for testing and 99 849 contemporaneous patients who were not. The diagnosis cohort consisted of 72 confirmed AHP cases and 347 patients who tested negative. The case cohort was 81% female and 6-75 years old at the time of diagnosis. Candidate models used a range of architectures. Feature selection was semi-automated and incorporated publicly available data from knowledge graphs. Our primary outcome was the F-score on an outcome-stratified test set.<br />Results: The best center-specific referral models achieved an F-score of 86%-91%. The best diagnosis model achieved an F-score of 92%. To further test our model, we contacted 372 current patients who lack an AHP diagnosis but were predicted by our models as potentially having it (≥10% probability of referral, ≥50% of testing positive). However, we were only able to recruit 10 of these patients for biochemical testing, all of whom were negative. Nonetheless, post hoc evaluations suggested that these models could identify 71% of cases earlier than their diagnosis date, saving 1.2 years.<br />Conclusions: ML can reduce diagnostic delays in AHP and other rare diseases. Robust recruitment strategies and multicenter coordination will be needed to validate these models before they can be deployed.<br /> (© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)

Details

Language :
English
ISSN :
1527-974X
Volume :
32
Issue :
1
Database :
MEDLINE
Journal :
Journal of the American Medical Informatics Association : JAMIA
Publication Type :
Academic Journal
Accession number :
38946554
Full Text :
https://doi.org/10.1093/jamia/ocae141