Back to Search Start Over

Voice EHR: introducing multimodal audio data for health.

Authors :
Anibal J
Huth H
Li M
Hazen L
Daoud V
Ebedes D
Lam YM
Nguyen H
Hong PV
Kleinman M
Ost S
Jackson C
Sprabery L
Elangovan C
Krishnaiah B
Akst L
Lina I
Elyazar I
Ekawati L
Jansen S
Nduwayezu R
Garcia C
Plum J
Brenner J
Song M
Ricotta E
Clifton D
Thwaites CL
Bensoussan Y
Wood B
Source :
Frontiers in digital health [Front Digit Health] 2025 Jan 28; Vol. 6, pp. 1448351. Date of Electronic Publication: 2025 Jan 28 (Print Publication: 2024).
Publication Year :
2025

Abstract

Introduction: Artificial intelligence (AI) models trained on audio data may have the potential to rapidly perform clinical tasks, enhancing medical decision-making and potentially improving outcomes through early detection. Existing technologies depend on limited datasets collected with expensive recording equipment in high-income countries, which challenges deployment in resource-constrained, high-volume settings where audio data may have a profound impact on health equity.<br />Methods: This report introduces a novel protocol for audio data collection and a corresponding application that captures health information through guided questions.<br />Results: To demonstrate the potential of Voice EHR as a biomarker of health, initial experiments on data quality and multiple case studies are presented in this report. Large language models (LLMs) were used to compare transcribed Voice EHR data with data (from the same patients) collected through conventional techniques like multiple choice questions. Information contained in the Voice EHR samples was consistently rated as equally or more relevant to a health evaluation.<br />Discussion: The HEAR application facilitates the collection of an audio electronic health record ("Voice EHR") that may contain complex biomarkers of health from conventional voice/respiratory features, speech patterns, and spoken language with semantic meaning and longitudinal context-potentially compensating for the typical limitations of unimodal clinical datasets.<br />Competing Interests: The authors declare no competing non-financial interests but the following competing financial interests. NIH may own intellectual property in the field. NIH and BJW receive royalties for licensed patents from Philips, unrelated to this work. BW is Principal Investigator on the following CRADA's=Cooperative Research & Development Agreements, between NIH and industry: Philips, Philips Research, Celsion Corp, BTG Biocompatibles/Boston Scientific, Siemens, NVIDIA, XACT Robotics. Promaxo (in progress). The following industry partners also support research in CIO lab via equipment, personnel, devices and/or drugs: 3T Technologies (devices), Exact Imaging (data), AngioDynamics (equipment), AstraZeneca (pharmaceuticals, NCI CRADA), ArciTrax (devices and equipment), Imactis (Equipment), Johnson & Johnson (equipment), Medtronic (equipment), Theromics (Supplies), Profound Medical (equipment and supplies), QT Imaging (equipment and supplies). The content of this manuscript does not necessarily reflect the views, policies, or opinions of the Uniformed Services University of the Health Sciences, the National Institutes of Health, the US Department of Health and Human Services, the US Department of Defense, the U.K. National Health Service, the U.K. National Institute for Health Research, the U.K. Department of Health, InnoHK – ITC, or the University of Oxford. The mention of commercial products, their source, or their use in connection with material reported herein is not to be construed as an actual or implied endorsement of such products by the U.S. government. This work was prepared by a military or civilian employee of the US Government as part of the individual's official duties and therefore is in the public domain and does not possess copyright protection (public domain information may be freely distributed and copied; however, as a courtesy it is requested that the Uniformed Services University and the author be given an appropriate acknowledgement). The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.<br /> (© 2025 Anibal, Huth, Li, Hazen, Daoud, Ebedes, Lam, Nguyen, Hong, Kleinman, Ost, Jackson, Sprabery, Elangovan, Krishnaiah, Akst, Lina, Elyazar, Ekawati, Jansen, Nduwayezu, Garcia, Plum, Brenner, Song, Ricotta, Clifton, Thwaites, Bensoussan and Wood.)

Details

Language :
English
ISSN :
2673-253X
Volume :
6
Database :
MEDLINE
Journal :
Frontiers in digital health
Publication Type :
Academic Journal
Accession number :
39936096
Full Text :
https://doi.org/10.3389/fdgth.2024.1448351