1. Natural language processing for automated annotation of medication mentions in primary care visit conversations
- Author
-
Amar K. Das, James Ryan, Michelle D Dannenberg, James C Finora, Craig H. Ganoe, Jesse A Schoonmaker, Glyn Elwyn, William Haslett, Paul Barr, Martha L. Bruce, Kyra Bonasia, Weiyi Wu, Saeed Hassanpour, and Wambui M Onsando
- Subjects
medication information extraction ,Recall ,AcademicSubjects/SCI01060 ,business.industry ,Computer science ,Medical record ,clinic visit recording ,Unified Medical Language System ,Health Informatics ,computer.software_genre ,Research and Applications ,Set (abstract data type) ,Annotation ,Test set ,Controlled vocabulary ,False positive paradox ,Data set (IBM mainframe) ,Artificial intelligence ,AcademicSubjects/SCI01530 ,natural language processing ,Set (psychology) ,business ,AcademicSubjects/MED00010 ,computer ,Natural language processing - Abstract
ObjectivesThe objective of this study is to build and evaluate a natural language processing approach to identify medication mentions in primary care visit conversations between patients and physicians.Materials and MethodsEight clinicians contributed to a data set of 85 clinic visit transcripts, and 10 transcripts were randomly selected from this data set as a development set. Our approach utilizes Apache cTAKES and Unified Medical Language System controlled vocabulary to generate a list of medication candidates in the transcribed text and then performs multiple customized filters to exclude common false positives from this list while including some additional common mentions of the supplements and immunizations.ResultsSixty-five transcripts with 1121 medication mentions were randomly selected as an evaluation set. Our proposed method achieved an F-score of 85.0% for identifying the medication mentions in the test set, significantly outperforming existing medication information extraction systems for medical records with F-scores ranging from 42.9% to 68.9% on the same test set.DiscussionOur medication information extraction approach for primary care visit conversations showed promising results, extracting about 27% more medication mentions from our evaluation set while eliminating many false positives in comparison to existing baseline systems. We made our approach publicly available on the web as an open-source software.ConclusionIntegration of our annotation system with clinical recording applications has the potential to improve patients’ understanding and recall of key information from their clinic visits, and, in turn, to positively impact health outcomes.
- Published
- 2021