Descriptor: "Electronic Health Records classification" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Electronic Health Records classification"' showing total 202 results

Start Over Descriptor "Electronic Health Records classification"

202 results on '"Electronic Health Records classification"'

1. Building a Natural Language Interface for FHIR Clinical Terminology Server.

Author: Ngo H
Subjects: User-Computer Interface, Terminology as Topic, Health Information Interoperability, Vocabulary, Controlled, Information Storage and Retrieval methods, Humans, Electronic Health Records classification, Semantics, Artificial Intelligence, Natural Language Processing
Abstract: While Fast Healthcare Interoperability Resources (FHIR) clinical terminology server enables quick and easy search and retrieval of coded medical data, it still has some drawbacks. When searching, any typographical errors, variations in word forms, or deviations in word sequence might lead to incorrect search outcomes. For retrieval, queries to the server must strictly follow the FHIR application programming interface format, which requires users to know the syntax and remember the attribute codes they wish to retrieve. To improve its functionalities, a natural language interface was built, that harnesses the capabilities of two preeminent large language models, along with other cutting-edge technologies such as speech-to-text conversion, vector semantic searching, and conversational artificial intelligence. Preliminary evaluation shows promising results in building a natural language interface for the FHIR clinical terminology system.
Published: 2024
Full Text: View/download PDF

2. What Kind of Transformer Models to Use for the ICD-10 Codes Classification Task.

Author: Mansour M, Yilmaz F, Miletic M, and Sariyar M
Subjects: Humans, Clinical Coding, International Classification of Diseases, Natural Language Processing, Electronic Health Records classification
Abstract: Coding according to the International Classification of Diseases (ICD)-10 and its clinical modifications (CM) is inherently complex and expensive. Natural Language Processing (NLP) assists by simplifying the analysis of unstructured data from electronic health records, thereby facilitating diagnosis coding. This study investigates the suitability of transformer models for ICD-10 classification, considering both encoder and encoder-decoder architectures. The analysis is performed on clinical discharge summaries from the Medical Information Mart for Intensive Care (MIMIC)-IV dataset, which contains an extensive collection of electronic health records. Pre-trained models such as BioBERT, ClinicalBERT, ClinicalLongformer, and ClinicalBigBird are adapted for the coding task, incorporating specific preprocessing techniques to enhance performance. The findings indicate that increasing context length improves accuracy, and that the difference in accuracy between encoder and encoder-decoder models is negligible.
Published: 2024
Full Text: View/download PDF

3. Exploring Explainable AI Techniques for Text Classification in Healthcare: A Scoping Review.

Author: Madi IAE, Redjdal A, Bouaud J, and Seroussi B
Subjects: Natural Language Processing, Humans, Machine Learning, Electronic Health Records classification, Deep Learning, Artificial Intelligence
Abstract: Text classification plays an essential role in the medical domain by organizing and categorizing vast amounts of textual data through machine learning (ML) and deep learning (DL). The adoption of Artificial Intelligence (AI) technologies in healthcare has raised concerns about the interpretability of AI models, often perceived as "black boxes." Explainable AI (XAI) techniques aim to mitigate this issue by elucidating AI model decision-making process. In this paper, we present a scoping review exploring the application of different XAI techniques in medical text classification, identifying two main types: model-specific and model-agnostic methods. Despite some positive feedback from developers, formal evaluations with medical end users of these techniques remain limited. The review highlights the necessity for further research in XAI to enhance trust and transparency in AI-driven decision-making processes in healthcare.
Published: 2024
Full Text: View/download PDF

4. Term Candidate Generation to Enrich Clinical Terminologies with Large Language Models.

Author: Kugic A, Schulz S, and Kreuzthaler M
Subjects: Humans, Terminology as Topic, Electronic Health Records classification, Germany, International Classification of Diseases, Vocabulary, Controlled, Natural Language Processing
Abstract: Annotated language resources derived from clinical routine documentation form an intriguing asset for secondary use case scenarios. In this investigation, we report on how such a resource can be leveraged to identify additional term candidates for a chosen set of ICD-10 codes. We conducted a log-likelihood analysis, considering the co-occurrence of approximately 1.9 million de-identified ICD-10 codes alongside corresponding brief textual entries from problem lists in German. This analysis aimed to identify potential candidates with statistical significance set at p < 0.01, which were used as seed terms to harvest additional candidates by interfacing to a large language model in a second step. The proposed approach can identify additional term candidates at suitable performance values: hypernyms MAP@5=0.801, synonyms MAP@5 = 0.723 and hyponyms MAP@5 = 0.507. The re-use of existing annotated clinical datasets, in combination with large language models, presents an interesting strategy to bridge the lexical gap in standardized clinical terminologies and real-world jargon.
Published: 2024
Full Text: View/download PDF

5. Challenges of Integrating ICD 11 into Automatic Alerting Systems.

Author: Tegegne MD, Krips M, Ibrahim I, and Deserno TM
Subjects: Humans, Systems Integration, International Classification of Diseases, Electronic Health Records classification
Abstract: Automatic alerting systems (AASs) can identify adverse health events but emergency communication relies on human operators and natural languages. For complete automation, we need to code the diversity of adverse events in a granularity that supports optimal dispatches. Hence, AAs shall integrate with the International Classification of Diseases (ICD). The ICD-11 coding system includes chapters for external causes of injury. However, ICD-11 supports coding injury incidents in electronic health records (EHRs) after they have occurred, while disregarding integrating real-time injury reporting within its framework. We explore the potential challenges associated with integrating ICD-11 into AAS by analyzing external causes of morbidity or mortality and the dimensions of external causes as potential areas of integration. We recognize the themes: (i) incident of injury, (ii) mode of transport, (iii) indoor location, (iv) outdoor location, and (v) type of building, and identify four challenges: (i) conceptual differences between the two systems, (ii) injury identification, (iii) presence of entities below the shoreline in ICD-11, and (iv) lack of specificity in certain ICD-11 codes related to AASs. For easy integration of ICD-11 into AASs, we recommend an AAS data dictionary and propose ICD-11 updates related to external causes of injury.
Published: 2024
Full Text: View/download PDF

6. Classifiers of Data Sharing Statements in Clinical Trial Records.

Author: Jelodari Mamaghani S, Strantz C, and Toddenroth D
Subjects: Humans, Natural Language Processing, Electronic Health Records classification, Clinical Trials as Topic, Information Dissemination
Abstract: Digital individual participant data (IPD) from clinical trials are increasingly distributed for potential scientific reuse. The identification of available IPD, however, requires interpretations of textual data-sharing statements (DSS) in large databases. Recent advancements in computational linguistics include pre-trained language models that promise to simplify the implementation of effective classifiers based on textual inputs. In a subset of 5,000 textual DSS from ClinicalTrials.gov, we evaluate how well classifiers based on domain-specific pre-trained language models reproduce original availability categories as well as manually annotated labels. Typical metrics indicate that classifiers that predicted manual annotations outperformed those that learned to output the original availability categories. This suggests that the textual DSS descriptions contain applicable information that the availability categories do not, and that such classifiers could thus aid the automatic identification of available IPD in large trial databases.
Published: 2024
Full Text: View/download PDF

7. Fairness in Classifying and Grouping Health Equity Information.

Author: Jin R, Li X, Block LJ, Beschastnikh I, Currie LM, and Ronquillo CE
Subjects: Humans, Social Determinants of Health, Health Equity, Electronic Health Records classification, Machine Learning
Abstract: This paper explores the balance between fairness and performance in machine learning classification, predicting the likelihood of a patient receiving anti-microbial treatment using structured data in community nursing wound care electronic health records. The data includes two important predictors (gender and language) of the social determinants of health, which we used to evaluate the fairness of the classifiers. At the same time, the impact of various groupings of language codes on classifiers' performance and fairness is analyzed. Most common statistical learning-based classifiers are evaluated. The findings indicate that while K-Nearest Neighbors offers the best fairness metrics among different grouping settings, the performance of all classifiers is generally consistent across different language code groupings. Also, grouping more variables tends to improve the fairness metrics over all classifiers while maintaining their performance.
Published: 2024
Full Text: View/download PDF

8. Predicting Future Disorders via Temporal Knowledge Graphs and Medical Ontologies.

Author: Postiglione M, Bean D, Kraljevic Z, Dobson RJ, and Moscato V
Subjects: Humans, Algorithms, Electronic Health Records classification, Biological Ontologies
Abstract: Despite the vast potential for insights and value present in Electronic Health Records (EHRs), it is challenging to fully leverage all the available information, particularly that contained in the free-text data written by clinicians describing the health status of patients. The utilization of Named Entity Recognition and Linking tools allows not only for the structuring of information contained within free-text data, but also for the integration with medical ontologies, which may prove highly beneficial for the analysis of patient medical histories with the aim of forecasting future medical outcomes, such as the diagnosis of a new disorder. In this paper, we propose MedTKG, a Temporal Knowledge Graph (TKG) framework that incorporates both the dynamic information of patient clinical histories and the static information of medical ontologies. The TKG is used to model a medical history as a series of snapshots at different points in time, effectively capturing the dynamic nature of the patient's health status, while a static graph is used to model the hierarchies of concepts extracted from domain ontologies. The proposed method aims to predict future disorders by identifying missing objects in the quadruple 〈s, r, ?, t 〉, where s and r denote the patient and the disorder relation type, respectively, and t is the timestamp of the query. The method is evaluated on clinical notes extracted from MIMIC-III and demonstrates the effectiveness of the TKG framework in predicting future disorders and of medical ontologies in improving its performance.
Published: 2024
Full Text: View/download PDF

9. A taxonomy for advancing systematic error analysis in multi-site electronic health record-based clinical concept extraction.

Author: Fu S, Wang L, He H, Wen A, Zong N, Kumari A, Liu F, Zhou S, Zhang R, Li C, Wang Y, St Sauver J, Liu H, and Sohn S
Subjects: Humans, Classification methods, Medical Errors classification, Electronic Health Records classification, Natural Language Processing
Abstract: Background: Error analysis plays a crucial role in clinical concept extraction, a fundamental subtask within clinical natural language processing (NLP). The process typically involves a manual review of error types, such as contextual and linguistic factors contributing to their occurrence, and the identification of underlying causes to refine the NLP model and improve its performance. Conducting error analysis can be complex, requiring a combination of NLP expertise and domain-specific knowledge. Due to the high heterogeneity of electronic health record (EHR) settings across different institutions, challenges may arise when attempting to standardize and reproduce the error analysis process., Objectives: This study aims to facilitate a collaborative effort to establish common definitions and taxonomies for capturing diverse error types, fostering community consensus on error analysis for clinical concept extraction tasks., Materials and Methods: We iteratively developed and evaluated an error taxonomy based on existing literature, standards, real-world data, multisite case evaluations, and community feedback. The finalized taxonomy was released in both .dtd and .owl formats at the Open Health Natural Language Processing Consortium. The taxonomy is compatible with several different open-source annotation tools, including MAE, Brat, and MedTator., Results: The resulting error taxonomy comprises 43 distinct error classes, organized into 6 error dimensions and 4 properties, including model type (symbolic and statistical machine learning), evaluation subject (model and human), evaluation level (patient, document, sentence, and concept), and annotation examples. Internal and external evaluations revealed strong variations in error types across methodological approaches, tasks, and EHR settings. Key points emerged from community feedback, including the need to enhancing clarity, generalizability, and usability of the taxonomy, along with dissemination strategies., Conclusion: The proposed taxonomy can facilitate the acceleration and standardization of the error analysis process in multi-site settings, thus improving the provenance, interpretability, and portability of NLP models. Future researchers could explore the potential direction of developing automated or semi-automated methods to assist in the classification and standardization of error analysis., (© The Author(s) 2024. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
Published: 2024
Full Text: View/download PDF

10. A Taxonomy for Health Information Systems.

Author: Janssen A, Donnelly C, and Shaw T
Subjects: Humans, Electronic Health Records classification, Health Information Systems
Abstract: The health sector is highly digitized, which is enabling the collection of vast quantities of electronic data about health and well-being. These data are collected by a diverse array of information and communication technologies, including systems used by health care organizations, consumer and community sources such as information collected on the web, and passively collected data from technologies such as wearables and devices. Understanding the breadth of IT that collect these data and how it can be actioned is a challenge for the significant portion of the digital health workforce that interact with health data as part of their duties but are not for informatics experts. This viewpoint aims to present a taxonomy categorizing common information and communication technologies that collect electronic data. An initial classification of key information systems collecting electronic health data was undertaken via a rapid review of the literature. Subsequently, a purposeful search of the scholarly and gray literature was undertaken to extract key information about the systems within each category to generate definitions of the systems and describe the strengths and limitations of these systems., (©Anna Janssen, Candice Donnelly, Tim Shaw. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 31.05.2024.)
Published: 2024
Full Text: View/download PDF

11. Selection of Clinical Text Features for Classifying Suicide Attempts.

Author: Buckland RS, Hogan JW, and Chen ES
Subjects: Cohort Studies, Female, Humans, International Classification of Diseases, Machine Learning, Male, Phenotype, Prevalence, ROC Curve, Algorithms, Data Mining methods, Electronic Health Records classification, Natural Language Processing, Suicide, Attempted classification
Abstract: Research has demonstrated cohort misclassification when studies of suicidal thoughts and behaviors (STBs) rely on ICD-9/10-CM diagnosis codes. Electronic health record (EHR) data are being explored to better identify patients, a process called EHR phenotyping. Most STB phenotyping studies have used structured EHR data, but some are beginning to incorporate unstructured clinical text. In this study, we used a publicly-accessible natural language processing (NLP) program for biomedical text (MetaMap) and iterative elastic net regression to extract and select predictive text features from the discharge summaries of 810 inpatient admissions of interest. Initial sets of 5,866 and 2,709 text features were reduced to 18 and 11, respectively. The two models fit with these features obtained an area under the receiver operating characteristic curve of 0.866-0.895 and an area under the precision-recall curve of 0.800-0.838, demonstrating the approach's potential to identify textual features to incorporate in phenotyping models., (©2020 AMIA - All rights reserved.)
Published: 2021

12. Classification of drug use patterns.

Author: Righolt CH, Zhang G, and Mahmud SM
Subjects: Adult, Aged, Algorithms, Dose-Response Relationship, Drug, Female, Follow-Up Studies, Humans, Male, Middle Aged, Ontario epidemiology, Universal Health Care, Databases, Factual classification, Diabetes Mellitus drug therapy, Diabetes Mellitus epidemiology, Electronic Health Records classification, Hypoglycemic Agents therapeutic use, Metformin therapeutic use
Abstract: Characterizing long-term prescription data is challenging due to the time-varying nature of drug use. Conventional approaches summarize time-varying data into categorical variables based on simple measures, such as cumulative dose, while ignoring patterns of use. The loss of information can lead to misclassification and biased estimates of the exposure-outcome association. We introduce a classification method to characterize longitudinal prescription data with an unsupervised machine learning algorithm. We used administrative databases covering virtually all 1.3 million residents of Manitoba and explicitly designed features to describe the average dose, proportion of days covered (PDC), dose change, and dose variability, and clustered the resulting feature space using K-means clustering. We applied this method to metformin use in diabetes patients. We identified 27,786 metformin users and showed that the feature distributions of their metformin use are stable for varying the lengths of follow-up and that these distributions have clear interpretations. We found six distinct metformin user groups: patients with intermittent use, decreasing dose, increasing dose, high dose, and two medium dose groups (one with stable dose and one with highly variable use). Patients in the varying and decreasing dose groups had a higher chance of progression of diabetes than other patients. The method presented in this paper allows for characterization of drug use into distinct and clinically relevant groups in a way that cannot be obtained from merely classifying use by quantiles of overall use., (© 2020 The Authors. Pharmacology Research & Perspectives published by British Pharmacological Society and American Society for Pharmacology and Experimental Therapeutics and John Wiley & Sons Ltd.)
Published: 2020
Full Text: View/download PDF

13. The Inadequacy of Coding Nomenclature to Represent the Timeline of a Disease (Like Diabetes).

Author: Millares Martin P
Subjects: Diabetes Mellitus diagnosis, Diabetes Mellitus therapy, Disease Progression, Humans, Prognosis, Time Factors, Diabetes Mellitus classification, Electronic Health Records classification, International Classification of Diseases, Terminology as Topic
Published: 2020
Full Text: View/download PDF

14. sureLDA: A multidisease automated phenotyping method for the electronic health record.

Author: Ahuja Y, Zhou D, He Z, Sun J, Castro VM, Gainer V, Murphy SN, Hong C, and Cai T
Subjects: Humans, Precision Medicine, ROC Curve, Translational Research, Biomedical, Algorithms, Electronic Health Records classification, Natural Language Processing
Abstract: Objective: A major bottleneck hindering utilization of electronic health record data for translational research is the lack of precise phenotype labels. Chart review as well as rule-based and supervised phenotyping approaches require laborious expert input, hampering applicability to studies that require many phenotypes to be defined and labeled de novo. Though International Classification of Diseases codes are often used as surrogates for true labels in this setting, these sometimes suffer from poor specificity. We propose a fully automated topic modeling algorithm to simultaneously annotate multiple phenotypes., Materials and Methods: Surrogate-guided ensemble latent Dirichlet allocation (sureLDA) is a label-free multidimensional phenotyping method. It first uses the PheNorm algorithm to initialize probabilities based on 2 surrogate features for each target phenotype, and then leverages these probabilities to constrain the LDA topic model to generate phenotype-specific topics. Finally, it combines phenotype-feature counts with surrogates via clustering ensemble to yield final phenotype probabilities., Results: sureLDA achieves reliably high accuracy and precision across a range of simulated and real-world phenotypes. Its performance is robust to phenotype prevalence and relative informativeness of surogate vs nonsurrogate features. It also exhibits powerful feature selection properties., Discussion: sureLDA combines attractive properties of PheNorm and LDA to achieve high accuracy and precision robust to diverse phenotype characteristics. It offers particular improvement for phenotypes insufficiently captured by a few surrogate features. Moreover, sureLDA's feature selection ability enables it to handle high feature dimensions and produce interpretable computational phenotypes., Conclusions: sureLDA is well suited toward large-scale electronic health record phenotyping for highly multiphenotype applications such as phenome-wide association studies ., (© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
Published: 2020
Full Text: View/download PDF

15. Development and validation of phenotype classifiers across multiple sites in the observational health data sciences and informatics network.

Author: Kashyap M, Seneviratne M, Banda JM, Falconer T, Ryu B, Yoo S, Hripcsak G, and Shah NH
Subjects: Classification methods, Data Science, Humans, Observational Studies as Topic, Electronic Health Records classification, Medical Informatics, Supervised Machine Learning
Abstract: Objective: Accurate electronic phenotyping is essential to support collaborative observational research. Supervised machine learning methods can be used to train phenotype classifiers in a high-throughput manner using imperfectly labeled data. We developed 10 phenotype classifiers using this approach and evaluated performance across multiple sites within the Observational Health Data Sciences and Informatics (OHDSI) network., Materials and Methods: We constructed classifiers using the Automated PHenotype Routine for Observational Definition, Identification, Training and Evaluation (APHRODITE) R-package, an open-source framework for learning phenotype classifiers using datasets in the Observational Medical Outcomes Partnership Common Data Model. We labeled training data based on the presence of multiple mentions of disease-specific codes. Performance was evaluated on cohorts derived using rule-based definitions and real-world disease prevalence. Classifiers were developed and evaluated across 3 medical centers, including 1 international site., Results: Compared to the multiple mentions labeling heuristic, classifiers showed a mean recall boost of 0.43 with a mean precision loss of 0.17. Performance decreased slightly when classifiers were shared across medical centers, with mean recall and precision decreasing by 0.08 and 0.01, respectively, at a site within the USA, and by 0.18 and 0.10, respectively, at an international site., Discussion and Conclusion: We demonstrate a high-throughput pipeline for constructing and sharing phenotype classifiers across sites within the OHDSI network using APHRODITE. Classifiers exhibit good portability between sites within the USA, however limited portability internationally, indicating that classifier generalizability may have geographic limitations, and, consequently, sharing the classifier-building recipe, rather than the pretrained classifiers, may be more useful for facilitating collaborative observational research., (© The Author(s) 2020. Published by Oxford University Press on behalf of the American Medical Informatics Association.)
Published: 2020
Full Text: View/download PDF

16. Inferring multimodal latent topics from electronic health records.

Author: Li Y, Nair P, Lu XH, Wen Z, Wang Y, Dehaghi AAK, Miao Y, Liu W, Ordog T, Biernacka JM, Ryu E, Olson JE, Frye MA, Liu A, Guo L, Marelli A, Ahuja Y, Davila-Velderrain J, and Kellis M
Subjects: Bayes Theorem, Databases, Factual, Electronic Health Records statistics & numerical data, Humans, Machine Learning, Models, Statistical, Phenotype, Electronic Health Records classification, Medical Informatics methods
Abstract: Electronic health records (EHR) are rich heterogeneous collections of patient health information, whose broad adoption provides clinicians and researchers unprecedented opportunities for health informatics, disease-risk prediction, actionable clinical recommendations, and precision medicine. However, EHRs present several modeling challenges, including highly sparse data matrices, noisy irregular clinical notes, arbitrary biases in billing code assignment, diagnosis-driven lab tests, and heterogeneous data types. To address these challenges, we present MixEHR, a multi-view Bayesian topic model. We demonstrate MixEHR on MIMIC-III, Mayo Clinic Bipolar Disorder, and Quebec Congenital Heart Disease EHR datasets. Qualitatively, MixEHR disease topics reveal meaningful combinations of clinical features across heterogeneous data types. Quantitatively, we observe superior prediction accuracy of diagnostic codes and lab test imputations compared to the state-of-art methods. We leverage the inferred patient topic mixtures to classify target diseases and predict mortality of patients in critical conditions. In all comparison, MixEHR confers competitive performance and reveals meaningful disease-related topics.
Published: 2020
Full Text: View/download PDF

17. Using case-level context to classify cancer pathology reports.

Author: Gao S, Alawad M, Schaefferkoetter N, Penberthy L, Wu XC, Durbin EB, Coyle L, Ramanathan A, and Tourassi G
Subjects: Histological Techniques, Humans, Natural Language Processing, SEER Program, Electronic Health Records classification, Neoplasms pathology
Abstract: Individual electronic health records (EHRs) and clinical reports are often part of a larger sequence-for example, a single patient may generate multiple reports over the trajectory of a disease. In applications such as cancer pathology reports, it is necessary not only to extract information from individual reports, but also to capture aggregate information regarding the entire cancer case based off case-level context from all reports in the sequence. In this paper, we introduce a simple modular add-on for capturing case-level context that is designed to be compatible with most existing deep learning architectures for text classification on individual reports. We test our approach on a corpus of 431,433 cancer pathology reports, and we show that incorporating case-level context significantly boosts classification accuracy across six classification tasks-site, subsite, laterality, histology, behavior, and grade. We expect that with minimal modifications, our add-on can be applied towards a wide range of other clinical text-based tasks., Competing Interests: Author LC is employed by the commercial company Information Management Services Inc (IMS). This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Published: 2020
Full Text: View/download PDF

18. Boosting ICD multi-label classification of health records with contextual embeddings and label-granularity.

Author: Blanco A, Perez-de-Viñaspre O, Pérez A, and Casillas A
Subjects: Algorithms, Computer Graphics, Data Mining, Humans, International Classification of Diseases, Natural Language Processing, Neural Networks, Computer, Software, Spain, Deep Learning, Electronic Health Records classification, Medical Informatics, Pattern Recognition, Automated
Abstract: Background and Objective: This work deals with clinical text mining, a field of Natural Language Processing applied to biomedical informatics. The aim is to classify Electronic Health Records with respect to the International Classification of Diseases, which is the foundation for the identification of international health statistics, and the standard for reporting diseases and health conditions. Within the framework of data mining, the goal is the multi-label classification, as each health record has assigned multiple International Classification of Diseases codes. We investigate five Deep Learning architectures with a dataset obtained from the Basque Country Health System, and six different perspectives derived from shifts in the input and the output., Methods: We evaluate a Feed Forward Neural Network as the baseline and several Recurrent models based on the Bidirectional GRU architecture, putting our research focus on the text representation layer and testing three variants, from standard word embeddings to meta word embeddings techniques and contextual embeddings., Results: The results showed that the recurrent models overcome the non-recurrent model. The meta word embeddings techniques are capable of beating the standard word embeddings, but the contextual embeddings exhibit as the most robust for the downstream task overall. Additionally, the label-granularity alone has an impact on the classification performance., Conclusions: The contributions of this work are a) a comparison among five classification approaches based on Deep Learning on a Spanish dataset to cope with the multi-label health text classification problem; b) the study of the impact of document length and label-set size and granularity in the multi-label context; and c) the study of measures to mitigate multi-label text classification problems related to label-set size and sparseness., Competing Interests: Declaration of Competing Interest The authors declare that there is no conflict of interest., (Copyright © 2019. Published by Elsevier B.V.)
Published: 2020
Full Text: View/download PDF

19. Classification of Current Procedural Terminology Codes from Electronic Health Record Data Using Machine Learning.

Author: Burns ML, Mathis MR, Vandervest J, Tan X, Lu B, Colquhoun DA, Shah N, Kheterpal S, and Saager L
Subjects: Adolescent, Adult, Child, Child, Preschool, Female, Humans, Male, Middle Aged, Young Adult, Current Procedural Terminology, Databases, Factual classification, Electronic Health Records classification, Machine Learning classification, Neural Networks, Computer
Abstract: Background: Accurate anesthesiology procedure code data are essential to quality improvement, research, and reimbursement tasks within anesthesiology practices. Advanced data science techniques, including machine learning and natural language processing, offer opportunities to develop classification tools for Current Procedural Terminology codes across anesthesia procedures., Methods: Models were created using a Train/Test dataset including 1,164,343 procedures from 16 academic and private hospitals. Five supervised machine learning models were created to classify anesthesiology Current Procedural Terminology codes, with accuracy defined as first choice classification matching the institutional-assigned code existing in the perioperative database. The two best performing models were further refined and tested on a Holdout dataset from a single institution distinct from Train/Test. A tunable confidence parameter was created to identify cases for which models were highly accurate, with the goal of at least 95% accuracy, above the reported 2018 Centers for Medicare and Medicaid Services (Baltimore, Maryland) fee-for-service accuracy. Actual submitted claim data from billing specialists were used as a reference standard., Results: Support vector machine and neural network label-embedding attentive models were the best performing models, respectively, demonstrating overall accuracies of 87.9% and 84.2% (single best code), and 96.8% and 94.0% (within top three). Classification accuracy was 96.4% in 47.0% of cases using support vector machine and 94.4% in 62.2% of cases using label-embedding attentive model within the Train/Test dataset. In the Holdout dataset, respective classification accuracies were 93.1% in 58.0% of cases and 95.0% among 62.0%. The most important feature in model training was procedure text., Conclusions: Through application of machine learning and natural language processing techniques, highly accurate real-time models were created for anesthesiology Current Procedural Terminology code classification. The increased processing speed and a priori targeted accuracy of this classification approach may provide performance optimization and cost reduction for quality improvement, research, and reimbursement tasks reliant on anesthesiology procedure codes.
Published: 2020
Full Text: View/download PDF

20. A Random Forest-Assisted Evolutionary Algorithm for Data-Driven Constrained Multiobjective Combinatorial Optimization of Trauma Systems.

Author: Wang H and Jin Y
Subjects: Electronic Health Records classification, Humans, Wounds and Injuries classification, Algorithms, Decision Trees, Machine Learning
Abstract: Many real-world optimization problems can be solved by using the data-driven approach only, simply because no analytic objective functions are available for evaluating candidate solutions. In this paper, we address a class of expensive data-driven constrained multiobjective combinatorial optimization problems, where the objectives and constraints can be calculated only on the basis of a large amount of data. To solve this class of problems, we propose using random forests (RFs) and radial basis function networks as surrogates to approximate both objective and constraint functions. In addition, logistic regression models are introduced to rectify the surrogate-assisted fitness evaluations and a stochastic ranking selection is adopted to further reduce the influences of the approximated constraint functions. Three variants of the proposed algorithm are empirically evaluated on multiobjective knapsack benchmark problems and two real-world trauma system design problems. Experimental results demonstrate that the variant using RF models as the surrogates is effective and efficient in solving data-driven constrained multiobjective combinatorial optimization problems.
Published: 2020
Full Text: View/download PDF

21. An augmented estimation procedure for EHR-based association studies accounting for differential misclassification.

Author: Tong J, Huang J, Chubak J, Wang X, Moore JH, Hubbard RA, and Chen Y
Subjects: Bias, Data Warehousing, Humans, Algorithms, Electronic Health Records classification
Abstract: Objectives: The ability to identify novel risk factors for health outcomes is a key strength of electronic health record (EHR)-based research. However, the validity of such studies is limited by error in EHR-derived phenotypes. The objective of this study was to develop a novel procedure for reducing bias in estimated associations between risk factors and phenotypes in EHR data., Materials and Methods: The proposed method combines the strengths of a gold-standard phenotype obtained through manual chart review for a small validation set of patients and an automatically-derived phenotype that is available for all patients but is potentially error-prone (hereafter referred to as the algorithm-derived phenotype). An augmented estimator of associations is obtained by optimally combining these 2 phenotypes. We conducted simulation studies to evaluate the performance of the augmented estimator and conducted an analysis of risk factors for second breast cancer events using data on a cohort from Kaiser Permanente Washington., Results: The proposed method was shown to reduce bias relative to an estimator using only the algorithm-derived phenotype and reduce variance compared to an estimator using only the validation data., Discussion: Our simulation studies and real data application demonstrate that, compared to the estimator using validation data only, the augmented estimator has lower variance (ie, higher statistical efficiency). Compared to the estimator using error-prone EHR-derived phenotypes, the augmented estimator has smaller bias., Conclusions: The proposed estimator can effectively combine an error-prone phenotype with gold-standard data from a limited chart review in order to improve analyses of risk factors using EHR data., (© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
Published: 2020
Full Text: View/download PDF

22. Named entity recognition in electronic health records using transfer learning bootstrapped Neural Networks.

Author: Gligic L, Kormilitzin A, Goldberg P, and Nevado-Holgado A
Subjects: Data Collection classification, Data Collection methods, Humans, Electronic Health Records classification, Machine Learning classification, Natural Language Processing, Neural Networks, Computer
Abstract: Neural networks (NNs) have become the state of the art in many machine learning applications, such as image, sound (LeCun et al., 2015) and natural language processing (Young et al., 2017; Linggard et al., 2012). However, the success of NNs remains dependent on the availability of large labelled datasets, such as in the case of electronic health records (EHRs). With scarce data, NNs are unlikely to be able to extract this hidden information with practical accuracy. In this study, we develop an approach that solves these problems for named entity recognition, obtaining 94.6 F1 score in I2B2 2009 Medical Extraction Challenge (Uzuner et al., 2010), 4.3 above the architecture that won the competition. To achieve this, we bootstrap our NN models through transfer learning by pretraining word embeddings on a secondary task performed on a large pool of unannotated EHRs and using the output embeddings as a foundation of a range of NN architectures. Beyond the official I2B2 challenge, we further achieve 82.4 F1 on extracting relationships between medical terms using attention-based seq2seq models bootstrapped in the same manner., (Crown Copyright © 2019. Published by Elsevier Ltd. All rights reserved.)
Published: 2020
Full Text: View/download PDF

23. A maximum likelihood approach to electronic health record phenotyping using positive and unlabeled patients.

Author: Zhang L, Ding X, Ma Y, Muthu N, Ajmal I, Moore JH, Herman DS, and Chen J
Subjects: Humans, Monte Carlo Method, Algorithms, Electronic Health Records classification, Likelihood Functions
Abstract: Objective: Phenotyping patients using electronic health record (EHR) data conventionally requires labeled cases and controls. Assigning labels requires manual medical chart review and therefore is labor intensive. For some phenotypes, identifying gold-standard controls is prohibitive. We developed an accurate EHR phenotyping approach that does not require labeled controls., Materials and Methods: Our framework relies on a random subset of cases, which can be specified using an anchor variable that has excellent positive predictive value and sensitivity independent of predictors. We proposed a maximum likelihood approach that efficiently leverages data from the specified cases and unlabeled patients to develop logistic regression phenotyping models, and compare model performance with existing algorithms., Results: Our method outperformed the existing algorithms on predictive accuracy in Monte Carlo simulation studies, application to identify hypertension patients with hypokalemia requiring oral supplementation using a simulated anchor, and application to identify primary aldosteronism patients using real-world cases and anchor variables. Our method additionally generated consistent estimates of 2 important parameters, phenotype prevalence and the proportion of true cases that are labeled., Discussion: Upon identification of an anchor variable that is scalable and transferable to different practices, our approach should facilitate development of scalable, transferable, and practice-specific phenotyping models., Conclusions: Our proposed approach enables accurate semiautomated EHR phenotyping with minimal manual labeling and therefore should greatly facilitate EHR clinical decision support and research., (© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.)
Published: 2020
Full Text: View/download PDF

24. Evaluating global and local sequence alignment methods for comparing patient medical records.

Author: Huang M, Shah ND, and Yao L
Subjects: Algorithms, Diagnosis, Electronic Health Records classification, Humans, Prognosis, Therapeutics, Cross-Cultural Comparison, Electronic Health Records statistics & numerical data, Sequence Alignment
Abstract: Background: Sequence alignment is a way of arranging sequences (e.g., DNA, RNA, protein, natural language, financial data, or medical events) to identify the relatedness between two or more sequences and regions of similarity. For Electronic Health Records (EHR) data, sequence alignment helps to identify patients of similar disease trajectory for more relevant and precise prognosis, diagnosis and treatment of patients., Methods: We tested two cutting-edge global sequence alignment methods, namely dynamic time warping (DTW) and Needleman-Wunsch algorithm (NWA), together with their local modifications, DTW for Local alignment (DTWL) and Smith-Waterman algorithm (SWA), for aligning patient medical records. We also used 4 sets of synthetic patient medical records generated from a large real-world EHR database as gold standard data, to objectively evaluate these sequence alignment algorithms., Results: For global sequence alignments, 47 out of 80 DTW alignments and 11 out of 80 NWA alignments had superior similarity scores than reference alignments while the rest 33 DTW alignments and 69 NWA alignments had the same similarity scores as reference alignments. Forty-six out of 80 DTW alignments had better similarity scores than NWA alignments with the rest 34 cases having the equal similarity scores from both algorithms. For local sequence alignments, 70 out of 80 DTWL alignments and 68 out of 80 SWA alignments had larger coverage and higher similarity scores than reference alignments while the rest DTWL alignments and SWA alignments received the same coverage and similarity scores as reference alignments. Six out of 80 DTWL alignments showed larger coverage and higher similarity scores than SWA alignments. Thirty DTWL alignments had the equal coverage but better similarity scores than SWA. DTWL and SWA received the equal coverage and similarity scores for the rest 44 cases., Conclusions: DTW, NWA, DTWL and SWA outperformed the reference alignments. DTW (or DTWL) seems to align better than NWA (or SWA) by inserting new daily events and identifying more similarities between patient medical records. The evaluation results could provide valuable information on the strengths and weakness of these sequence alignment methods for future development of sequence alignment methods and patient similarity-based studies.
Published: 2019
Full Text: View/download PDF

25. Two algorithms for the reorganisation of the problem list by organ system.

Author: Hier DB and Pearson J
Subjects: Humans, International Classification of Diseases, Primary Health Care, Systematized Nomenclature of Medicine, Algorithms, Electronic Health Records classification, Electronic Health Records standards, Health Information Management
Abstract: Objective: Long problem lists can be challenging to use. Reorganisation of the problem list by organ system is a strategy for making long problem lists more manageable., Methods: In a small-town primary care setting, we examined 4950 unique problem lists over 5 years (24 033 total problems and 2170 unique problems) from our electronic health record. All problems were mapped to the International Classification of Diseases, 10th Revision, Clinical Modification (ICD-10-CM) and SNOMED CT codes. We developed two different algorithms for reorganising the problem list by organ system based on either the ICD-10-CM or the SNOMED CT code., Results: The mean problem list length was 4.9±4.6 problems. The two reorganisation algorithms allocated problems to one of 15 different categories (12 aligning with organ systems). 26.2% of problems were assigned to a more general category of 'signs and symptoms' that did not correspond to a single organ system. The two algorithms were concordant in allocation by organ system for 90% of the unique problems. Since ICD-10-CM is a monohierarchic classification system, problems coded by ICD-10-CM were assigned to a single category. Since SNOMED CT is a polyhierarchical ontology, 19.4% of problems coded by SNOMED CT were assigned to multiple categories., Conclusion: Reorganisation of the problem list by organ system is feasible using algorithms based on either ICD-10-CM or SNOMED CT codes, and the two algorithms are highly concordant., Competing Interests: Competing interests: None declared., (© Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.)
Published: 2019
Full Text: View/download PDF

26. Developing a FHIR-based EHR phenotyping framework: A case study for identification of patients with obesity and multiple comorbidities from discharge summaries.

Author: Hong N, Wen A, Stone DJ, Tsuji S, Kingsbury PR, Rasmussen LV, Pacheco JA, Adekkanattu P, Wang F, Luo Y, Pathak J, Liu H, and Jiang G
Subjects: Adult, Algorithms, Body Mass Index, Comorbidity, Female, Humans, Machine Learning, Male, Phenotype, Software, Electronic Health Records classification, Health Information Interoperability, Obesity epidemiology, Patient Discharge
Abstract: Background: Standards-based clinical data normalization has become a key component of effective data integration and accurate phenotyping for secondary use of electronic healthcare records (EHR) data. HL7 Fast Healthcare Interoperability Resources (FHIR) is an emerging clinical data standard for exchanging electronic healthcare data and has been used in modeling and integrating both structured and unstructured EHR data for a variety of clinical research applications. The overall objective of this study is to develop and evaluate a FHIR-based EHR phenotyping framework for identification of patients with obesity and its multiple comorbidities from semi-structured discharge summaries leveraging a FHIR-based clinical data normalization pipeline (known as NLP2FHIR)., Methods: We implemented a multi-class and multi-label classification system based on the i2b2 Obesity Challenge task to evaluate the FHIR-based EHR phenotyping framework. Two core parts of the framework are: (a) the conversion of discharge summaries into corresponding FHIR resources - Composition, Condition, MedicationStatement, Procedure and FamilyMemberHistory using the NLP2FHIR pipeline, and (b) the implementation of four machine learning algorithms (logistic regression, support vector machine, decision tree, and random forest) to train classifiers to predict disease state of obesity and 15 comorbidities using features extracted from standard FHIR resources and terminology expansions. We used the macro- and micro-averaged precision (P), recall (R), and F1 score (F1) measures to evaluate the classifier performance. We validated the framework using a second obesity dataset extracted from the MIMIC-III database., Results: Using the NLP2FHIR pipeline, 1237 clinical discharge summaries from the 2008 i2b2 obesity challenge dataset were represented as the instances of the FHIR Composition resource consisting of 5677 records with 16 unique section types. After the NLP processing and FHIR modeling, a set of 244,438 FHIR clinical resource instances were generated. As the results of the four machine learning classifiers, the random forest algorithm performed the best with F1-micro(0.9466)/F1-macro(0.7887) and F1-micro(0.9536)/F1-macro(0.6524) for intuitive classification (reflecting medical professionals' judgments) and textual classification (reflecting the judgments based on explicitly reported information of diseases), respectively. The MIMIC-III obesity dataset was successfully integrated for prediction with minimal configuration of the NLP2FHIR pipeline and machine learning models., Conclusions: The study demonstrated that the FHIR-based EHR phenotyping approach could effectively identify the state of obesity and multiple comorbidities using semi-structured discharge summaries. Our FHIR-based phenotyping approach is a first concrete step towards improving the data aspect of phenotyping portability across EHR systems and enhancing interpretability of the machine learning-based phenotyping algorithms., (Copyright © 2019 Elsevier Inc. All rights reserved.)
Published: 2019
Full Text: View/download PDF

27. Deep Sequential Models for Suicidal Ideation From Multiple Source Data.

Author: Peis I, Olmos PM, Vera-Varela C, Barrigon ML, Courtet P, Baca-Garcia E, and Artes-Rodriguez A
Subjects: Adult, Deep Learning, Ecological Momentary Assessment, Female, Humans, Male, Middle Aged, Models, Psychological, Electronic Health Records classification, Neural Networks, Computer, Suicidal Ideation, Suicide psychology, Suicide statistics & numerical data, Suicide Prevention
Abstract: This paper presents a novel method for predicting suicidal ideation from electronic health records (EHR) and ecological momentary assessment (EMA) data using deep sequential models. Both EHR longitudinal data and EMA question forms are defined by asynchronous, variable length, randomly sampled data sequences. In our method, we model each of them with a recurrent neural network, and both sequences are aligned by concatenating the hidden state of each of them using temporal marks. Furthermore, we incorporate attention schemes to improve performance in long sequences and time-independent pre-trained schemes to cope with very short sequences. Using a database of 1023 patients, our experimental results show that the addition of EMA records boosts the system recall to predict the suicidal ideation diagnosis from 48.13% obtained exclusively from EHR-based state-of-the-art methods to 67.78%. Additionally, our method provides interpretability through the t-distributed stochastic neighbor embedding (t-SNE) representation of the latent space. Furthermore, the most relevant input features are identified and interpreted medically.
Published: 2019
Full Text: View/download PDF

28. A two-stage deep learning approach for extracting entities and relationships from medical texts.

Author: Suárez-Paniagua V, Rivera Zavala RM, Segura-Bedmar I, and Martínez P
Subjects: Clinical Coding, Drug Interactions, Humans, Data Mining methods, Deep Learning, Electronic Health Records classification
Abstract: This work presents a two-stage deep learning system for Named Entity Recognition (NER) and Relation Extraction (RE) from medical texts. These tasks are a crucial step to many natural language understanding applications in the biomedical domain. Automatic medical coding of electronic medical records, automated summarizing of patient records, automatic cohort identification for clinical studies, text simplification of health documents for patients, early detection of adverse drug reactions or automatic identification of risk factors are only a few examples of the many possible opportunities that the text analysis can offer in the clinical domain. In this work, our efforts are primarily directed towards the improvement of the pharmacovigilance process by the automatic detection of drug-drug interactions (DDI) from texts. Moreover, we deal with the semantic analysis of texts containing health information for patients. Our two-stage approach is based on Deep Learning architectures. Concretely, NER is performed combining a bidirectional Long Short-Term Memory (Bi-LSTM) and a Conditional Random Field (CRF), while RE applies a Convolutional Neural Network (CNN). Since our approach uses very few language resources, only the pre-trained word embeddings, and does not exploit any domain resources (such as dictionaries or ontologies), this can be easily expandable to support other languages and clinical applications that require the exploitation of semantic information (concepts and relationships) from texts. During the last years, the task of DDI extraction has received great attention by the BioNLP community. However, the problem has been traditionally evaluated as two separate subtasks: drug name recognition and extraction of DDIs. To the best of our knowledge, this is the first work that provides an evaluation of the whole pipeline. Moreover, our system obtains state-of-the-art results on the eHealth-KD challenge, which was part of the Workshop on Semantic Analysis at SEPLN (TASS-2018)., (Copyright © 2019 Elsevier Inc. All rights reserved.)
Published: 2019
Full Text: View/download PDF

29. Making work visible for electronic phenotype implementation: Lessons learned from the eMERGE network.

Author: Shang N, Liu C, Rasmussen LV, Ta CN, Caroll RJ, Benoit B, Lingren T, Dikilitas O, Mentch FD, Carrell DS, Wei WQ, Luo Y, Gainer VS, Kullo IJ, Pacheco JA, Hakonarson H, Walunas TL, Denny JC, Wiley K, Murphy SN, Hripcsak G, and Weng C
Subjects: Algorithms, Genomics, Humans, Phenotype, Retrospective Studies, Electronic Health Records classification, Medical Informatics methods
Abstract: Background: Implementation of phenotype algorithms requires phenotype engineers to interpret human-readable algorithms and translate the description (text and flowcharts) into computable phenotypes - a process that can be labor intensive and error prone. To address the critical need for reducing the implementation efforts, it is important to develop portable algorithms., Methods: We conducted a retrospective analysis of phenotype algorithms developed in the Electronic Medical Records and Genomics (eMERGE) network and identified common customization tasks required for implementation. A novel scoring system was developed to quantify portability from three aspects: Knowledge conversion, clause Interpretation, and Programming (KIP). Tasks were grouped into twenty representative categories. Experienced phenotype engineers were asked to estimate the average time spent on each category and evaluate time saving enabled by a common data model (CDM), specifically the Observational Medical Outcomes Partnership (OMOP) model, for each category., Results: A total of 485 distinct clauses (phenotype criteria) were identified from 55 phenotype algorithms, corresponding to 1153 customization tasks. In addition to 25 non-phenotype-specific tasks, 46 tasks are related to interpretation, 613 tasks are related to knowledge conversion, and 469 tasks are related to programming. A score between 0 and 2 (0 for easy, 1 for moderate, and 2 for difficult portability) is assigned for each aspect, yielding a total KIP score range of 0 to 6. The average clause-wise KIP score to reflect portability is 1.37 ± 1.38. Specifically, the average knowledge (K) score is 0.64 ± 0.66, interpretation (I) score is 0.33 ± 0.55, and programming (P) score is 0.40 ± 0.64. 5% of the categories can be completed within one hour (median). 70% of the categories take from days to months to complete. The OMOP model can assist with vocabulary mapping tasks., Conclusion: This study presents firsthand knowledge of the substantial implementation efforts in phenotyping and introduces a novel metric (KIP) to measure portability of phenotype algorithms for quantifying such efforts across the eMERGE Network. Phenotype developers are encouraged to analyze and optimize the portability in regards to knowledge, interpretation and programming. CDMs can be used to improve the portability for some 'knowledge-oriented' tasks., (Copyright © 2019 Elsevier Inc. All rights reserved.)
Published: 2019
Full Text: View/download PDF

30. Comprehensive Word-Level Classification of Screening Mammography Reports Using a Neural Network Sequence Labeling Approach.

Author: Short RG, Bralich J, Bogaty D, and Befera NT
Subjects: Databases, Factual, Female, Humans, Reproducibility of Results, Research Report, Breast Neoplasms diagnostic imaging, Electronic Health Records classification, Image Interpretation, Computer-Assisted methods, Mammography methods, Neural Networks, Computer
Abstract: Radiology reports contain a large amount of potentially valuable unstructured data. Recently, neural networks have been employed to perform classification of radiology reports over a few classes at the document level. The success of neural networks in sequence-labeling problems such as named entity recognition and part of speech tagging suggests that they could be used to classify radiology report text with greater granularity. We employed a neural network architecture to comprehensively classify mammography report text at the word level using a sequence labeling approach. Two radiologists devised a comprehensive classification system for screening mammography reports. Each word in each report was manually categorized by a radiologist into one of 33 categories according to the classification system. Tagged words referencing the same finding were grouped into unique sets. We pre-labeled reports with a rule-based algorithm and then manually edited these annotations for 6705 screening mammography reports (25.1%, 66.8%, and 8.1% BI-RADS 0, 1, and 2, respectively). A combined convolutional and recurrent neural network model was used to label words in each sentence of the individual reports. A siamese recurrent neural network was then used to group findings into sets. Performance of the neural network-based method was compared to a rule-based algorithm and a conditional random field (CRF) model. Global accuracy (percentage of documents where all word tags were predicted correctly) and keyword accuracy (percentage of all words that were labeled correctly, excluding words tagged as unimportant) were calculated on an unseen 519 report test set. Two-tailed t tests were used to assess differences between algorithm performance, and p < 0.05 was used to determine statistical significance. The neural network-based approach showed significantly higher global accuracy compared to both the rule-based algorithm (88.3 vs 57.0%, p < 0.001) and the CRF model (88.3% vs. 75.8%, p < 0.001). The neural network also showed significantly higher keyword level accuracy compared to the rule-based algorithm (95.5% vs. 80.9% p < 0.001) and CRF model (95.5% vs. 76.9%, p < 0.001). We demonstrate the potential of neural networks to accurately perform word-level multilabel classification of free text radiology reports across 33 classes, thus showing the utility of a sequence labeling approach to NLP of radiology reports. We found that a neural network classifier outperforms a rule-based algorithm and a CRF classifier for comprehensive multilabel classification of free text screening mammography reports at the word level. By approaching radiology report classification as a sequence-labeling problem, we demonstrate the ability of neural networks to extract data from free text radiology reports at a level of granularity not previously reported.
Published: 2019
Full Text: View/download PDF

31. Electronic Medical Record Context Signatures Improve Diagnostic Classification Using Medical Image Computing.

Author: Chaganti S, Mawn LA, Kang H, Egan J, Resnick SM, Beason-Held LL, Landman BA, and Lasko TA
Subjects: Humans, Image Interpretation, Computer-Assisted, Optic Nerve diagnostic imaging, Optic Nerve Diseases diagnostic imaging, Diagnosis, Computer-Assisted methods, Diagnostic Imaging methods, Electronic Health Records classification, Software
Abstract: Composite models that combine medical imaging with electronic medical records (EMR) improve predictive power when compared to traditional models that use imaging alone. The digitization of EMR provides potential access to a wealth of medical information, but presents new challenges in algorithm design and inference. Previous studies, such as Phenome Wide Association Study (PheWAS), have shown that EMR data can be used to investigate the relationship between genotypes and clinical conditions. Here, we introduce Phenome-Disease Association Study to extend the statistical capabilities of the PheWAS software through a custom Python package, which creates diagnostic EMR signatures to capture system-wide co-morbidities for a disease population within a given time interval. We investigate the effect of integrating these EMR signatures with radiological data to improve diagnostic classification in disease domains known to have confounding factors because of variable and complex clinical presentation. Specifically, we focus on two studies: First, a study of four major optic nerve related conditions; and second, a study of diabetes. Addition of EMR signature vectors to radiologically derived structural metrics improves the area under the curve (AUC) for diagnostic classification using elastic net regression, for diseases of the optic nerve. For glaucoma, the AUC improves from 0.71 to 0.83, for intrinsic optic nerve disease it increases from 0.72 to 0.91, for optic nerve edema it increases from 0.95 to 0.96, and for thyroid eye disease from 0.79 to 0.89. The EMR signatures recapitulate known comorbidities with diabetes, such as abnormal glucose, but do not significantly modulate image-derived features. In summary, EMR signatures present a scalable and readily applicable.
Published: 2019
Full Text: View/download PDF

32. Efficient Mining Template of Predictive Temporal Clinical Event Patterns From Patient Electronic Medical Records.

Author: Li J, Tan X, Xu X, and Wang F
Subjects: Adult, Aged, Aged, 80 and over, Algorithms, Databases, Factual, Female, Humans, Male, Pattern Recognition, Automated, Time Factors, Data Mining methods, Electronic Health Records classification, Medical Informatics methods, Models, Statistical
Abstract: Exploring the temporal relationship among events in patient electronic medical records (EMR) is an important problem in biomedical informatics and the results can reveal patients' impending disease conditions. In this paper, we investigate the problem of mining patterns from a sequence of point events, i.e., we only have the information on when the event happens but no duration or numerical value available. We propose a whole pipeline, including event preprocessing, pattern mining, and outcome analysis to mine the patterns and evaluate their effectiveness and discriminative power. Finally, we treat those mined patterns as additional features and evaluate them in a predictive modeling task for the early detection of congestive heart failure. On a real-world EMR data warehouse, we found that by adding those sequential pattern features, the prediction performance could be significantly improved approximately 0.1.
Published: 2019
Full Text: View/download PDF

33. Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN.

Author: Dong X, Chowdhury S, Qian L, Li X, Guan Y, Yang J, and Yu Q
Subjects: Asian People, Database Management Systems, Deep Learning trends, Humans, Machine Learning, Neural Networks, Computer, Records classification, Data Collection methods, Data Mining methods, Electronic Health Records classification
Abstract: Specific entity terms such as disease, test, symptom, and genes in Electronic Medical Record (EMR) can be extracted by Named Entity Recognition (NER). However, limited resources of labeled EMR pose a great challenge for mining medical entity terms. In this study, a novel multitask bi-directional RNN model combined with deep transfer learning is proposed as a potential solution of transferring knowledge and data augmentation to enhance NER performance with limited data. The proposed model has been evaluated using micro average F-score, macro average F-score and accuracy. It is observed that the proposed model outperforms the baseline model in the case of discharge datasets. For instance, for the case of discharge summary, the micro average F-score is improved by 2.55% and the overall accuracy is improved by 7.53%. For the case of progress notes, the micro average F-score and the overall accuracy are improved by 1.63% and 5.63%, respectively., Competing Interests: The authors have declared that no competing interests exist.
Published: 2019
Full Text: View/download PDF

34. Machine Learning, Natural Language Processing, and the Electronic Health Record: Innovations in Mental Health Services Research.

Author: Edgcomb JB and Zima B
Subjects: Health Services Research, Humans, Electronic Health Records classification, Machine Learning, Mental Health Services trends, Natural Language Processing
Abstract: An unprecedented amount of clinical information is now available via electronic health records (EHRs). These massive data sets have stimulated opportunities to adapt computational approaches to track and identify target areas for quality improvement in mental health care. In this column, three key areas of EHR data science are described: EHR phenotyping, natural language processing, and predictive modeling. For each of these computational approaches, case examples are provided to illustrate their role in mental health services research. Together, adaptation of these methods underscores the need for standardization and transparency while recognizing the opportunities and challenges ahead.
Published: 2019
Full Text: View/download PDF

35. Assessment of a Deep Learning Model Based on Electronic Health Record Data to Forecast Clinical Outcomes in Patients With Rheumatoid Arthritis.

Author: Norgeot B, Glicksberg BS, Trupin L, Lituiev D, Gianfrancesco M, Oskotsky B, Schmajuk G, Yazdany J, and Butte AJ
Subjects: Adult, Aged, Cohort Studies, Female, Forecasting, Humans, Male, Middle Aged, Prognosis, Arthritis, Rheumatoid diagnosis, Arthritis, Rheumatoid epidemiology, Deep Learning, Diagnosis, Computer-Assisted methods, Electronic Health Records classification
Abstract: Importance: Knowing the future condition of a patient would enable a physician to customize current therapeutic options to prevent disease worsening, but predicting that future condition requires sophisticated modeling and information. If artificial intelligence models were capable of forecasting future patient outcomes, they could be used to aid practitioners and patients in prognosticating outcomes or simulating potential outcomes under different treatment scenarios., Objective: To assess the ability of an artificial intelligence system to prognosticate the state of disease activity of patients with rheumatoid arthritis (RA) at their next clinical visit., Design, Setting, and Participants: This prognostic study included 820 patients with RA from rheumatology clinics at 2 distinct health care systems with different electronic health record platforms: a university hospital (UH) and a public safety-net hospital (SNH). The UH and SNH had substantially different patient populations and treatment patterns. The UH has records on approximately 1 million total patients starting in January 2012. The UH data for this study were accessed on July 1, 2017. The SNH has records on 65 000 unique individuals starting in January 2013. The SNH data for the study were collected on February 27, 2018., Exposures: Structured data were extracted from the electronic health record, including exposures (medications), patient demographics, laboratories, and prior measures of disease activity. A longitudinal deep learning model was used to predict disease activity for patients with RA at their next rheumatology clinic visit and to evaluate interhospital performance and model interoperability strategies., Main Outcomes and Measures: Model performance was quantified using the area under the receiver operating characteristic curve (AUROC). Disease activity in RA was measured using a composite index score., Results: A total of 578 UH patients (mean [SD] age, 57 [15] years; 477 [82.5%] female; 296 [51.2%] white) and 242 SNH patients (mean [SD] age, 60 [15] years; 195 [80.6%] female; 30 [12.4%] white) were included in the study. Patients at the UH compared with those at the SNH were seen more frequently (median time between visits, 100 vs 180 days) and were more frequently prescribed higher-class medications (biologics) (364 [63.0%] vs 70 [28.9%]). At the UH, the model reached an AUROC of 0.91 (95% CI, 0.86-0.96) in a test cohort of 116 patients. The UH-trained model had an AUROC of 0.74 (95% CI, 0.65-0.83) in the SNH test cohort (n = 117) despite marked differences in the patient populations. In both settings, baseline prediction using each patients' most recent disease activity score had statistically random performance., Conclusions and Relevance: The findings suggest that building accurate models to forecast complex disease outcomes using electronic health record data is possible and these models can be shared across hospitals with diverse patient populations.
Published: 2019
Full Text: View/download PDF

36. Classifying relations in clinical narratives using segment graph convolutional and recurrent neural networks (Seg-GCRNs).

Author: Li Y, Jin R, and Luo Y
Subjects: Datasets as Topic, Humans, Data Mining methods, Electronic Health Records classification, Natural Language Processing, Neural Networks, Computer
Abstract: We propose to use segment graph convolutional and recurrent neural networks (Seg-GCRNs), which use only word embedding and sentence syntactic dependencies, to classify relations from clinical notes without manual feature engineering. In this study, the relations between 2 medical concepts are classified by simultaneously learning representations of text segments in the context of sentence syntactic dependency: preceding, concept1, middle, concept2, and succeeding segments. Seg-GCRN was systematically evaluated on the i2b2/VA relation classification challenge datasets. Experiments show that Seg-GCRN attains state-of-the-art micro-averaged F-measure for all 3 relation categories: 0.692 for classifying medical treatment-problem relations, 0.827 for medical test-problem relations, and 0.741 for medical problem-medical problem relations. Comparison with the previous state-of-the-art segment convolutional neural network (Seg-CNN) suggests that adding syntactic dependency information helps refine medical word embedding and improves concept relation classification without manual feature engineering. Seg-GCRN can be trained efficiently for the i2b2/VA dataset on a GPU platform.
Published: 2019
Full Text: View/download PDF

37. Combined SNA and LDA methods to understand adverse medical events.

Author: Zhu L, Reychav I, McHaney R, Broda A, Tal Y, and Manor O
Subjects: Algorithms, Databases, Factual standards, Electronic Health Records classification, Humans, Medical Errors classification, Medical Errors prevention & control, Medication Errors classification, Medication Errors prevention & control, Models, Statistical, Safety Management classification, Electronic Health Records statistics & numerical data, Medical Errors statistics & numerical data, Medication Errors statistics & numerical data, Safety Management standards
Abstract: Objective: To compare primary medical adverse event keywords from reporters (e.g. physicians and nurses) and harm level perspectives to explore the underlying behaviors of medical adverse events using social network analysis (SNA) and latent Dirichlet allocation (LDA) leading to process improvements., Design: Used SNA methods to explore primary keywords used to describe the medical adverse events reported by physicians and nurses. Used LDA methods to investigate topics used for various harm levels. Combined the SNA and LDA methods to discover common shared topic keywords to better understand underlying behaviors of physicians and nurses in different harm level medical adverse events., Setting: Maccabi Healthcare Community is the second largest healthcare organization in Israel., Data: 17,868 medical adverse event data records collected between 2000 and 2017., Methods: Big data analysis techniques using social network analysis (SNA) and latent Dirichlet allocation (LDA)., Results: Shared topic keywords used by both physicians and nurses were determined. The study revealed that communication, information transfer, and inattentiveness were the most common problems reported in the medical adverse events data., Conclusions: Communication and inattentiveness were the most common problems reported in medical adverse events regardless of healthcare professional reporting or harm levels. Findings suggested that an information-sharing and feedback mechanism should be implemented to eliminate preventable medical adverse events. Healthcare institutions managers and government officials should take targeted actions to decrease these preventable medical adverse events through quality improvement efforts.
Published: 2019
Full Text: View/download PDF

38. Natural Language Processing for EHR-Based Computational Phenotyping.

Author: Zeng Z, Deng Y, Li X, Naumann T, and Luo Y
Subjects: Algorithms, Databases, Factual, Humans, Computational Biology methods, Electronic Health Records classification, Machine Learning, Natural Language Processing
Abstract: This article reviews recent advances in applying natural language processing (NLP) to Electronic Health Records (EHRs) for computational phenotyping. NLP-based computational phenotyping has numerous applications including diagnosis categorization, novel phenotype discovery, clinical trial screening, pharmacogenomics, drug-drug interaction (DDI), and adverse drug event (ADE) detection, as well as genome-wide and phenome-wide association studies. Significant progress has been made in algorithm development and resource construction for computational phenotyping. Among the surveyed methods, well-designed keyword search and rule-based systems often achieve good performance. However, the construction of keyword and rule lists requires significant manual effort, which is difficult to scale. Supervised machine learning models have been favored because they are capable of acquiring both classification patterns and structures from data. Recently, deep learning and unsupervised learning have received growing attention, with the former favored for its performance and the latter for its ability to find novel phenotypes. Integrating heterogeneous data sources have become increasingly important and have shown promise in improving model performance. Often, better performance is achieved by combining multiple modalities of information. Despite these many advances, challenges and opportunities remain for NLP-based computational phenotyping, including better model interpretability and generalizability, and proper characterization of feature relations in clinical narratives.
Published: 2019
Full Text: View/download PDF

39. Validation of Prediction Models for Critical Care Outcomes Using Natural Language Processing of Electronic Health Record Data.

Author: Marafino BJ, Park M, Davies JM, Thombley R, Luft HS, Sing DC, Kazi DS, DeJong C, Boscardin WJ, Dean ML, and Dudley RA
Subjects: Adult, Aged, Aged, 80 and over, Female, Humans, Intensive Care Units, Male, Middle Aged, Models, Statistical, Reproducibility of Results, Retrospective Studies, Critical Care Outcomes, Critical Illness mortality, Electronic Health Records classification, Natural Language Processing, Severity of Illness Index
Abstract: Importance: Accurate prediction of outcomes among patients in intensive care units (ICUs) is important for clinical research and monitoring care quality. Most existing prediction models do not take full advantage of the electronic health record, using only the single worst value of laboratory tests and vital signs and largely ignoring information present in free-text notes. Whether capturing more of the available data and applying machine learning and natural language processing (NLP) can improve and automate the prediction of outcomes among patients in the ICU remains unknown., Objectives: To evaluate the change in power for a mortality prediction model among patients in the ICU achieved by incorporating measures of clinical trajectory together with NLP of clinical text and to assess the generalizability of this approach., Design, Setting, and Participants: This retrospective cohort study included 101 196 patients with a first-time admission to the ICU and a length of stay of at least 4 hours. Twenty ICUs at 2 academic medical centers (University of California, San Francisco [UCSF], and Beth Israel Deaconess Medical Center [BIDMC], Boston, Massachusetts) and 1 community hospital (Mills-Peninsula Medical Center [MPMC], Burlingame, California) contributed data from January 1, 2001, through June 1, 2017. Data were analyzed from July 1, 2017, through August 1, 2018., Main Outcomes and Measures: In-hospital mortality and model discrimination as assessed by the area under the receiver operating characteristic curve (AUC) and model calibration as assessed by the modified Hosmer-Lemeshow statistic., Results: Among 101 196 patients included in the analysis, 51.3% (n = 51 899) were male, with a mean (SD) age of 61.3 (17.1) years; their in-hospital mortality rate was 10.4% (n = 10 505). A baseline model using only the highest and lowest observed values for each laboratory test result or vital sign achieved a cross-validated AUC of 0.831 (95% CI, 0.830-0.832). In contrast, that model augmented with measures of clinical trajectory achieved an AUC of 0.899 (95% CI, 0.896-0.902; P < .001 for AUC difference). Further augmenting this model with NLP-derived terms associated with mortality further increased the AUC to 0.922 (95% CI, 0.916-0.924; P < .001). These NLP-derived terms were associated with improved model performance even when applied across sites (AUC difference for UCSF: 0.077 to 0.021; AUC difference for MPMC: 0.071 to 0.051; AUC difference for BIDMC: 0.035 to 0.043; P < .001) when augmenting with NLP at each site., Conclusions and Relevance: Intensive care unit mortality prediction models incorporating measures of clinical trajectory and NLP-derived terms yielded excellent predictive performance and generalized well in this sample of hospitals. The role of these automated algorithms, particularly those using unstructured data from notes and other sources, in clinical research and quality improvement seems to merit additional investigation.
Published: 2018
Full Text: View/download PDF

40. Machine-learning analysis outperforms conventional statistical models and CT classification systems in predicting 6-month outcomes in pediatric patients sustaining traumatic brain injury.

Author: Hale AT, Stonko DP, Brown A, Lim J, Voce DJ, Gannon SR, Le TM, and Shannon CN
Subjects: Adolescent, Child, Child, Preschool, Electronic Health Records standards, Electronic Health Records trends, Female, Humans, Infant, Infant, Newborn, Machine Learning standards, Male, Time Factors, Treatment Outcome, Brain Injuries, Traumatic classification, Brain Injuries, Traumatic diagnosis, Electronic Health Records classification, International Classification of Diseases standards, International Classification of Diseases trends, Machine Learning classification, Models, Statistical
Abstract: OBJECTIVEModern surgical planning and prognostication requires the most accurate outcomes data to practice evidence-based medicine. For clinicians treating children following traumatic brain injury (TBI) these data are severely lacking. The first aim of this study was to assess published CT classification systems in the authors' pediatric cohort. A pediatric-specific machine-learning algorithm called an artificial neural network (ANN) was then created that robustly outperformed traditional CT classification systems in predicting TBI outcomes in children.METHODSThe clinical records of children under the age of 18 who suffered a TBI and underwent head CT within 24 hours after TBI (n = 565) were retrospectively reviewed.RESULTS"Favorable" outcome (alive with Glasgow Outcome Scale [GOS] score ≥ 4 at 6 months postinjury, n = 533) and "unfavorable" outcome (death at 6 months or GOS score ≤ 3 at 6 months postinjury, n = 32) were used as the primary outcomes. The area under the receiver operating characteristic (ROC) curve (AUC) was used to delineate the strength of each CT grading system in predicting survival (Helsinki, 0.814; Rotterdam, 0.838; and Marshall, 0.781). The AUC for CT score in predicting GOS score ≤ 3, a measure of overall functionality, was similarly predictive (Helsinki, 0.717; Rotterdam, 0.748; and Marshall, 0.663). An ANN was then constructed that was able to predict 6-month outcomes with profound accuracy (AUC = 0.9462 ± 0.0422).CONCLUSIONSThis study showed that machine-learning can be leveraged to more accurately predict TBI outcomes in children.
Published: 2018
Full Text: View/download PDF

41. Cardiology record multi-label classification using latent Dirichlet allocation.

Author: Pérez J, Pérez A, Casillas A, and Gojenola K
Subjects: Cardiology trends, Data Mining, Electronic Health Records statistics & numerical data, Humans, International Classification of Diseases, Models, Statistical, Cardiology statistics & numerical data, Electronic Health Records classification
Abstract: Background and Objectives: Electronic health records (EHRs) convey vast and valuable knowledge about dynamically changing clinical practices. Indeed, clinical documentation entails the inspection of massive number of records across hospitals and hospital sections. The goal of this study is to provide an efficient framework that will help clinicians explore EHRs and attain alternative views related to both patient-segments and diseases, like clustering and statistical information about the development of heart diseases (replacement of pacemakers, valve implantation etc.) in co-occurrence with other diseases. The task is challenging, dealing with lengthy health records and a high number of classes in a multi-label setting., Methods: LDA is a statistical procedure optimized to explain a document by multinomial distributions on their latent topics and the topics by distributions on related words. These distributions allow to represent collections of texts into a continuous space enabling distance-based associations between documents and also revealing the underlying topics. The topic models were assessed by means of four divergence metrics. In addition, we applied LDA to the task of multi-label document classification of EHRs according to the International Classification of Diseases 10th Clinical Modification (ICD-10). The set of EHRs had assigned 7 codes on average over 970 different codes corresponding to cardiology., Results: First, the discriminative ability of topic models was assessed using dissimilarity metrics. Nevertheless, there was an open question regarding the interpretability of automatically discovered topics. To address this issue, we explored the connection between the latent topics and ICD-10. EHRs were represented by means of LDA and, next, supervised classifiers were inferred from those representations. Given the low-dimensional representation provided by LDA, the search was computationally efficient compared to symbolic approaches such as TF-IDF. The classifiers achieved an average AUC of 77.79. As a side contribution, with this work we released the software implemented in Python and R to both train and evaluate the models., Conclusions: Topic modeling offers a means of representing EHRs in a small dimensional continuous space. This representation conveys relevant information as hidden topics in a comprehensive manner. Moreover, in practice, this compact representation allowed to extract the ICD-10 codes associated to EHRs., (Copyright © 2018 Elsevier B.V. All rights reserved.)
Published: 2018
Full Text: View/download PDF

42. Classification of glucose records from patients at diabetes risk using a combined permutation entropy algorithm.

Author: Cuesta-Frau D, Miró-Martínez P, Oltra-Crespo S, Jordán-Núñez J, Vargas B, and Vigil L
Subjects: Area Under Curve, Diagnosis, Computer-Assisted statistics & numerical data, Early Diagnosis, Electronic Health Records classification, Electronic Health Records statistics & numerical data, Humans, Predictive Value of Tests, Risk Factors, Algorithms, Blood Glucose metabolism, Blood Glucose Self-Monitoring statistics & numerical data, Diabetes Mellitus blood, Diabetes Mellitus diagnosis, Diagnosis, Computer-Assisted methods
Abstract: Background and Objectives: The adoption in clinical practice of electronic portable blood or interstitial glucose monitors has enabled the collection, storage, and sharing of massive amounts of glucose level readings. This availability of data opened the door to the application of a multitude of mathematical methods to extract clinical information not discernible with conventional visual inspection. The objective of this study is to assess the capability of Permutation Entropy (PE) to find differences between glucose records of healthy and potentially diabetic subjects., Methods: PE is a mathematical method based on the relative frequency analysis of ordinal patterns in time series that has gained a lot of attention in the last years due to its simplicity, robustness, and performance. We study in this paper the applicability of this method to glucose records of subjects at risk of diabetes in order to assess the predictability value of this metric in this context., Results: PE, along with some of its derivatives, was able to find significant differences between diabetic and non-diabetic patients from records acquired up to 3 years before the diagnosis. The quantitative results for PE were 3.5878 ± 0.3916 for the nondiabetic class, and 3.1564 ± 0.4166 for the diabetic class. With a classification accuracy higher than 70%, and by means of a Cox regression model, PE demonstrated that it is a very promising candidate as a risk stratification tool for continuous glucose monitoring., Conclusion: PE can be considered as a prospective tool for the early diagnosis of the glucoregulatory system., (Copyright © 2018 Elsevier B.V. All rights reserved.)
Published: 2018
Full Text: View/download PDF

43. Multiobjective Patient Stratification Using Evolutionary Multiobjective Optimization.

Author: Li X and Wong KC
Subjects: Algorithms, Cluster Analysis, Humans, Transcriptome, Databases, Genetic classification, Electronic Health Records classification, Medical Informatics methods, Precision Medicine methods
Abstract: One of the main challenges in modern medic-ine is to stratify patients for personalized care. Many different clustering methods have been proposed to solve the problem in both quantitative and biologically meaningful manners. However, existing clustering algorithms suffer from numerous restrictions such as experimental noises, high dimensionality, and poor interpretability. To overcome those limitations altogether, we propose and formulate a multiobjective framework based on evolutionary multiobjective optimization to balance the feature relevance and redundancy for patient stratification. To demonstrate the effectiveness of our proposed algorithms, we benchmark our algorithms across 55 synthetic datasets based on a real human transcription regulation network model, 35 real cancer gene expression datasets, and two case studies. Experimental results suggest that the proposed algorithms perform better than the recent state-of-the-arts. In addition, time complexity analysis, convergence analysis, and parameter analysis are conducted to demonstrate the robustness of the proposed methods from different perspectives. Finally, the t-Distributed Stochastic Neighbor Embedding (t-SNE) is applied to project the selected feature subsets onto two or three dimensions to visualize the high-dimensional patient stratification data.
Published: 2018
Full Text: View/download PDF

44. Concurrence of big data analytics and healthcare: A systematic review.

Author: Mehta N and Pandit A
Subjects: Data Interpretation, Statistical, Datasets as Topic, Decision Support Systems, Clinical, Electronic Health Records classification, Humans, Big Data, Data Mining methods, Electronic Health Records organization & administration, Meaningful Use organization & administration, Medical Record Linkage methods, Quality of Health Care standards
Abstract: Background: The application of Big Data analytics in healthcare has immense potential for improving the quality of care, reducing waste and error, and reducing the cost of care., Purpose: This systematic review of literature aims to determine the scope of Big Data analytics in healthcare including its applications and challenges in its adoption in healthcare. It also intends to identify the strategies to overcome the challenges., Data Sources: A systematic search of the articles was carried out on five major scientific databases: ScienceDirect, PubMed, Emerald, IEEE Xplore and Taylor & Francis. The articles on Big Data analytics in healthcare published in English language literature from January 2013 to January 2018 were considered., Study Selection: Descriptive articles and usability studies of Big Data analytics in healthcare and medicine were selected., Data Extraction: Two reviewers independently extracted information on definitions of Big Data analytics; sources and applications of Big Data analytics in healthcare; challenges and strategies to overcome the challenges in healthcare., Results: A total of 58 articles were selected as per the inclusion criteria and analyzed. The analyses of these articles found that: (1) researchers lack consensus about the operational definition of Big Data in healthcare; (2) Big Data in healthcare comes from the internal sources within the hospitals or clinics as well external sources including government, laboratories, pharma companies, data aggregators, medical journals etc.; (3) natural language processing (NLP) is most widely used Big Data analytical technique for healthcare and most of the processing tools used for analytics are based on Hadoop; (4) Big Data analytics finds its application for clinical decision support; optimization of clinical operations and reduction of cost of care (5) major challenge in adoption of Big Data analytics is non-availability of evidence of its practical benefits in healthcare., Conclusion: This review study unveils that there is a paucity of information on evidence of real-world use of Big Data analytics in healthcare. This is because, the usability studies have considered only qualitative approach which describes potential benefits but does not take into account the quantitative study. Also, majority of the studies were from developed countries which brings out the need for promotion of research on Healthcare Big Data analytics in developing countries., (Copyright © 2018 Elsevier B.V. All rights reserved.)
Published: 2018
Full Text: View/download PDF

45. Exploiting Unlabeled Texts with Clustering-based Instance Selection for Medical Relation Classification.

Author: Kim Y, Riloff E, and Meystre SM
Subjects: Algorithms, Cluster Analysis, Humans, Natural Language Processing, Vocabulary, Controlled, Electronic Health Records classification, Information Storage and Retrieval methods, Supervised Machine Learning
Abstract: Classifying relations between pairs of medical concepts in clinical texts is a crucial task to acquire empirical evidence relevant to patient care. Due to limited labeled data and extremely unbalanced class distributions, medical relation classification systems struggle to achieve good performance on less common relation types, which capture valuable information that is important to identify. Our research aims to improve relation classification using weakly supervised learning. We present two clustering-based instance selection methods that acquire a diverse and balanced set of additional training instances from unlabeled data. The first method selects one representative instance from each cluster containing only unlabeled data. The second method selects a counterpart for each training instance using clusters containing both labeled and unlabeled data. These new instance selection methods for weakly supervised learning achieve substantial recall gains for the minority relation classes compared to supervised learning, while yielding comparable performance on the majority relation classes.
Published: 2018

46. Flexible, cluster-based analysis of the electronic medical record of sepsis with composite mixture models.

Author: Mayhew MB, Petersen BK, Sales AP, Greene JD, Liu VX, and Wasson TS
Subjects: Cluster Analysis, Databases, Factual, Humans, Risk, Electronic Health Records classification, Electronic Health Records statistics & numerical data, Models, Statistical, Sepsis diagnosis, Sepsis epidemiology
Abstract: The widespread adoption of electronic medical records (EMRs) in healthcare has provided vast new amounts of data for statistical machine learning researchers in their efforts to model and predict patient health status, potentially enabling novel advances in treatment. In the case of sepsis, a debilitating, dysregulated host response to infection, extracting subtle, uncataloged clinical phenotypes from the EMR with statistical machine learning methods has the potential to impact patient diagnosis and treatment early in the course of their hospitalization. However, there are significant barriers that must be overcome to extract these insights from EMR data. First, EMR datasets consist of both static and dynamic observations of discrete and continuous-valued variables, many of which may be missing, precluding the application of standard multivariate analysis techniques. Second, clinical populations observed via EMRs and relevant to the study and management of conditions like sepsis are often heterogeneous; properly accounting for this heterogeneity is critical. Here, we describe an unsupervised, probabilistic framework called a composite mixture model that can simultaneously accommodate the wide variety of observations frequently observed in EMR datasets, characterize heterogeneous clinical populations, and handle missing observations. We demonstrate the efficacy of our approach on a large-scale sepsis cohort, developing novel techniques built on our model-based clusters to track patient mortality risk over time and identify physiological trends and distinct subgroups of the dataset associated with elevated risk of mortality during hospitalization., (Copyright © 2017 Elsevier Inc. All rights reserved.)
Published: 2018
Full Text: View/download PDF

47. Patient ranking with temporally annotated data.

Author: Bonomi L and Jiang X
Subjects: Data Curation, Databases, Factual, Delivery of Health Care, Humans, Time Factors, Data Mining methods, Electronic Health Records classification, Patients classification, Pattern Recognition, Automated methods
Abstract: Modern medical information systems enable the collection of massive temporal health data. Albeit these data have great potentials for advancing medical research, the data exploration and extraction of useful knowledge present significant challenges. In this work, we develop a new pattern matching technique which aims to facilitate the discovery of clinically useful knowledge from large temporal datasets. Our approach receives in input a set of temporal patterns modeling specific events of interest (e.g., doctor's knowledge, symptoms of diseases) and it returns data instances matching these patterns (e.g., patients exhibiting the specified symptoms). The resulting instances are ranked according to a significance score based on the p-value. Our experimental evaluations on a real-world dataset demonstrate the efficiency and effectiveness of our approach., (Copyright © 2017 Elsevier Inc. All rights reserved.)
Published: 2018
Full Text: View/download PDF

48. Segment convolutional neural networks (Seg-CNNs) for classifying relations in clinical notes.

Author: Luo Y, Cheng Y, Uzuner Ö, Szolovits P, and Starren J
Subjects: Datasets as Topic, Humans, Machine Learning, Electronic Health Records classification, Natural Language Processing, Neural Networks, Computer
Abstract: We propose Segment Convolutional Neural Networks (Seg-CNNs) for classifying relations from clinical notes. Seg-CNNs use only word-embedding features without manual feature engineering. Unlike typical CNN models, relations between 2 concepts are identified by simultaneously learning separate representations for text segments in a sentence: preceding, concept1, middle, concept2, and succeeding. We evaluate Seg-CNN on the i2b2/VA relation classification challenge dataset. We show that Seg-CNN achieves a state-of-the-art micro-average F-measure of 0.742 for overall evaluation, 0.686 for classifying medical problem-treatment relations, 0.820 for medical problem-test relations, and 0.702 for medical problem-medical problem relations. We demonstrate the benefits of learning segment-level representations. We show that medical domain word embeddings help improve relation classification. Seg-CNNs can be trained quickly for the i2b2/VA dataset on a graphics processing unit (GPU) platform. These results support the use of CNNs computed over segments of text for classifying medical relations, as they show state-of-the-art performance while requiring no manual feature engineering., (© The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.)
Published: 2018
Full Text: View/download PDF

49. Inter-labeler and intra-labeler variability of condition severity classification models using active and passive learning methods.

Author: Nissim N, Shahar Y, Elovici Y, Hripcsak G, and Moskovitch R
Subjects: Area Under Curve, Humans, Learning Curve, Observer Variation, Phenotype, Reproducibility of Results, Severity of Illness Index, Time Factors, Data Mining methods, Electronic Health Records classification, Supervised Machine Learning
Abstract: Background and Objectives: Labeling instances by domain experts for classification is often time consuming and expensive. To reduce such labeling efforts, we had proposed the application of active learning (AL) methods, introduced our CAESAR-ALE framework for classifying the severity of clinical conditions, and shown its significant reduction of labeling efforts. The use of any of three AL methods (one well known [SVM-Margin], and two that we introduced [Exploitation and Combination_XA]) significantly reduced (by 48% to 64%) condition labeling efforts, compared to standard passive (random instance-selection) SVM learning. Furthermore, our new AL methods achieved maximal accuracy using 12% fewer labeled cases than the SVM-Margin AL method. However, because labelers have varying levels of expertise, a major issue associated with learning methods, and AL methods in particular, is how to best to use the labeling provided by a committee of labelers. First, we wanted to know, based on the labelers' learning curves, whether using AL methods (versus standard passive learning methods) has an effect on the Intra-labeler variability (within the learning curve of each labeler) and inter-labeler variability (among the learning curves of different labelers). Then, we wanted to examine the effect of learning (either passively or actively) from the labels created by the majority consensus of a group of labelers., Methods: We used our CAESAR-ALE framework for classifying the severity of clinical conditions, the three AL methods and the passive learning method, as mentioned above, to induce the classifications models. We used a dataset of 516 clinical conditions and their severity labeling, represented by features aggregated from the medical records of 1.9 million patients treated at Columbia University Medical Center. We analyzed the variance of the classification performance within (intra-labeler), and especially among (inter-labeler) the classification models that were induced by using the labels provided by seven labelers. We also compared the performance of the passive and active learning models when using the consensus label., Results: The AL methods: produced, for the models induced from each labeler, smoother Intra-labeler learning curves during the training phase, compared to the models produced when using the passive learning method. The mean standard deviation of the learning curves of the three AL methods over all labelers (mean: 0.0379; range: [0.0182 to 0.0496]), was significantly lower (p=0.049) than the Intra-labeler standard deviation when using the passive learning method (mean: 0.0484; range: [0.0275-0.0724). Using the AL methods resulted in a lower mean Inter-labeler AUC standard deviation among the AUC values of the labelers' different models during the training phase, compared to the variance of the induced models' AUC values when using passive learning. The Inter-labeler AUC standard deviation, using the passive learning method (0.039), was almost twice as high as the Inter-labeler standard deviation using our two new AL methods (0.02 and 0.019, respectively). The SVM-Margin AL method resulted in an Inter-labeler standard deviation (0.029) that was higher by almost 50% than that of our two AL methods The difference in the inter-labeler standard deviation between the passive learning method and the SVM-Margin learning method was significant (p=0.042). The difference between the SVM-Margin and Exploitation method was insignificant (p=0.29), as was the difference between the Combination_XA and Exploitation methods (p=0.67). Finally, using the consensus label led to a learning curve that had a higher mean intra-labeler variance, but resulted eventually in an AUC that was at least as high as the AUC achieved using the gold standard label and that was always higher than the expected mean AUC of a randomly selected labeler, regardless of the choice of learning method (including a passive learning method). Using a paired t-test, the difference between the intra-labeler AUC standard deviation when using the consensus label, versus that value when using the other two labeling strategies, was significant only when using the passive learning method (p=0.014), but not when using any of the three AL methods., Conclusions: The use of AL methods, (a) reduces intra-labeler variability in the performance of the induced models during the training phase, and thus reduces the risk of halting the process at a local minimum that is significantly different in performance from the rest of the learned models; and (b) reduces Inter-labeler performance variance, and thus reduces the dependence on the use of a particular labeler. In addition, the use of a consensus label, agreed upon by a rather uneven group of labelers, might be at least as good as using the gold standard labeler, who might not be available, and certainly better than randomly selecting one of the group's individual labelers. Finally, using the AL methods: when provided by the consensus label reduced the intra-labeler AUC variance during the learning phase, compared to using passive learning., (Copyright © 2017 Elsevier B.V. All rights reserved.)
Published: 2017
Full Text: View/download PDF

50. Prognosis of Clinical Outcomes with Temporal Patterns and Experiences with One Class Feature Selection.

Author: Moskovitch R, Choi H, Hripcsak G, and Tatonetti N
Subjects: Algorithms, Electronic Health Records classification, Humans, Time Factors, Data Mining methods, Medical Informatics methods, Prognosis, Treatment Outcome
Abstract: Accurate prognosis of outcome events, such as clinical procedures or disease diagnosis, is central in medicine. The emergence of longitudinal clinical data, like the Electronic Health Records (EHR), represents an opportunity to develop automated methods for predicting patient outcomes. However, these data are highly dimensional and very sparse, complicating the application of predictive modeling techniques. Further, their temporal nature is not fully exploited by current methods, and temporal abstraction was recently used which results in symbolic time intervals representation. We present Maitreya, a framework for the prediction of outcome events that leverages these symbolic time intervals. Using Maitreya, learn predictive models based on the temporal patterns in the clinical records that are prognostic markers and use these markers to train predictive models for eight clinical procedures. In order to decrease the number of patterns that are used as features, we propose the use of three one class feature selection methods. We evaluate the performance of Maitreya under several parameter settings, including the one-class feature selection, and compare our results to that of atemporal approaches. In general, we found that the use of temporal patterns outperformed the atemporal methods, when representing the number of pattern occurrences.
Published: 2017
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

202 results on '"Electronic Health Records classification"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources