124 results on '"Campion, Thomas R."'
Search Results
2. Comparing automated vs. manual data collection for COVID-specific medications from electronic health records
- Author
-
Yin, Andrew L., Guo, Winston L., Sholle, Evan T., Rajan, Mangala, Alshak, Mark N., Choi, Justin J., Goyal, Parag, Jabri, Assem, Li, Han A., Pinheiro, Laura C., Wehmeyer, Graham T., Weiner, Mark, Safford, Monika M., Campion, Thomas R., and Cole, Curtis L.
- Published
- 2022
- Full Text
- View/download PDF
3. Critical carE Database for Advanced Research (CEDAR): An automated method to support intensive care units with electronic health record data
- Author
-
Schenck, Edward J., Hoffman, Katherine L., Cusick, Marika, Kabariti, Joseph, Sholle, Evan T., and Campion, Thomas R., Jr.
- Published
- 2021
- Full Text
- View/download PDF
4. Using weak supervision and deep learning to classify clinical notes for identification of current suicidal ideation
- Author
-
Cusick, Marika, Adekkanattu, Prakash, Campion, Thomas R., Jr., Sholle, Evan T., Myers, Annie, Banerjee, Samprit, Alexopoulos, George, Wang, Yanshan, and Pathak, Jyotishman
- Published
- 2021
- Full Text
- View/download PDF
5. Understanding enterprise data warehouses to support clinical and translational research: impact, sustainability, demand management, and accessibility.
- Author
-
Campion, Thomas R, Craven, Catherine K, Dorr, David A, Bernstam, Elmer V, and Knosp, Boyd M
- Abstract
Objectives Healthcare organizations, including Clinical and Translational Science Awards (CTSA) hubs funded by the National Institutes of Health, seek to enable secondary use of electronic health record (EHR) data through an enterprise data warehouse for research (EDW4R), but optimal approaches are unknown. In this qualitative study, our goal was to understand EDW4R impact, sustainability, demand management, and accessibility. Materials and Methods We engaged a convenience sample of informatics leaders from CTSA hubs (n = 21) for semi-structured interviews and completed a directed content analysis of interview transcripts. Results EDW4R have created institutional capacity for single- and multi-center studies, democratized access to EHR data for investigators from multiple disciplines, and enabled the learning health system. Bibliometrics have been challenging due to investigator non-compliance, but one hub's requirement to link all study protocols with funding records enabled quantifying an EDW4R's multi-million dollar impact. Sustainability of EDW4R has relied on multiple funding sources with a general shift away from the CTSA grant toward institutional and industry support. To address EDW4R demand, institutions have expanded staff, used different governance approaches, and provided investigator self-service tools. EDW4R accessibility can benefit from improved tools incorporating user-centered design, increased data literacy among scientists, expansion of informaticians in the workforce, and growth of team science. Discussion As investigator demand for EDW4R has increased, approaches to tracking impact, ensuring sustainability, and improving accessibility of EDW4R resources have varied. Conclusion This study adds to understanding of how informatics leaders seek to support investigators using EDW4R across the CTSA consortium and potentially elsewhere. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
6. Utilizing electronic health data and machine learning for the prediction of 30-day unplanned readmission or all-cause mortality in heart failure
- Author
-
Beecy, Ashley N., Gummalla, Manasa, Sholle, Evan, Xu, Zhuoran, Zhang, Yiye, Michalak, Kelly, Dolan, Kristina, Hussain, Yasin, Lee, Benjamin C., Zhang, Yongkang, Goyal, Parag, Campion, Thomas R., Jr., Shaw, Leslee J., Baskaran, Lohendran, and Al’Aref, Subhi J.
- Published
- 2020
- Full Text
- View/download PDF
7. A scalable method for supporting multiple patient cohort discovery projects using i2b2
- Author
-
Sholle, Evan T., Davila, Marcos A., Kabariti, Joseph, Schwartz, Julian Z., Varughese, Vinay I., Cole, Curtis L., and Campion, Thomas R., Jr.
- Published
- 2018
- Full Text
- View/download PDF
8. Automated Extraction of Tumor Staging and Diagnosis Information From Surgical Pathology Reports
- Author
-
Abedian, Sajjad, Sholle, Evan T., Adekkanattu, Prakash M., Cusick, Marika M., Weiner, Stephanie E., Shoag, Jonathan E., Hu, Jim C., and Campion, Thomas R., Jr
- Published
- 2021
- Full Text
- View/download PDF
9. Implementation of a commercial federated network of electronic health record data to enable sponsor-initiated clinical trials at an academic medical center
- Author
-
Campion, Thomas R., Jr., Sholle, Evan T., Abedian, Sajjad, Fuld, Xiaobo, McGregor, Ryan, Lewis, Alicia N., Gripp, Lauren T., Leonard, John P., and Cole, Curtis L.
- Published
- 2024
- Full Text
- View/download PDF
10. A method to automate the discharge summary hospital course for neurology patients.
- Author
-
Hartman, Vince C, Bapat, Sanika S, Weiner, Mark G, Navi, Babak B, Sholle, Evan T, and Campion, Thomas R
- Abstract
Objective Generation of automated clinical notes has been posited as a strategy to mitigate physician burnout. In particular, an automated narrative summary of a patient's hospital stay could supplement the hospital course section of the discharge summary that inpatient physicians document in electronic health record (EHR) systems. In the current study, we developed and evaluated an automated method for summarizing the hospital course section using encoder-decoder sequence-to-sequence transformer models. Materials and Methods We fine-tuned BERT and BART models and optimized for factuality through constraining beam search, which we trained and tested using EHR data from patients admitted to the neurology unit of an academic medical center. Results The approach demonstrated good ROUGE scores with an R-2 of 13.76. In a blind evaluation, 2 board-certified physicians rated 62% of the automated summaries as meeting the standard of care, which suggests the method may be useful clinically. Discussion and conclusion To our knowledge, this study is among the first to demonstrate an automated method for generating a discharge summary hospital course that approaches a quality level of what a physician would write. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
11. Implementing Unique Device Identification in Electronic Health Record Systems: Organizational, Workflow, and Technological Challenges
- Author
-
Campion,, Thomas R., Johnson, Stephen B., Paxton, Elizabeth W., Mushlin, Alvin I., and Sedrakyan, Art
- Published
- 2014
12. Health information exchange system usage patterns in three communities: Practice sites, users, patients, and data
- Author
-
Campion, Thomas R., Jr., Edwards, Alison M., Johnson, Stephen B., and Kaushal, Rainu
- Published
- 2013
- Full Text
- View/download PDF
13. Barriers and facilitators to the use of computer-based intensive insulin therapy
- Author
-
Campion, Thomas R., Jr., Waitman, Lemuel R., Lorenzi, Nancy M., May, Addison K., and Gadd, Cynthia S.
- Published
- 2011
- Full Text
- View/download PDF
14. Social, organizational, and contextual characteristics of clinical decision support systems for intensive insulin therapy: A literature review and case study
- Author
-
Campion, Thomas R., Jr., Waitman, Lemuel R., May, Addison K., Ozdas, Asli, Lorenzi, Nancy M., and Gadd, Cynthia S.
- Published
- 2010
- Full Text
- View/download PDF
15. Blogs, Wikis, and Discussion Forums: Attributes and Implications for Clinical Information Systems
- Author
-
Weiss, Jacob B, Campion, Thomas R, and Medinfo 2007: Proceedings of the 12th World Congress on Health (Medical) Informatics; Building Sustainable Health Systems
- Published
- 2007
16. Design and implementation of an integrated data model to support clinical and translational research administration.
- Author
-
Wood, Elizabeth A and Campion, Thomas R
- Abstract
Objective Both academic medical centers and biomedical research sponsors need to understand impact of scientific funding to determine value. For the National Institutes of Health (NIH) Clinical and Translational Science Award (CTSA) hubs, tracking research activities can be complex, often involving multiple institutions and continually changing federal reporting requirements. Existing research administrative systems are institution-specific and tend to focus only on parts of a greater whole. The goal of this case report is to describe a comprehensive data model that addresses this gap. Materials and Methods Web-based Center Administrative Management Program (WebCAMP) has been developed over a period of over 15 years in the context of CTSA hubs, with the recent addition of T32 programs. Its data model centers around the key concepts of people , projects , resources (inputs) , and outcomes (outputs). Results The WebCAMP data model and associated toolset for biomedical research administration integrates multiple components of the research enterprise, has been used by our CTSA hub for over 15 years and has been adopted by more than 20 other CTSA hubs. Discussion To the best of our knowledge, this study is among the first to describe a comprehensive data model for biomedical research administration. Opportunities for future work include improved grant tracking through the development of a universal identifier that spans public and private funders, and a more generic outcomes tracking model able to rapidly incorporate new outcome types. Conclusion We propose that the WebCAMP data model, or a derivative of it, could serve as a future standard for research administrative data warehousing. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
17. Effects of blood glucose transcription mismatches on a computer-based intensive insulin therapy protocol
- Author
-
Campion, Thomas R., May, Addison K., Waitman, Lemuel R., Ozdas, Asli, and Gadd, Cynthia S.
- Subjects
Diabetes therapy -- Comparative analysis ,Medical colleges -- Comparative analysis ,Insulin -- Comparative analysis ,Blood sugar -- Comparative analysis ,Genetic transcription -- Comparative analysis ,Hyperglycemia -- Care and treatment ,Health care industry ,Vanderbilt University. School of Medicine - Abstract
Purpose Computerized clinical decision support systems (CDSS) for intensive insulin therapy (IIT) generate recommendations using blood glucose (BG) values manually transcribed from testing devices to computers, a potential source of error. We quantified the frequency and effect of blood glucose transcription mismatches on IIT protocol performance. Methods We examined 38 months of retrospective data for patients treated with CDSS IIT in two intensive care units at one teaching hospital. A manually transcribed BG value not equal to a corresponding device value was deemed mismatched. For mismatches we recalculated CDSS recommendations using device BG values. We compared matched and mismatched data in terms of CDSS alerts, blood glucose variability, and dosing. Results Of 189,499 CDSS IIT instances, 5.3% contained mismatched BG values. Mismatched data triggered 93 false alerts and failed to issue 170 alerts for nurses to notify physicians. Four of six BG variability measures differed between matched and mismatched data. Overall insulin dose was greater for matched than mismatched [matched 3.8 (1.6-6.0), median (interquartile range, IQR), versus 3.6 (1.6-5.7); p < 0.001], but recalculated and actual dose were similar. In mismatches preceding hypoglycemia, recalculated insulin dose was significantly lower than actual dose [recalculated 2.7 (0.4-5.0), median (IQR), versus 3.5 (1.4-5.6)]. In mismatches preceding hyperglycemia, recalculated insulin dose was significantly greater than actual dose [recalculated 4.7 (3.3-6.2), median (IQR), versus 3.3 (2.4-4.3); p < 0.001]. Administration of recalculated doses might have prevented blood glucose excursions. Conclusions Mismatched blood glucose values can influence CDSS IIT protocol performance., Author(s): Thomas R. Jr. Campion [sup.1], Addison K. May [sup.2], Lemuel R. Waitman [sup.3], Asli Ozdas [sup.4], Cynthia S. Gadd [sup.5] Author Affiliations: (1) grid.152326.1, 0000000122647217, Department of Biomedical Informatics, [...]
- Published
- 2010
- Full Text
- View/download PDF
18. Extracting and classifying diagnosis dates from clinical notes: A case study
- Author
-
Fu, Julia T., Sholle, Evan, Krichevsky, Spencer, Scandura, Joseph, and Campion, Thomas R.
- Published
- 2020
- Full Text
- View/download PDF
19. Changing the research landscape: the New York City Clinical Data Research Network
- Author
-
Kaushal, Rainu, Hripcsak, George, Ascheim, Deborah D, Bloom, Toby, Campion, Thomas R, Jr, Caplan, Arthur L, Currie, Brian P, Check, Thomas, Deland, Emme Levin, Gourevitch, Marc N, Hart, Raffaella, Horowitz, Carol R, Kastenbaum, Isaac, Levin, Arthur Aaron, Low, Alexander F H, Meissner, Paul, Mirhaji, Parsa, Pincus, Harold A, Scaglione, Charles, Shelley, Donna, and Tobin, Jonathan N
- Published
- 2014
- Full Text
- View/download PDF
20. Characterizing Basic and Complex Usage of i2b2 at an Academic Medical Center
- Author
-
Sholle, Evan T., Cusick, Marika, Davila, Marcos A., Kabariti, Joseph, Flores, Steven, and Campion, Thomas R.
- Subjects
Articles - Abstract
Developed to enable basic queries for cohort discovery, i2b2 has evolved to support complex queries. Little is known whether query sophistication - and the informatics resources required to support it - addresses researcher needs. In three years at our institution, 609 researchers ran 6,662 queries and requested re-identification of 80 patient cohorts to support specific studies. After characterizing all queries as "basic" or "complex" with respect to use of sophisticated query features, we found that the majority of all queries, and the majority of queries resulting in a request for cohort re-identification, did not use complex i2b2 features. Data domains that required extensive effort to implement saw relatively little use compared to common domains (e.g., diagnoses). These findings suggest that efforts to ensure the performance of basic queries using common data domains may better serve the needs of the research community than efforts to integrate novel domains or introduce complex new features.
- Published
- 2020
21. Research data warehouse best practices: catalyzing national data sharing through informatics innovation.
- Author
-
Murphy, Shawn N, Visweswaran, Shyam, Becich, Michael J, Campion, Thomas R, Knosp, Boyd M, Melton-Meaux, Genevieve B, and Lenert, Leslie A
- Abstract
Research Patient Data Repositories (RPDRs) have become essential infrastructure for traditional Clinical and Translational Science Award (CTSA) programs and increasingly for a wide range of research consortia[[1], [3]] and learning health system networks. In the data commons, datasets associated with various research projects were made findable, accessible, interoperable, and reusable (FAIR) through the open-source Gen3 data platform that enables the interoperation and creation of cloud-based data resources. Compared to on-premises infrastructure, cloud storage is inexpensive and readily available with automatic backups, and secure analytics in the cloud are easily enabled on large datasets containing protected health information. [Extracted from the article]
- Published
- 2022
- Full Text
- View/download PDF
22. An architecture for research computing in health to support clinical and translational investigators with electronic patient data.
- Author
-
Campion, Thomas R, Sholle, Evan T, Pathak, Jyotishman, Johnson, Stephen B, Leonard, John P, and Cole, Curtis L
- Abstract
Objective: Obtaining electronic patient data, especially from electronic health record (EHR) systems, for clinical and translational research is difficult. Multiple research informatics systems exist but navigating the numerous applications can be challenging for scientists. This article describes Architecture for Research Computing in Health (ARCH), our institution's approach for matching investigators with tools and services for obtaining electronic patient data.Materials and Methods: Supporting the spectrum of studies from populations to individuals, ARCH delivers a breadth of scientific functions-including but not limited to cohort discovery, electronic data capture, and multi-institutional data sharing-that manifest in specific systems-such as i2b2, REDCap, and PCORnet. Through a consultative process, ARCH staff align investigators with tools with respect to study design, data sources, and cost. Although most ARCH services are available free of charge, advanced engagements require fee for service.Results: Since 2016 at Weill Cornell Medicine, ARCH has supported over 1200 unique investigators through more than 4177 consultations. Notably, ARCH infrastructure enabled critical coronavirus disease 2019 response activities for research and patient care.Discussion: ARCH has provided a technical, regulatory, financial, and educational framework to support the biomedical research enterprise with electronic patient data. Collaboration among informaticians, biostatisticians, and clinicians has been critical to rapid generation and analysis of EHR data.Conclusion: A suite of tools and services, ARCH helps match investigators with informatics systems to reduce time to science. ARCH has facilitated research at Weill Cornell Medicine and may provide a model for informatics and research leaders to support scientists elsewhere. [ABSTRACT FROM AUTHOR]- Published
- 2022
- Full Text
- View/download PDF
23. Understanding enterprise data warehouses to support clinical and translational research: enterprise information technology relationships, data governance, workforce, and cloud computing.
- Author
-
Knosp, Boyd M, Craven, Catherine K, Dorr, David A, Bernstam, Elmer V, and Campion, Thomas R
- Abstract
Objective: Among National Institutes of Health Clinical and Translational Science Award (CTSA) hubs, effective approaches for enterprise data warehouses for research (EDW4R) development, maintenance, and sustainability remain unclear. The goal of this qualitative study was to understand CTSA EDW4R operations within the broader contexts of academic medical centers and technology.Materials and Methods: We performed a directed content analysis of transcripts generated from semistructured interviews with informatics leaders from 20 CTSA hubs.Results: Respondents referred to services provided by health system, university, and medical school information technology (IT) organizations as "enterprise information technology (IT)." Seventy-five percent of respondents stated that the team providing EDW4R service at their hub was separate from enterprise IT; strong relationships between EDW4R teams and enterprise IT were critical for success. Managing challenges of EDW4R staffing was made easier by executive leadership support. Data governance appeared to be a work in progress, as most hubs reported complex and incomplete processes, especially for commercial data sharing. Although nearly all hubs (n = 16) described use of cloud computing for specific projects, only 2 hubs reported using a cloud-based EDW4R. Respondents described EDW4R cloud migration facilitators, barriers, and opportunities.Discussion: Descriptions of approaches to how EDW4R teams at CTSA hubs work with enterprise IT organizations, manage workforces, make decisions about data, and approach cloud computing provide insights for institutions seeking to leverage patient data for research.Conclusion: Identification of EDW4R best practices is challenging, and this study helps identify a breadth of viable options for CTSA hubs to consider when implementing EDW4R services. [ABSTRACT FROM AUTHOR]- Published
- 2022
- Full Text
- View/download PDF
24. Characteristics and effects of nurse dosing over-rides on computer-based intensive insulin therapy protocol performance
- Author
-
Campion, Thomas R, Jr, May, Addison K, Waitman, Lemuel R, Ozdas, Asli, Lorenzi, Nancy M, and Gadd, Cynthia S
- Published
- 2011
- Full Text
- View/download PDF
25. Extracting social determinants of health from electronic health records using natural language processing: a systematic review.
- Author
-
Patra, Braja G, Sharma, Mohit M, Vekaria, Veer, Adekkanattu, Prakash, Patterson, Olga V, Glicksberg, Benjamin, Lepow, Lauren A, Ryu, Euijung, Biernacka, Joanna M, Furmanchuk, Al'ona, George, Thomas J, Hogan, William, Wu, Yonghui, Yang, Xi, Bian, Jiang, Weissman, Myrna, Wickramaratne, Priya, Mann, J John, Olfson, Mark, and Campion, Thomas R
- Abstract
Objective: Social determinants of health (SDoH) are nonclinical dispositions that impact patient health risks and clinical outcomes. Leveraging SDoH in clinical decision-making can potentially improve diagnosis, treatment planning, and patient outcomes. Despite increased interest in capturing SDoH in electronic health records (EHRs), such information is typically locked in unstructured clinical notes. Natural language processing (NLP) is the key technology to extract SDoH information from clinical text and expand its utility in patient care and research. This article presents a systematic review of the state-of-the-art NLP approaches and tools that focus on identifying and extracting SDoH data from unstructured clinical text in EHRs.Materials and Methods: A broad literature search was conducted in February 2021 using 3 scholarly databases (ACL Anthology, PubMed, and Scopus) following Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. A total of 6402 publications were initially identified, and after applying the study inclusion criteria, 82 publications were selected for the final review.Results: Smoking status (n = 27), substance use (n = 21), homelessness (n = 20), and alcohol use (n = 15) are the most frequently studied SDoH categories. Homelessness (n = 7) and other less-studied SDoH (eg, education, financial problems, social isolation and support, family problems) are mostly identified using rule-based approaches. In contrast, machine learning approaches are popular for identifying smoking status (n = 13), substance use (n = 9), and alcohol use (n = 9).Conclusion: NLP offers significant potential to extract SDoH data from narrative clinical notes, which in turn can aid in the development of screening tools, risk prediction models, and clinical decision support systems. [ABSTRACT FROM AUTHOR]- Published
- 2021
- Full Text
- View/download PDF
26. Peers, regulators, and professions: the influence of organizations in intensive insulin therapy adoption
- Author
-
Campion, Thomas R., Jr. and Gadd, Cynthia S.
- Subjects
Critically ill -- Care and treatment ,Hypoglycemia -- Care and treatment ,Insulin -- Dosage and administration ,Insulin -- Risk factors ,Critical care medicine -- Standards ,Critical care medicine -- Economic aspects ,Critical care medicine -- Research ,Medical care, Cost of -- Research ,Medical protocols -- Research ,Business ,Health care industry - Published
- 2009
27. Implementation of Informatics to Support the NIH All of Us Research Program in a Healthcare Provider Organization
- Author
-
Turner, Scott P., Pompea, Sean T., Williams, Kelly L., Kraemer, David A., Sholle, Evan T., Chen, Cindy, Cole, Curtis L., Kaushal, Rainu, and Campion, Thomas R.
- Subjects
Articles - Abstract
The NIH All of Us Research Program, a national effort to collect biospecimens and health data for over one million participants from across the United States, requires participating healthcare provider organizations (HPOs) to use informatics tools maintained by the NIH to manage participant consent, biospecimen processing, physical measurements, and other workflows. HPOs also maintain distinct workflows for handling overlapping tasks within their individual aegis, which do not necessarily achieve seamless interoperability with NIH-maintained cloud-based systems. At our HPO, we implemented informatics to address gaps in enrollment workflows and hardware, clinical workflow integration, patient engagement, laboratory support, and study team reporting. In this case report we detail our approach to inform efforts at other institutions for the NIH All of Us Research Program and other studies.
- Published
- 2019
28. Ascertaining Depression Severity by Extracting Patient Health Questionnaire-9 (PHQ-9) Scores from Clinical Notes
- Author
-
Adekkanattu, Prakash, Sholle, Evan T., DeFerio, Joseph, Pathak, Jyotishman, Johnson, Stephen B., and Campion, Thomas R.
- Subjects
Adult ,Depressive Disorder ,mental disorders ,Electronic Health Records ,Humans ,Information Storage and Retrieval ,Articles ,Patient Health Questionnaire ,behavioral disciplines and activities ,Severity of Illness Index ,Natural Language Processing - Abstract
The Patient Health Questionnaire-9 (PHQ-9) is a validated instrument for assessing depression severity. While some electronic health record (EHR) systems capture PHQ-9 scores in a structured format, unstructured clinical notes remain the only source in many settings, which presents data retrieval challenges for research and clinical decision support. To address this gap, we extended the open-source Leo natural language processing (NLP) platform to extract PHQ-9 scores from clinical notes and evaluated performance using EHR data for n=123,703 patients who were prescribed antidepressants. Compared to a reference standard, the NLP method exhibited high accuracy (97%), sensitivity (98%), precision (97%), and F-score (97%). Furthermore, of patients with PHQ-9 scores identified by the NLP method, 31% (n=498) had at least one PHQ-9 score clinically indicative of major depressive disorder (MDD), but lacked a structured ICD-9/10 diagnosis code for MDD. This NLP technique may facilitate accurate identification and stratification of patients with depression.
- Published
- 2018
29. A Comparative Analysis of the Respiratory Subscore of the Sequential Organ Failure Assessment Scoring System.
- Author
-
Schenck, Edward J., Hoffman, Katherine L., Oromendia, Clara, Sanchez, Elizabeth, Finkelsztein, Eli J., Kyung Sook Hong, Kabariti, Joseph, Torres, Lisa K., Harrington, John S., Siempos, Ilias I., Choi, Augustine M. K., Campion Jr., Thomas R., Hoffman, Katherine, Hong, Kyung Sook, and Campion, Thomas R Jr
- Abstract
Rationale: The Sequential Organ Failure Assessment (SOFA) tool is a commonly used measure of illness severity. Calculation of the respiratory subscore of SOFA is frequently limited by missing arterial oxygen pressure (PaO2) data. Although missing PaO2 data are commonly replaced with normal values, the performance of different methods of substituting PaO2 for SOFA calculation is unclear. Objectives: The study objective was to compare the performance of different substitution strategies for missing PaO2 data for SOFA score calculation. Methods: This retrospective cohort study was performed using the Weill Cornell Critical Care Database for Advanced Research from a tertiary care hospital in the United States. All adult patients admitted to an intensive care unit (ICU) from 2011 to 2019 with an available respiratory SOFA score were included. We analyzed the availability of the PaO2/fraction of inspired oxygen (FiO2) ratio on the first day of ICU admission. In those without a PaO2/FiO2 ratio available, the ratio of oxygen saturation as measured by pulse oximetry to FiO2 was used to calculate a respiratory SOFA subscore according to four methods (linear substitution [Rice], nonlinear substitution [Severinghaus], modified respiratory SOFA, and multiple imputation by chained equations [MICE]) as well as the missing-as-normal technique. We then compared how well the different total SOFA scores discriminated in-hospital mortality. We performed several subgroup and sensitivity analyses. Results: We identified 35,260 unique visits, of which 9,172 included predominant respiratory failure. PaO2 data were available for 14,939 (47%). The area under the receiver operating characteristic curve for each substitution technique for discriminating in-hospital mortality was higher than that for the missing-as-normal technique (0.78 [0.77-0.79]) in all analyses (modified, 0.80 [0.79-0.81]; Rice, 0.80 [0.79-0.81]; Severinghaus, 0.80 [0.79-0.81]; and MICE, 0.80 [0.79-0.81]) (P < 0.01). Each substitution method had a higher accuracy for discriminating in-hospital mortality (MICE, 0.67; Rice, 0.67; modified, 0.66; and Severinghaus, 0.66) than the missing-as-normal technique. Model calibration for in-hospital mortality was less precise for the missing-as-normal technique than for the other substitution techniques at the lower range of SOFA and among the subgroups. Conclusions: Using physiologic and statistical substitution methods improved the total SOFA score's ability to discriminate mortality compared with the missing-as-normal technique. Treating missing data as normal may result in underreporting the severity of illness compared with using substitution. The simplicity of a direct oxygen saturation as measured by pulse oximetry/FiO2 ratio-modified SOFA technique makes it an attractive choice for electronic health record-based research. This knowledge can inform comparisons of severity of illness across studies that used different techniques. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
30. Secondary Use of Patients’ Electronic Records (SUPER): An Approach for Meeting Specific Data Needs of Clinical and Translational Researchers
- Author
-
Sholle, Evan T., Kabariti, Joseph, Johnson, Stephen B., Leonard, John P., Pathak, Jyotishman, Varughese, Vinay I., Cole, Curtis L., and Campion, Thomas R.
- Subjects
Systems Integration ,Translational Research, Biomedical ,Academic Medical Centers ,Data Aggregation ,Data Warehousing ,Clinical Studies as Topic ,Data Mining ,Electronic Health Records ,Humans ,New York City ,Articles ,Information Systems - Abstract
Academic medical centers commonly approach secondary use of electronic health record (EHR) data by implementing centralized clinical data warehouses (CDWs). However, CDWs require extensive resources to model data dimensions and harmonize clinical terminology, which can hinder effective support of the specific and varied data needs of investigators. We hypothesized that an approach that aggregates raw data from source systems, ignores initial modeling typical of CDWs, and transforms raw data for specific research purposes would meet investigator needs. The approach has successfully enabled multiple tools that provide utility to the institutional research enterprise. To our knowledge, this is the first complete description of a methodology for electronic patient data acquisition and provisioning that ignores data harmonization at the time of initial storage in favor of downstream transformation to address specific research questions and applications.
- Published
- 2018
31. Risk of Ischemic Stroke in Patients With Coronavirus Disease 2019 (COVID-19) vs Patients With Influenza.
- Author
-
Merkler, Alexander E., Parikh, Neal S., Mir, Saad, Gupta, Ajay, Kamel, Hooman, Lin, Eaton, Lantos, Joshua, Schenck, Edward J., Goyal, Parag, Bruce, Samuel S., Kahan, Joshua, Lansdale, Kelsey N., LeMoss, Natalie M., Murthy, Santosh B., Stieg, Philip E., Fink, Matthew E., Iadecola, Costantino, Segal, Alan Z., Cusick, Marika, and Campion, Thomas R.
- Published
- 2020
- Full Text
- View/download PDF
32. COVID-19 Viral and Serology Testing in New York City Health Care Workers.
- Author
-
Racine-Brzostek, Sabrina E, Yang, He S, Chadburn, Amy, Orlander, Duncan, An, Anjile, Campion, Thomas R, Yee, Jim, Chen, Zhengming, Loda, Massimo, Zhao, Zhen, Kaushal, Rainu, and Cushing, Melissa M
- Subjects
COVID-19 ,URBAN health ,MEDICAL care ,MEDICAL personnel ,SEROLOGY - Published
- 2020
- Full Text
- View/download PDF
33. Understanding enterprise data warehouses to support clinical and translational research.
- Author
-
Campion, Thomas R, Craven, Catherine K, Dorr, David A, and Knosp, Boyd M
- Abstract
Objective: Among National Institutes of Health Clinical and Translational Science Award (CTSA) hubs, adoption of electronic data warehouses for research (EDW4R) containing data from electronic health record systems is nearly ubiquitous. Although benefits of EDW4R include more effective, efficient support of scientists, little is known about how CTSA hubs have implemented EDW4R services. The goal of this qualitative study was to understand the ways in which CTSA hubs have operationalized EDW4R to support clinical and translational researchers.Materials and Methods: After conducting semistructured interviews with informatics leaders from 20 CTSA hubs, we performed a directed content analysis of interview notes informed by naturalistic inquiry.Results: We identified 12 themes: organization and data; oversight and governance; data access request process; data access modalities; data access for users with different skill sets; engagement, communication, and literacy; service management coordinated with enterprise information technology; service management coordinated within a CTSA hub; service management coordinated between informatics and biostatistics; funding approaches; performance metrics; and future trends and current technology challenges.Discussion: This study is a step in developing an improved understanding and creating a common vocabulary about EDW4R operations across institutions. Findings indicate an opportunity for establishing best practices for EDW4R operations in academic medicine. Such guidance could reduce the costs associated with developing an EDW4R by establishing a clear roadmap and maturity path for institutions to follow.Conclusions: CTSA hubs described varying approaches to EDW4R operations that may assist other institutions in better serving investigators with electronic patient data. [ABSTRACT FROM AUTHOR]- Published
- 2020
- Full Text
- View/download PDF
34. Underserved populations with missing race ethnicity data differ significantly from those with structured race/ethnicity documentation.
- Author
-
Sholle, Evan T, Pinheiro, Laura C, Adekkanattu, Prakash, Davila, Marcos A, Johnson, Stephen B, Pathak, Jyotishman, Sinha, Sanjai, Li, Cassidie, Lubansky, Stasi A, Safford, Monika M, and Campion, Thomas R
- Abstract
Objective: We aimed to address deficiencies in structured electronic health record (EHR) data for race and ethnicity by identifying black and Hispanic patients from unstructured clinical notes and assessing differences between patients with or without structured race/ethnicity data.Materials and Methods: Using EHR notes for 16 665 patients with encounters at a primary care practice, we developed rule-based natural language processing (NLP) algorithms to classify patients as black/Hispanic. We evaluated performance of the method against an annotated gold standard, compared race and ethnicity between NLP-derived and structured EHR data, and compared characteristics of patients identified as black or Hispanic using only NLP vs patients identified as such only in structured EHR data.Results: For the sample of 16 665 patients, NLP identified 948 additional patients as black, a 26%increase, and 665 additional patients as Hispanic, a 20% increase. Compared with the patients identified as black or Hispanic in structured EHR data, patients identified as black or Hispanic via NLP only were older, more likely to be male, less likely to have commercial insurance, and more likely to have higher comorbidity.Discussion: Structured EHR data for race and ethnicity are subject to data quality issues. Supplementing structured EHR race data with NLP-derived race and ethnicity may allow researchers to better assess the demographic makeup of populations and draw more accurate conclusions about intergroup differences in health outcomes.Conclusions: Black or Hispanic patients who are not documented as such in structured EHR race/ethnicity fields differ significantly from those who are. Relatively simple NLP can help address this limitation. [ABSTRACT FROM AUTHOR]- Published
- 2019
- Full Text
- View/download PDF
35. Relationship between left atrial volume and ischemic stroke subtype.
- Author
-
Kamel, Hooman, Okin, Peter M., Merkler, Alexander E., Navi, Babak B., Campion, Thomas R., Devereux, Richard B., Díaz, Iván, Weinsaft, Jonathan W., and Kim, Jiwon
- Subjects
STROKE ,ATRIAL fibrillation ,ANALYSIS of variance ,IMAGING systems - Abstract
Objective: Atrial cardiopathy without atrial fibrillation (AF) may be a potential cardiac source of embolic strokes of undetermined source (ESUS). Atrial volume is a feature of atrial cardiopathy, but the relationship between atrial volume and ESUS remains unclear. Methods: We compared left atrial volume among ischemic stroke subtypes in the Cornell Acute Stroke Academic Registry (CAESAR), which includes all patients with acute ischemic stroke at our hospital since 2011. Stroke subtype was determined by neurologists per the TOAST classification and consensus ESUS definition. Left atrial volume index (LAVI) was obtained directly from our echocardiography image system (Xcelera, Philips Healthcare). We used t‐tests and analysis of variance for unadjusted comparisons and targeted minimum loss‐based estimation for comparisons adjusted for demographics and comorbidities. Results: Among 2116 patients in CAESAR from 2011 to 2016, 1293 had LAVI measurements. LAVI varied across subtypes (P < 0.001) from 48.8 (±30.0) mL/m2 in cardioembolic strokes to 30.3 (±10.5) mL/m2 in small‐vessel strokes. LAVI was larger in ESUS (33.3 ± 13.6 mL/m2) than in small‐ or large‐vessel stroke (30.9 ± 10.7 mL/m2) (P = 0.01). The association between LAVI and ESUS persisted after the adjustment for demographics and comorbidities: a 10 mL/m2 increase in LAVI was associated with a 4.4% increase in ESUS probability (95% CI, 2.3%–6.4%). Results were similar after excluding patients with AF during post‐discharge heart‐rhythm monitoring. Interpretation: We found larger left atria among patients with ESUS versus non‐cardioembolic stroke. There was significant overlap in left atrial size between ESUS and non‐cardioembolic stroke, highlighting that many ESUS cases are not cardioembolic. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
36. Adoption of Clinical Data Exchange in Community Settings: A Comparison of Two Approaches
- Author
-
Campion, Thomas R., Vest, Joshua R., Kern, Lisa M., and Kaushal, Rainu
- Subjects
Cross-Sectional Studies ,Health Information Exchange ,New York ,American Recovery and Reinvestment Act ,Electronic Health Records ,Humans ,Hospitals, Community ,Articles ,Community Health Centers ,Long-Term Care ,United States - Abstract
Adoption of electronic clinical data exchange (CDE) across disparate healthcare organizations remains low in community settings despite demonstrated benefits. To expand CDE in communities, New York State funded sixteen community-based organizations to implement point-to-point directed exchange (n=8) and multi-site query-based health information exchange (HIE) (n=8). We conducted a cross-sectional study to compare adoption of directed exchange versus query-based HIE. From 2008 to 2011, 66% (n=1,747) of providers targeted for directed exchange and 21% (n=5,427) of providers targeted for query-based HIE adopted CDE. Funding per provider adoptee was almost two times greater for directed exchange (median (interquartile range): $25,535 ($17,391-$42,240)) than query-based HIE ($14,649 ($9,897-$28,078)), although the difference was not statistically significant. Because its infrastructure can cover larger populations using similar levels of public funding, query-based HIE may scale more broadly than directed exchange. To our knowledge, this is among the first studies to compare directed exchange versus query-based HIE.
- Published
- 2014
37. Using a Health Information Exchange System for Imaging Information: Patterns and Predictors
- Author
-
Vest, Joshua R, Grinspan, Zachary M, Kern, Lisa M, Campion, Thomas R, and Kaushal, Rainu
- Subjects
Diagnostic Imaging ,Health Information Systems ,Radiology Information Systems ,Health Information Management ,New York ,Humans ,Articles ,Medical Record Linkage ,Regional Medical Programs ,Medical Informatics - Abstract
Health information exchange (HIE) systems may address the challenges that prevent easy access to patients' existing radiological information at the point of care. However, little is known about the factors associated with usage of HIE for radiology reports, nor about how reports are shared with an exchange network. We analyzed the system log files from a regional health information organization in upstate New York matched with insurance claims files using network analysis and regression modeling. The exchange network was dominated by a few key information sources. Outpatient users overall accessed 17 times more radiology reports than inpatient and ED users combined. Additionally, as the number of exchange partners increased per organization, the average number of reports exchanged by that organization also increased. Radiology reports were most likely to be accessed by physicians and other clinical users. These findings have implications for those operating and fostering exchange activity.
- Published
- 2013
38. A case study evaluating the portability of an executable computable phenotype algorithm across multiple institutions and electronic health record environments.
- Author
-
Pacheco, Jennifer A, Rasmussen, Luke V, Kiefer, Richard C, Campion, Thomas R, Speltz, Peter, Carroll, Robert J, Stallings, Sarah C, Mo, Huan, Ahuja, Monika, Jiang, Guoqian, LaRose, Eric R, Peissig, Peggy L, Shang, Ning, Benoit, Barbara, Gainer, Vivian S, Borthwick, Kenneth, Jackson, Kathryn L, Sharma, Ambrish, Wu, Andy Yizhou, and Kho, Abel N
- Abstract
Electronic health record (EHR) algorithms for defining patient cohorts are commonly shared as free-text descriptions that require human intervention both to interpret and implement. We developed the Phenotype Execution and Modeling Architecture (PhEMA, http://projectphema.org) to author and execute standardized computable phenotype algorithms. With PhEMA, we converted an algorithm for benign prostatic hyperplasia, developed for the electronic Medical Records and Genomics network (eMERGE), into a standards-based computable format. Eight sites (7 within eMERGE) received the computable algorithm, and 6 successfully executed it against local data warehouses and/or i2b2 instances. Blinded random chart review of cases selected by the computable algorithm shows PPV ≥90%, and 3 out of 5 sites had >90% overlap of selected cases when comparing the computable algorithm to their original eMERGE implementation. This case study demonstrates potential use of PhEMA computable representations to automate phenotyping across different EHR systems, but also highlights some ongoing challenges. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
39. Public and private sector roles in health information technology policy: Insights from the implementation and operation of exchange efforts in the United States.
- Author
-
Vest, Joshua R., Campion, Thomas R., Kern, Lisa M., and Kaushal, Rainu
- Abstract
Abstract: Objectives: In the US, the federal and state governments are supporting interoperable health information technology (HIT) and health information exchange (HIE) through policy interventions and financial investments. However, private healthcare organizations and partnerships have also been active in establishing exchange activities, promoting interoperability, and developing technologies. This combination of influence from different actors has resulted in a rapidly changing healthcare environment. In this context, we sought insights into the optimal roles for the public and private sectors in HIT/HIE policy development and implementation. Methods: We leveraged the concurrency of federal and New York State initiatives to spur HIT/HIE adoption by interviewing HIT experts (n=17). Interviewees represented federal and state government agencies, healthcare providers, and exchange organizations. A semi-structured interview guide with open-ended questions covered the domains of organization, value, privacy, security, and evaluation. We analyzed transcripts using a general inductive and comparative approach. Results: Interviewees assigned roles for standard setting and funding to the federal government and suggested states were better positioned to offer implementation support. Interviewees forwarded a public–private partnership model as a potential solution to the limitations facing the private and public sectors. Conclusions: HIT/HIE policy is a complex issue involving standards, privacy, funding and implementation. When New York State began funding HIT, significant federal intervention did not exist. Since the launch of New York State’s program and the subsequent federal Meaningful Use criteria, interviewees expressed distinct but complementary roles for both state and federal governments and saw an avenue to include the private sector through public–private partnerships. [Copyright &y& Elsevier]
- Published
- 2014
- Full Text
- View/download PDF
40. Clinical Characteristics of Covid-19 in New York City.
- Author
-
Goyal, Parag, Choi, Justin J., Safford, Monika M., Pinheiro, Laura C, Schenck, Edward J, Chen, Ruijun, Jabri, Assem, Satlin, Michael J, Campion, Thomas R Jr, Nahid, Musarrat, Ringel, Joanna B, Hoffman, Katherine L, Alshak, Mark N, Li, Han A, Wehmeyer, Graham T, Rajan, Mangala, Reshetnyak, Evgeniya, Hupert, Nathaniel, Horn, Evelyn M, and Martinez, Fernando J
- Subjects
- *
COVID-19 , *OBESITY complications , *CORONAVIRUS disease treatment , *VIRAL pneumonia , *RESEARCH , *RESEARCH methodology , *RETROSPECTIVE studies , *EVALUATION research , *MEDICAL cooperation , *ARTIFICIAL respiration , *COMPARATIVE studies , *EPIDEMICS , *RESEARCH funding - Abstract
The article offers information on the public health challenges faced by the Covid-19 pandemic, along with clinical characteristics of Covid-19 pandemic. It discusses how hospitals adopted an early-intubation strategy with limited use of high flow nasal cannulae during the pandemic. It mentions the high demand for invasive mechanical ventilation in the hospitals amid the pandemic.
- Published
- 2020
- Full Text
- View/download PDF
41. Increasing Blood Glucose Variability Heralds Hypoglycemia in the Critically Ill 1
- Author
-
Kauffmann, Rondi M., Hayes, Rachel M., Buske, Brad D., Norris, Patrick R., Campion, Thomas R., Dortch, Marcus, Jenkins, Judith M., Collier, Bryan R., and May, Addison K.
- Subjects
- *
HYPOGLYCEMIA , *BLOOD sugar , *CRITICALLY ill , *MORTALITY , *INTENSIVE care units , *VASOCONSTRICTORS , *DIABETES , *HEALTH outcome assessment , *DISEASE risk factors - Abstract
Background: Control of hyperglycemia improves outcomes, but increases the risk of hypoglycemia. Recent evidence suggests that blood glucose variability (BGV) is more closely associated with mortality than either isolated or mean BG. We hypothesized that differences in BGV over time are associated with hypoglycemia and can be utilized to estimate risk of hypoglycemia (<50 mg/dL). Materials and Methods: Patients treated with intravenous insulin in the Surgical Intensive Care Unit of a tertiary care center formed the retrospective cohort. Exclusion criteria included death within 24 h of admission. We describe BGV in patients over time and its temporal relationship to hypoglycemic events. The risk of hypoglycemia for each BG measurement was estimated in a multivariable regression model. Predictors were measures of BGV, infusions of dextrose and vasopressors, patient demographics, illness severity, and BG measurements. Results: A total of 66,592 BG measurements were collected on 1392 patients. Hypoglycemia occurred in 154 patients (11.1%). Patient BGV fluctuated over time, and increased in the 24 h preceding a hypoglycemic event. In crude and adjusted analyses, higher BGV was positively associated with a hypoglycemia (OR 1.41, P < 0.001). Previous hypoglycemic events and time since previous BG measurement were also positively associated with hypoglycemic events. Severity of illness, vasopressor use, and diabetes were not independently associated with hypoglycemia. Conclusions: BGV increases in the 24 h preceding hypoglycemia, and patients are at increased risk during periods of elevated BG variability. Prospective measurement of variability may identify periods of increased risk for hypoglycemia, and provide an opportunity to mitigate this risk. [Copyright &y& Elsevier]
- Published
- 2011
- Full Text
- View/download PDF
42. Closing the gap between open source and commercial large language models for medical evidence summarization.
- Author
-
Zhang G, Jin Q, Zhou Y, Wang S, Idnay B, Luo Y, Park E, Nestor JG, Spotnitz ME, Soroush A, Campion TR Jr, Lu Z, Weng C, and Peng Y
- Abstract
Large language models (LLMs) hold great promise in summarizing medical evidence. Most recent studies focus on the application of proprietary LLMs. Using proprietary LLMs introduces multiple risk factors, including a lack of transparency and vendor dependency. While open-source LLMs allow better transparency and customization, their performance falls short compared to the proprietary ones. In this study, we investigated to what extent fine-tuning open-source LLMs can further improve their performance. Utilizing a benchmark dataset, MedReview, consisting of 8161 pairs of systematic reviews and summaries, we fine-tuned three broadly-used, open-sourced LLMs, namely PRIMERA, LongT5, and Llama-2. Overall, the performance of open-source models was all improved after fine-tuning. The performance of fine-tuned LongT5 is close to GPT-3.5 with zero-shot settings. Furthermore, smaller fine-tuned models sometimes even demonstrated superior performance compared to larger zero-shot models. The above trends of improvement were manifested in both a human evaluation and a larger-scale GPT4-simulated evaluation., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
43. Comparative Merits of Available Mortality Data Sources for Clinical Research.
- Author
-
Sholle ET, Davila MA, Kostka K, Abedian S, Cusick M, Krichevsky S, Pathak J, and Campion TR
- Subjects
- Humans, Academic Medical Centers, Information Sources
- Abstract
Obtaining reliable data on patient mortality is a critical challenge facing observational researchers seeking to conduct studies using real-world data. As these analyses are conducted more broadly using newly-available sources of real-world evidence, missing data can serve as a rate-limiting factor. We conducted a comparison of mortality data sources from different stakeholder perspectives - academic medical center (AMC) informatics service providers, AMC research coordinators, industry analytics professionals, and academics - to understand the strengths and limitations of differing mortality data sources: locally generated data from sites conducting research, data provided by governmental sources, and commercially available data sets. Researchers seeking to conduct observational studies using extant data should consider these factors in sourcing outcomes data for their populations of interest., (©2023 AMIA - All rights reserved.)
- Published
- 2024
44. Linking Patient Encounters across Primary and Ancillary Electronic Health Record Systems: A Comparison of Two Approaches.
- Author
-
Davila MA 3rd, Sholle ET, Fuld X, Israel ML, Cole CL, and Campion TR Jr
- Abstract
Background: To achieve scientific goals, researchers often require integration of data from a primary electronic health record (EHR) system and one or more ancillary EHR systems used during the same patient care encounter. Although studies have demonstrated approaches for linking patient identity records across different EHR systems, little is known about linking patient encounter records across primary and ancillary EHR systems., Objectives: We compared a patients-first approach versus an encounters-first approach for linking patient encounter records across multiple EHR systems., Methods: We conducted a retrospective observational study of 348,904 patients with 533,283 encounters from 2010 to 2020 across our institution's primary EHR system and an ancillary EHR system used in perioperative settings. For the patients-first approach and the encounters-first approach, we measured the number of patient and encounter links created as well as runtime., Results: While the patients-first approach linked 43% of patients and 49% of encounters, the encounters-first approach linked 98% of patients and 100% of encounters. The encounters-first approach was 20 times faster than the patients-first approach for linking patients and 33% slower for linking encounters., Conclusion: Findings suggest that common patient and encounter identifiers shared among EHR systems via automated interfaces may be clinically useful but not "research-ready" and thus require an encounters-first linkage approach to enable secondary use for scientific purposes. Based on our search, this study is among the first to demonstrate approaches for linking patient encounters across multiple EHR systems. Enterprise data warehouse for research efforts elsewhere may benefit from an encounters-first approach., Competing Interests: Conflict of Interest None declared.
- Published
- 2024
- Full Text
- View/download PDF
45. A RE-AIM Evaluation of a Visualization-Based Electronic Patient-Reported Outcome System.
- Author
-
Turchioe MR, Mangal S, Goyal P, Axsom K, Myers A, Liu LG, Lee J, Campion TR Jr, and Creber RM
- Subjects
- Humans, Female, Aged, Male, Delivery of Health Care, Surveys and Questionnaires, Electronics, Software, Patient Reported Outcome Measures
- Abstract
Objectives: Health care systems are primarily collecting patient-reported outcomes (PROs) for research and clinical care using proprietary, institution- and disease-specific tools for remote assessment. The purpose of this study was to conduct a Reach, Effectiveness, Adoption, Implementation, and Maintenance (RE-AIM) evaluation of a scalable electronic PRO (ePRO) reporting and visualization system in a single-arm study., Methods: The "mi.symptoms" ePRO system was designed using gerontechnological design principles to ensure high usability among older adults. The system enables longitudinal reporting of disease-agnostic ePROs and includes patient-facing PRO visualizations. We conducted an evaluation of the implementation of the system guided by the RE-AIM framework. Quantitative data were analyzed using basic descriptive statistics, and qualitative data were analyzed using directed content analysis., Results: Reach-the total reach of the study was 70 participants (median age: 69, 31% female, 17% Black or African American, 27% reported not having enough financial resources). Effectiveness-half (51%) of participants completed the 2-week follow-up survey and 36% completed all follow-up surveys. Adoption-the desire for increased self-knowledge, the value of tracking symptoms, and altruism motivated participants to adopt the tool. Implementation-the predisposing factor was access to, and comfort with, computers. Three enabling factors were incorporation into routines, multimodal nudges, and ease of use. Maintenance-reinforcing factors were perceived usefulness of viewing symptom reports with the tool and understanding the value of sustained symptom tracking in general., Conclusion: Challenges in ePRO reporting, particularly sustained patient engagement, remain. Nonetheless, freely available, scalable, disease-agnostic systems may pave the road toward inclusion of a more diverse range of health systems and patients in ePRO collection and use., Competing Interests: M.R.T. is a consultant for Boston Scientific Corp. and has equity in Iris OB Health Inc (New York). The remaining authors have no disclosures., (Thieme. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
46. Maturity in enterprise data warehouses for research operations: Analysis of a pilot study.
- Author
-
Knosp BM, Dorr DA, and Campion TR
- Abstract
Enterprise data warehouses for research (EDW4R) is a critical component of National Institutes of Health Clinical and Translational Science Award (CTSA) hubs. EDW4R operations have unique needs that require specialized skills and collaborations across multiple domains which limit the ability to apply existing models of information technology (IT) performance. Because of this uniqueness, we developed a new EDW4R maturity model based on prior qualitative study of operational practices for supporting EDW4Rs at CTSA hubs. In a pilot study, respondents from fifteen CTSA hubs completed the novel EDW4R maturity index survey by rating 33 maturity statements across 6 categories using a 5-point Likert scale. Of the six categories, respondents rated workforce as most mature (4.17 [3.67-4.42]) and relationship with enterprise IT as the least mature (3.00 [2.80-3.80]). Our pilot of a novel maturity index shows a baseline quantitative measure of EDW4R functions across fifteen CTSA hubs. The maturity index may be useful to faculty and staff currently leading an EDW4R by creating opportunities to explore the index in local context and comparison to other institutions., Competing Interests: The authors have no conflicts of interest relevant to the topic of this manuscript., (© The Author(s) 2023.)
- Published
- 2023
- Full Text
- View/download PDF
47. A Day-to-Day Approach for Automating the Hospital Course Section of the Discharge Summary.
- Author
-
Hartman V and Campion TR
- Abstract
Optimal solutions for abstractive summarization of electronic health record content have yet to be discovered. Although studies have applied state-of-the-art transformers in the clinical domain to radiology reports and information extraction, little is known of transformers' performance with the hospital course section of the discharge summary. This paper compares two summarization approaches for automating the hospital course section within the discharge summary: (1) a truncation approach that uses all clinical notes and (2) a day-to-day approach that segments the notes per clinical day. We pair both approaches with different transformer encoder-decoder based-models - BART, BERT2GPT2, ClinicalBERT2GPT2, and ClinicalBERT2ClinicalBERT and evaluate the transformers that work best for each approach using ROUGE metrics. The results demonstrate that the day-to-day approach can overcome the limitations of longform document summarization for the patient clinical record., (©2022 AMIA - All rights reserved.)
- Published
- 2022
48. Synergies between centralized and federated approaches to data quality: a report from the national COVID cohort collaborative.
- Author
-
Pfaff ER, Girvin AT, Gabriel DL, Kostka K, Morris M, Palchuk MB, Lehmann HP, Amor B, Bissell M, Bradwell KR, Gold S, Hong SS, Loomba J, Manna A, McMurry JA, Niehaus E, Qureshi N, Walden A, Zhang XT, Zhu RL, Moffitt RA, Haendel MA, Chute CG, Adams WG, Al-Shukri S, Anzalone A, Baghal A, Bennett TD, Bernstam EV, Bernstam EV, Bissell MM, Bush B, Campion TR, Castro V, Chang J, Chaudhari DD, Chen W, Chu S, Cimino JJ, Crandall KA, Crooks M, Davies SJD, DiPalazzo J, Dorr D, Eckrich D, Eltinge SE, Fort DG, Golovko G, Gupta S, Haendel MA, Hajagos JG, Hanauer DA, Harnett BM, Horswell R, Huang N, Johnson SG, Kahn M, Khanipov K, Kieler C, Luzuriaga KR, Maidlow S, Martinez A, Mathew J, McClay JC, McMahan G, Melancon B, Meystre S, Miele L, Morizono H, Pablo R, Patel L, Phuong J, Popham DJ, Pulgarin C, Santos C, Sarkar IN, Sazo N, Setoguchi S, Soby S, Surampalli S, Suver C, Vangala UMR, Visweswaran S, Oehsen JV, Walters KM, Wiley L, Williams DA, and Zai A
- Subjects
- Cohort Studies, Data Accuracy, Health Insurance Portability and Accountability Act, Humans, United States, COVID-19
- Abstract
Objective: In response to COVID-19, the informatics community united to aggregate as much clinical data as possible to characterize this new disease and reduce its impact through collaborative analytics. The National COVID Cohort Collaborative (N3C) is now the largest publicly available HIPAA limited dataset in US history with over 6.4 million patients and is a testament to a partnership of over 100 organizations., Materials and Methods: We developed a pipeline for ingesting, harmonizing, and centralizing data from 56 contributing data partners using 4 federated Common Data Models. N3C data quality (DQ) review involves both automated and manual procedures. In the process, several DQ heuristics were discovered in our centralized context, both within the pipeline and during downstream project-based analysis. Feedback to the sites led to many local and centralized DQ improvements., Results: Beyond well-recognized DQ findings, we discovered 15 heuristics relating to source Common Data Model conformance, demographics, COVID tests, conditions, encounters, measurements, observations, coding completeness, and fitness for use. Of 56 sites, 37 sites (66%) demonstrated issues through these heuristics. These 37 sites demonstrated improvement after receiving feedback., Discussion: We encountered site-to-site differences in DQ which would have been challenging to discover using federated checks alone. We have demonstrated that centralized DQ benchmarking reveals unique opportunities for DQ improvement that will support improved research analytics locally and in aggregate., Conclusion: By combining rapid, continual assessment of DQ with a large volume of multisite data, it is possible to support more nuanced scientific questions with the scale and rigor that they require., (© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association.)
- Published
- 2022
- Full Text
- View/download PDF
49. Assessment of structured data elements for social risk factors.
- Author
-
Vest JR, Adler-Milstein J, Gottlieb LM, Bian J, Campion TR Jr, Cohen GR, Donnelly N, Harper J, Huerta TR, Kansky JP, Kharrazi H, Khurshid A, Kooreman HE, McDonnell C, Overhage JM, Pantell MS, Parisi W, Shenkman EA, Tierney WM, Wiehe S, and Harle CA
- Subjects
- Delphi Technique, Electronic Health Records, Humans, Risk Factors, Health Information Exchange
- Abstract
Objectives: Computable social risk factor phenotypes derived from routinely collected structured electronic health record (EHR) or health information exchange (HIE) data may represent a feasible and robust approach to measuring social factors. This study convened an expert panel to identify and assess the quality of individual EHR and HIE structured data elements that could be used as components in future computable social risk factor phenotypes., Study Design: Technical expert panel., Methods: A 2-round Delphi technique included 17 experts with an in-depth knowledge of available EHR and/or HIE data. The first-round identification sessions followed a nominal group approach to generate candidate data elements that may relate to socioeconomics, cultural context, social relationships, and community context. In the second-round survey, panelists rated each data element according to overall data quality and likelihood of systematic differences in quality across populations (ie, bias)., Results: Panelists identified a total of 89 structured data elements. About half of the data elements (n = 45) were related to socioeconomic characteristics. The panelists identified a diverse set of data elements. Elements used in reimbursement-related processes were generally rated as higher quality. Panelists noted that several data elements may be subject to implicit bias or reflect biased systems of care, which may limit their utility in measuring social factors., Conclusions: Routinely collected structured data within EHR and HIE systems may reflect patient social risk factors. Identifying and assessing available data elements serves as a foundational step toward developing future computable social factor phenotypes.
- Published
- 2022
- Full Text
- View/download PDF
50. Identifying organ dysfunction trajectory-based subphenotypes in critically ill patients with COVID-19.
- Author
-
Su C, Xu Z, Hoffman K, Goyal P, Safford MM, Lee J, Alvarez-Mulett S, Gomez-Escobar L, Price DR, Harrington JS, Torres LK, Martinez FJ, Campion TR Jr, Wang F, and Schenck EJ
- Subjects
- Aged, COVID-19 complications, COVID-19 physiopathology, Critical Illness, Female, Humans, Male, Middle Aged, Multiple Organ Failure etiology, Multiple Organ Failure physiopathology, Organ Dysfunction Scores, Prognosis, SARS-CoV-2 isolation & purification, Severity of Illness Index, COVID-19 diagnosis, Multiple Organ Failure diagnosis
- Abstract
COVID-19-associated respiratory failure offers the unprecedented opportunity to evaluate the differential host response to a uniform pathogenic insult. Understanding whether there are distinct subphenotypes of severe COVID-19 may offer insight into its pathophysiology. Sequential Organ Failure Assessment (SOFA) score is an objective and comprehensive measurement that measures dysfunction severity of six organ systems, i.e., cardiovascular, central nervous system, coagulation, liver, renal, and respiration. Our aim was to identify and characterize distinct subphenotypes of COVID-19 critical illness defined by the post-intubation trajectory of SOFA score. Intubated COVID-19 patients at two hospitals in New York city were leveraged as development and validation cohorts. Patients were grouped into mild, intermediate, and severe strata by their baseline post-intubation SOFA. Hierarchical agglomerative clustering was performed within each stratum to detect subphenotypes based on similarities amongst SOFA score trajectories evaluated by Dynamic Time Warping. Distinct worsening and recovering subphenotypes were identified within each stratum, which had distinct 7-day post-intubation SOFA progression trends. Patients in the worsening suphenotypes had a higher mortality than those in the recovering subphenotypes within each stratum (mild stratum, 29.7% vs. 10.3%, p = 0.033; intermediate stratum, 29.3% vs. 8.0%, p = 0.002; severe stratum, 53.7% vs. 22.2%, p < 0.001). Pathophysiologic biomarkers associated with progression were distinct at each stratum, including findings suggestive of inflammation in low baseline severity of illness versus hemophagocytic lymphohistiocytosis in higher baseline severity of illness. The findings suggest that there are clear worsening and recovering subphenotypes of COVID-19 respiratory failure after intubation, which are more predictive of outcomes than baseline severity of illness. Distinct progression biomarkers at differential baseline severity of illness suggests a heterogeneous pathobiology in the progression of COVID-19 respiratory failure., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.