422 results on '"Celi LA"'
Search Results
2. Varying association of laboratory values with reference ranges and outcomes in critically ill patients: an analysis of data from five databases in four countries across Asia, Europe and North America.
- Author
-
Xu, H, Agha-Mir-Salim, L, O'Brien, Z, Huang, DC, Li, P, Gómez, J, Liu, X, Liu, T, Yeung, W, Thoral, P, Elbers, P, Zhang, Z, Saera, MB, Celi, LA, Xu, H, Agha-Mir-Salim, L, O'Brien, Z, Huang, DC, Li, P, Gómez, J, Liu, X, Liu, T, Yeung, W, Thoral, P, Elbers, P, Zhang, Z, Saera, MB, and Celi, LA
- Abstract
BACKGROUND: Despite wide usage across all areas of medicine, it is uncertain how useful standard reference ranges of laboratory values are for critically ill patients. OBJECTIVES: The aim of this study is to assess the distributions of standard laboratory measurements in more than 330 selected intensive care units (ICUs) across the USA, Amsterdam, Beijing and Tarragona; compare differences and similarities across different geographical locations and evaluate how they may be associated with differences in length of stay (LOS) and mortality in the ICU. METHODS: A multi-centre, retrospective, cross-sectional study of data from five databases for adult patients first admitted to an ICU between 2001 and 2019 was conducted. The included databases contained patient-level data regarding demographics, interventions, clinical outcomes and laboratory results. Kernel density estimation functions were applied to the distributions of laboratory tests, and the overlapping coefficient and Cohen standardised mean difference were used to quantify differences in these distributions. RESULTS: The 259 382 patients studied across five databases in four countries showed a high degree of heterogeneity with regard to demographics, case mix, interventions and outcomes. A high level of divergence in the studied laboratory results (creatinine, haemoglobin, lactate, sodium) from the locally used reference ranges was observed, even when stratified by outcome. CONCLUSION: Standardised reference ranges have limited relevance to ICU patients across a range of geographies. The development of context-specific reference ranges, especially as it relates to clinical outcomes like LOS and mortality, may be more useful to clinicians.
- Published
- 2021
3. Hyperdynamic ejection fraction in the critically ill patient
- Author
-
Paonessa, JR, Brennan, TP, and Celi, LA
- Published
- 2014
- Full Text
- View/download PDF
4. Reducing ICU blood draws with artificial intelligence
- Author
-
Cismondi, FC, Fialho, AS, Vieira, SM, Celi, LA, Reti, SR, Sousa, JM, and Finkelstein, SN
- Published
- 2012
- Full Text
- View/download PDF
5. Long-term survival for ICU patients with acute kidney injury
- Author
-
Scott, D, Cismondi, F, Lee, J, Mandelbaum, T, Celi, LA, Mark, RG, and Talmor, D
- Published
- 2012
- Full Text
- View/download PDF
6. What matters during a hypotensive episode: fluids, vasopressors, or both?
- Author
-
Lee, J, Kothari, R, Ladapo, JA, Scott, DJ, and Celi, LA
- Published
- 2012
- Full Text
- View/download PDF
7. Técnica, tecnologia e inovação no âmbito da educação corporativa: ressignificação de conceitos
- Author
-
Celi Langhi, Denilson de Sousa Cordeiro, and Edna Mataruco Duarte
- Subjects
técnica ,tecnologia ,inovação ,Industrial engineering. Management engineering ,T55.4-60.8 ,Management. Industrial management ,HD28-70 - Abstract
Educação, conhecimento, poder, tecnologia e inovação são conceitos cujos vínculos estão presentes em todas as relações humanas, em todos os momentos históricos e em todos os lugares. Um dos contextos nos quais a conexão entre esses conceitos demonstra grande relevância é o ambiente corporativo. Desta forma, o objetivo deste trabalho é promover reflexões sobre a possibilidade de ressignificação dos conceitos de técnica, tecnologia e inovação no âmbito da Educação Corporativa. Como método de pesquisa, foi empregada a revisão de literatura. Os resultados da pesquisa demonstram que esses conceitos podem ser empregados na gestão de pessoas e na gestão de conhecimento. Conclui-se que as reflexões a partir dessa temática podem alavancar os resultados das estratégias de ensino e aprendizagem voltadas para o desenvolvimento de pessoas e de organizações.
- Published
- 2023
- Full Text
- View/download PDF
8. Understanding the artificial intelligence clinician and optimal treatment strategies for sepsis in intensive care
- Author
-
Komorowski, M, Celi, LA, Badawi, O, Gordon, AC, and Faisal, AA
- Subjects
cs.AI ,stat.AP - Abstract
In this document, we explore in more detail our published work (Komorowski, Celi, Badawi, Gordon, & Faisal, 2018) for the benefit of the AI in Healthcare research community. In the above paper, we developed the AI Clinician system, which demonstrated how reinforcement learning could be used to make useful recommendations towards optimal treatment decisions from intensive care data. Since publication a number of authors have reviewed our work (e.g. Abbasi, 2018; Bos, Azoulay, & Martin-Loeches, 2019; Saria, 2018). Given the difference of our framework to previous work, the fact that we are bridging two very different academic communities (intensive care and machine learning) and that our work has impact on a number of other areas with more traditional computer-based approaches (biosignal processing and control, biomedical engineering), we are providing here additional details on our recent publication.
- Published
- 2019
9. Risk prediction system for dengue transmission based on high resolution weather data
- Author
-
Celi, LA, Hettiarachchige, C, von Cavallar, S, Lynar, T, Hickson, R, Gambhir, M, Celi, LA, Hettiarachchige, C, von Cavallar, S, Lynar, T, Hickson, R, and Gambhir, M
- Abstract
BACKGROUND: Dengue is the fastest spreading vector-borne viral disease, resulting in an estimated 390 million infections annually. Precise prediction of many attributes related to dengue is still a challenge due to the complex dynamics of the disease. Important attributes to predict include: the risk of and risk factors for an infection; infection severity; and the timing and magnitude of outbreaks. In this work, we build a model for predicting the risk of dengue transmission using high-resolution weather data. The level of dengue transmission risk depends on the vector density, hence we predict risk via vector prediction. METHODS AND FINDINGS: We make use of surveillance data on Aedes aegypti larvae collected by the Taiwan Centers for Disease Control as part of the national routine entomological surveillance of dengue, and weather data simulated using the IBM's Containerized Forecasting Workflow, a high spatial- and temporal-resolution forecasting system. We propose a two stage risk prediction system for assessing dengue transmission via Aedes aegypti mosquitoes. In stage one, we perform a logistic regression to determine whether larvae are present or absent at the locations of interest using weather attributes as the explanatory variables. The results are then aggregated to an administrative division, with presence in the division determined by a threshold percentage of larvae positive locations resulting from a bootstrap approach. In stage two, larvae counts are estimated for the predicted larvae positive divisions from stage one, using a zero-inflated negative binomial model. This model identifies the larvae positive locations with 71% accuracy and predicts the larvae numbers producing a coverage probability of 98% over 95% nominal prediction intervals. This two-stage model improves the overall accuracy of identifying larvae positive locations by 29%, and the mean squared error of predicted larvae numbers by 9.6%, against a single-stage approach which uses a zer
- Published
- 2018
10. Trends in severity of illness on ICU admission and mortality among the elderly.
- Author
-
Alazawi, W, Fuchs, L, Novack, V, McLennan, S, Celi, LA, Baumfeld, Y, Park, S, Howell, MD, Talmor, DS, Alazawi, W, Fuchs, L, Novack, V, McLennan, S, Celi, LA, Baumfeld, Y, Park, S, Howell, MD, and Talmor, DS
- Abstract
BACKGROUND: There is an increase in admission rate for elderly patients to the ICU. Mortality rates are lower when more liberal ICU admission threshold are compared to more restrictive threshold. We sought to describe the temporal trends in elderly admissions and outcomes in a tertiary hospital before and after the addition of an 8-bed medical ICU. METHODS: We conducted a retrospective analysis of a comprehensive longitudinal ICU database, from a large tertiary medical center, examining trends in patients' characteristics, severity of illness, intensity of care and mortality rates over the years 2001-2008. The study population consisted of elderly patients and the primary endpoints were 28 day and one year mortality from ICU admission. RESULTS: Between the years 2001 and 2008, 7,265 elderly patients had 8,916 admissions to ICU. The rate of admission to the ICU increased by 5.6% per year. After an eight bed MICU was added, the severity of disease on ICU admission dropped significantly and crude mortality rates decreased thereafter. Adjusting for severity of disease on presentation, there was a decreased mortality at 28- days but no improvement in one- year survival rates for elderly patient admitted to the ICU over the years of observation. Hospital mortality rates have been unchanged from 2001 through 2008. CONCLUSION: In a high capacity ICU bed hospital, there was a temporal decrease in severity of disease on ICU admission, more so after the addition of additional medical ICU beds. While crude mortality rates decreased over the study period, adjusted one-year survival in ICU survivors did not change with the addition of ICU beds. These findings suggest that outcome in critically ill elderly patients may not be influenced by ICU admission. Adding additional ICU beds to deal with the increasing age of the population may therefore not be effective.
- Published
- 2014
11. Measuring the benefits of ICTs in social enterprises: an exploratory study
- Author
-
Marcelo Okano, Celi Langhi, and Rosinei Batista Ribeiro
- Subjects
social enterprises ,information and communication technologies ,indicators for social mission ,Business ,HF5001-6182 - Abstract
The study presented in this article aims to identify what are, in fact, the social missions of companies called “social companies” , and what are the benefits obtained from the use of ICTs by these companies. The research methods employed were bibliographic research and case study. The bibliographic research was elaborated from a systematic analysis and the use of a theoretical framework related to performance indicators, as proposed by Hutchinson and Molla (2009). The case study was carried out based on four social companies, with which observations were made in loco, application of interviews and analysis of the respective websites. The results indicated that the social mission of this type of company can be explained by the use of six indicators: access to markets, income generation, employment opportunities, social training, strengthening of relations with the sponsor and social outsourcing; and that these indicators require the use of ICTs to be developed. It is concluded that, if at least one of the indicators is present in these companies, ICTs can bring some kind of benefit.
- Published
- 2021
- Full Text
- View/download PDF
12. Tecnologia de apoio ao ensino e aprendizagem de programação em graduações tecnológicas profissionais: Juiz On-line
- Author
-
Carlos Vital Giordano, Lucio Nunes de Lira, Celi Langhi, and Marcelo Duduchi Feitosa
- Subjects
método ,programação ,algoritmos ,comportamento ,Education (General) ,L7-991 ,Special aspects of education ,LC8-6691 - Abstract
As disciplinas de programação de computadores se posicionam entre as com maior índice de reprovação em graduações em computação. Porém, existem tecnologias com potencial para apoiar o processo de ensino e aprendizagem, permitindo aos discentes estudar extraclasse com uma abordagem diferente da tradicional, de aulas expositivas, tendo em seguida um feedback imediato por meio de resolução de problemas e avaliação automática das soluções construídas. A tecnologia Juiz On-line possui características fundamentadas na abordagem de ensino comportamental. A presente investigação objetiva analisar e discutir características da tecnologia, conectando-a com a abordagem comportamentalista, e propor sua aplicação em cursos superiores tecnológicos de computação.
- Published
- 2022
- Full Text
- View/download PDF
13. Relações entre gestão do conhecimento, aprendizagem organizacional e educação corporativa
- Author
-
Celi Langhi and Denilson de Sousa Cordeiro
- Subjects
gestão do conhecimento ,aprendizagem organizacional ,educação corporativa ,Education - Abstract
Empresas brasileiras têm exigido novas competências de seus colaboradores. Para isso, novos modelos de gestão, aprendizagem e educação têm sido adotados, a fim de harmonizar os interesses das pessoas e suas carreiras com os objetivos estratégicos das corporações. Este artigo tem o objetivo de identificar relações entre gestão do conhecimento, aprendizagem organizacional e educação corporativa. A partir da abordagem dos conceitos em textos acadêmicos produzidos sobre os tópicos, os assuntos foram articulados de modo a trazer maior compreensão sobre o tema. Como resultado, foi possível analisar como o conhecimento é obtido, armazenado, disseminado e empregado pelas empresas, o que fomenta a aprendizagem no ambiente corporativo, viabilizando o planejamento e a implantação de programas de educação que não apenas qualificam os colaboradores, mas também os ajudam a exercer a liderança nos diversos grupos de trabalho das organizações, o que traz diversos benefícios para indivíduos, grupos, empresas, comunidades e para a sociedade.
- Published
- 2021
- Full Text
- View/download PDF
14. Artificial intelligence to reduce practice variation in the ICU
- Author
-
Celi, LA, primary, Hinske, C, additional, and Alterovitz, G, additional
- Published
- 2008
- Full Text
- View/download PDF
15. Educação corporativa: aprendizagem significativa no âmbito das empresas
- Author
-
Celi Langhi, Denilson de Sousa Cordeiro, Mariana de Lima Simões, and Caio Flávio Stettiner
- Subjects
Educação corporativa ,Aprendizagem significativa ,Estrutura cognitiva ,Conhecimentos prévios ,Education - Abstract
As empresas brasileiras têm efetuado investimentos em educação para superarem crises, para adquirirem competividade para lidar com os desafios da economia atual e para alcançarem seus objetivos estratégicos. Este artigo tem como objetivo discutir a aplicação da teoria da aprendizagem significativa para o desenvolvimento de boas práticas de ensino na educação corporativa. Foi realizada uma pesquisa qualitativa, com base na revisão descritiva da literatura, a qual revelou estruturas e tendências a respeito da aplicabilidade dos pressupostos teóricos da aprendizagem significativa nos processos de ensino e aprendizagem nas empresas. Os resultados indicam que, para que ocorra aprendizagem significativa na educação corporativa, é necessário que os processos didáticos nas empresas sejam mais inclusivos em relação a novos conhecimentos, considerando a importância dos conhecimentos prévios dos indivíduos.
- Published
- 2021
- Full Text
- View/download PDF
16. Effects of age and coronary artery disease on cerebrovascular reactivity to carbon dioxide in humans.
- Author
-
Galvin SD, Celi LA, Thomas KN, Clendon TR, Galvin IE, Bunton RW, Ainslie PN, Galvin, S D, Celi, L A, Thomas, K N, Clendon, T R, Galvin, I F, Bunton, R W, and Ainslie, P N
- Abstract
Alterations in cerebrovascular reactivity to CO2, an index of cerebrovascular function, have been associated with increased risk of stroke. We hypothesised that cerebrovascular reactivity is impaired with increasing age and in patients with symptomatic coronary artery disease (CAD). Cerebrovascular and cardiovascular reactivity to CO2 was assessed at rest and during hypercapnia (5% CO2) and hypocapnia (hyperventilation) in subjects with symptomatic CAD (n=13) and age-matched old (n=9) and young (n=20) controls without CAD. Independent of CAD, reductions in middle cerebral artery blood velocity (transcranial Doppler) and cerebral oxygenation (near-infrared spectroscopy) were correlated with increasing age (r = -0.68, r = -0.51, respectively, P < 0.01). In CAD patients, at rest and during hypercapnia, cerebral oxygenation was lower (P < 0.05 vs. young). Although middle cerebral artery blood velocity reactivity was unaltered in the hypercapnic range, middle cerebral artery blood velocity reactivity to hypocapnia was elevated in the CAD and age-matched controls (P < 0.01 vs. young), and was associated with age (r = 0.62, P < 0.01). Transient drops in arterial PCO2 occur in a range of physiological and pathophysiological situations, therefore, the elevated middle cerebral artery blood velocity reactivity to hypocapnia combined with reductions in middle cerebral artery blood velocity may be important mechanisms underlying neurological risk with aging. In CAD patients, additional reductions in cerebral oxygenation may place them at additional risk of cerebral ischaemia. [ABSTRACT FROM AUTHOR]
- Published
- 2010
- Full Text
- View/download PDF
17. A Legislação Brasileira sobre Direito Ambiental até 2010 e suas implicações para as Tecnologias da Informação e Comunicação (TIC) Sustentáveis
- Author
-
Samuel de Barros Moraes, Celi Langhi, and Marcos Crivelaro
- Subjects
TI Verde ,Legislação ambiental ,Sistemas de Informação ,Tecnologia da Informação ,Social Sciences - Abstract
Este artigo faz uma análise da legislação ambiental brasileira, visando identificar suas referências à tecnologia da informação e dentro de suas exigências legais como ela influência a adoção de formas sustentáveis de uso da tecnologia da informação pelas empresas. Para tanto se procede a uma revisão bibliográfica da legislação promulgada até 2010, localizando-se as menções diretas e indiretas a tecnologia e analisando o contexto em que cada menção foi feita. A análise realizada indica que TIC é considerada pela legislação uma ferramenta fundamental para a preservação ambiental e como causadora de baixo impacto ambiental, dado que existem poucas referências as TIC neste sentido.
- Published
- 2014
- Full Text
- View/download PDF
18. The eICU: it's not just telemedicine.
- Author
-
Celi LA, Hassan E, Marquardt C, Breslow M, Rosenfeld B, Celi, L A, Hassan, E, Marquardt, C, Breslow, M, and Rosenfeld, B
- Published
- 2001
19. Introducing the Team Card: Enhancing governance for medical Artificial Intelligence (AI) systems in the age of complexity.
- Author
-
Modise LM, Alborzi Avanaki M, Ameen S, Celi LA, Chen VXY, Cordes A, Elmore M, Fiske A, Gallifant J, Hayes M, Marcelo A, Matos J, Nakayama L, Ozoani E, Silverman BC, and Comeau DS
- Abstract
This paper introduces the Team Card (TC) as a protocol to address harmful biases in the development of clinical artificial intelligence (AI) systems by emphasizing the often-overlooked role of researchers' positionality. While harmful bias in medical AI, particularly in Clinical Decision Support (CDS) tools, is frequently attributed to issues of data quality, this limited framing neglects how researchers' worldviews-shaped by their training, backgrounds, and experiences-can influence AI design and deployment. These unexamined subjectivities can create epistemic limitations, amplifying biases and increasing the risk of inequitable applications in clinical settings. The TC emphasizes reflexivity-critical self-reflection-as an ethical strategy to identify and address biases stemming from the subjectivity of research teams. By systematically documenting team composition, positionality, and the steps taken to monitor and address unconscious bias, TCs establish a framework for assessing how diversity within teams impacts AI development. Studies across business, science, and organizational contexts demonstrate that diversity improves outcomes, including innovation, decision-making quality, and overall performance. However, epistemic diversity-diverse ways of thinking and problem-solving-must be actively cultivated through intentional, collaborative processes to mitigate bias effectively. By embedding epistemic diversity into research practices, TCs may enhance model performance, improve fairness and offer an empirical basis for evaluating how diversity influences bias mitigation efforts over time. This represents a critical step toward developing inclusive, ethical, and effective AI systems in clinical care. A publicly available prototype presenting our TC is accessible at https://www.teamcard.io/team/demo., Competing Interests: Leo Anthony Celi is the Editor-in-Chief of PLOS Digital Health., (Copyright: © 2025 Modise et al.. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2025
- Full Text
- View/download PDF
20. A systematic review of machine learning-based prognostic models for acute pancreatitis: Towards improving methods and reporting quality.
- Author
-
Critelli B, Hassan A, Lahooti I, Noh L, Park JS, Tong K, Lahooti A, Matzko N, Adams JN, Liss L, Quion J, Restrepo D, Nikahd M, Culp S, Lacy-Hulbert A, Speake C, Buxbaum J, Bischof J, Yazici C, Evans-Phillips A, Terp S, Weissman A, Conwell D, Hart P, Ramsey M, Krishna S, Han S, Park E, Shah R, Akshintala V, Windsor JA, Mull NK, Papachristou G, Celi LA, and Lee P
- Subjects
- Humans, Prognosis, Acute Disease, Machine Learning, Pancreatitis therapy, Pancreatitis diagnosis
- Abstract
Background: An accurate prognostic tool is essential to aid clinical decision-making (e.g., patient triage) and to advance personalized medicine. However, such a prognostic tool is lacking for acute pancreatitis (AP). Increasingly machine learning (ML) techniques are being used to develop high-performing prognostic models in AP. However, methodologic and reporting quality has received little attention. High-quality reporting and study methodology are critical for model validity, reproducibility, and clinical implementation. In collaboration with content experts in ML methodology, we performed a systematic review critically appraising the quality of methodology and reporting of recently published ML AP prognostic models., Methods/findings: Using a validated search strategy, we identified ML AP studies from the databases MEDLINE and EMBASE published between January 2021 and December 2023. We also searched pre-print servers medRxiv, bioRxiv, and arXiv for pre-prints registered between January 2021 and December 2023. Eligibility criteria included all retrospective or prospective studies that developed or validated new or existing ML models in patients with AP that predicted an outcome following an episode of AP. Meta-analysis was considered if there was homogeneity in the study design and in the type of outcome predicted. For risk of bias (ROB) assessment, we used the Prediction Model Risk of Bias Assessment Tool. Quality of reporting was assessed using the Transparent Reporting of a Multivariable Prediction Model of Individual Prognosis or Diagnosis-Artificial Intelligence (TRIPOD+AI) statement that defines standards for 27 items that should be reported in publications using ML prognostic models. The search strategy identified 6,480 publications of which 30 met the eligibility criteria. Studies originated from China (22), the United States (4), and other (4). All 30 studies developed a new ML model and none sought to validate an existing ML model, producing a total of 39 new ML models. AP severity (23/39) or mortality (6/39) were the most common outcomes predicted. The mean area under the curve for all models and endpoints was 0.91 (SD 0.08). The ROB was high for at least one domain in all 39 models, particularly for the analysis domain (37/39 models). Steps were not taken to minimize over-optimistic model performance in 27/39 models. Due to heterogeneity in the study design and in how the outcomes were defined and determined, meta-analysis was not performed. Studies reported on only 15/27 items from TRIPOD+AI standards, with only 7/30 justifying sample size and 13/30 assessing data quality. Other reporting deficiencies included omissions regarding human-AI interaction (28/30), handling low-quality or incomplete data in practice (27/30), sharing analytical codes (25/30), study protocols (25/30), and reporting source data (19/30)., Conclusions: There are significant deficiencies in the methodology and reporting of recently published ML based prognostic models in AP patients. These undermine the validity, reproducibility, and implementation of these prognostic models despite their promise of superior predictive accuracy., Registration: Research Registry (reviewregistry1727)., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2025 Critelli et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2025
- Full Text
- View/download PDF
21. Regulation of Artificial Intelligence in Health Care and Biomedicine.
- Author
-
Jaiswal N, Samsel K, and Celi LA
- Published
- 2025
- Full Text
- View/download PDF
22. Potential source of bias in AI models: Lactate measurement in the ICU as a template.
- Author
-
Hussein NSA, Pradhan P, Haug FW, Moukheiber D, Moukheiber L, Moukheiber M, Moukheiber S, Weishaupt LL, Ellen JG, D'Couto H, Williams IC, Celi LA, Matos J, and Struja T
- Abstract
Objective: Health inequities may be driven by demographics such as sex, language proficiency, and race-ethnicity. These disparities may manifest through likelihood of testing, which in turn can bias artificial intelligence models. The goal of this study is to evaluate variation in serum lactate measurements in the Intensive Care Unit (ICU)., Methods: Utilizing MIMIC-IV (2008-2019), we identified adults fulfilling sepsis-3 criteria. Exclusion criteria were ICU stay <1-day, unknown race-ethnicity, <18 years of age, and recurrent stays. Employing targeted maximum likelihood estimation analysis, we assessed the likelihood of a lactate measurement on day 1. For patients with a measurement on day 1, we evaluated the predictors of subsequent readings., Results: We studied 15,601 patients (19.5% racial-ethnic minority, 42.4% female, and 10.0% limited English proficiency). After adjusting for confounders, Black patients had a slightly higher likelihood of receiving a lactate measurement on day 1 (odds ratio 1.19, 95% confidence interval (CI) 1.06-1.34), but not the other minority groups. Subsequent frequency was similar across race-ethnicities, but women had a lower incidence rate ratio (IRR) 0.94 (95% CI 0.90-0.98). Interestingly, patients with elective admission and private insurance also had a higher frequency of repeated serum lactate measurements (IRR 1.70, 95% CI 1.61-1.81, and 1.07, 95% CI, 1.02-1.12, respectively)., Conclusion: We found no disparities in the likelihood of a lactate measurement among patients with sepsis across demographics, except for a small increase for Black patients, and a reduced frequency for women. Variation in biomarker monitoring can be a source of data bias when modeling patient outcomes, and thus should be accounted for in every analysis., Competing Interests: Conflicts of Interest None of the authors have any conflicts of interest relevant to this work.
- Published
- 2025
- Full Text
- View/download PDF
23. The Data Artifacts Glossary: a community-based repository for bias on health datasets.
- Author
-
Gameiro RR, Woite NL, Sauer CM, Hao S, Fernandes CO, Premo AE, Teixeira AR, Resli I, Wong AI, and Celi LA
- Subjects
- Humans, Bias, Datasets as Topic, Artificial Intelligence
- Abstract
Background: The deployment of Artificial Intelligence (AI) in healthcare has the potential to transform patient care through improved diagnostics, personalized treatment plans, and more efficient resource management. However, the effectiveness and fairness of AI are critically dependent on the data it learns from. Biased datasets can lead to AI outputs that perpetuate disparities, particularly affecting social minorities and marginalized groups., Objective: This paper introduces the "Data Artifacts Glossary", a dynamic, open-source framework designed to systematically document and update potential biases in healthcare datasets. The aim is to provide a comprehensive tool that enhances the transparency and accuracy of AI applications in healthcare and contributes to understanding and addressing health inequities., Methods: Utilizing a methodology inspired by the Delphi method, a diverse team of experts conducted iterative rounds of discussions and literature reviews. The team synthesized insights to develop a comprehensive list of bias categories and designed the glossary's structure. The Data Artifacts Glossary was piloted using the MIMIC-IV dataset to validate its utility and structure., Results: The Data Artifacts Glossary adopts a collaborative approach modeled on successful open-source projects like Linux and Python. Hosted on GitHub, it utilizes robust version control and collaborative features, allowing stakeholders from diverse backgrounds to contribute. Through a rigorous peer review process managed by community members, the glossary ensures the continual refinement and accuracy of its contents. The implementation of the Data Artifacts Glossary with the MIMIC-IV dataset illustrates its utility. It categorizes biases, and facilitates their identification and understanding., Conclusion: The Data Artifacts Glossary serves as a vital resource for enhancing the integrity of AI applications in healthcare by providing a mechanism to recognize and mitigate dataset biases before they impact AI outputs. It not only aids in avoiding bias in model development but also contributes to understanding and addressing the root causes of health disparities., Competing Interests: Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare no competing interests. Dr. L. A. Celi is funded by the National Institute of Health through R01 EB017205, DS-I Africa U54 TW012043-01 and Bridge2AI OT2OD032701, and the National Science Foundation through ITEST #2148451. Dr. C. M. Sauer is supported by the German Research Foundation funded UMEA Clinician Scientist Program, under FU356/12-2., (© 2025. The Author(s).)
- Published
- 2025
- Full Text
- View/download PDF
24. Step-by-step causal analysis of EHRs to ground decision-making.
- Author
-
Doutreligne M, Struja T, Abecassis J, Morgand C, Celi LA, and Varoquaux G
- Abstract
Causal inference enables machine learning methods to estimate treatment effects of medical interventions from electronic health records (EHRs). The prevalence of such observational data and the difficulty for randomized controlled trials (RCT) to cover all population/treatment relationships make these methods increasingly attractive for studying causal effects. However, researchers should be wary of many pitfalls. We propose and illustrate a framework for causal inference estimating the effect of albumin on mortality in sepsis using an Intensive Care database (MIMIC-IV) and comparing various sensitivity analyses to results from RCTs as gold-standard. The first step is study design, using the target trial concept and the PICOT framework: Population (patients with sepsis), Intervention (combination of crystalloids and albumin for fluid resuscitation), Control (crystalloids only), Outcome (28-day mortality), Time (intervention start within 24h of admission). We show that too large treatment-initiation times induce immortal time bias. The second step is selection of the confounding variables based on expert knowledge. Increasingly adding confounders enables to recover the RCT results from observational data. As the third step, we assess the influence of multiple models with varying assumptions, showing that a doubly robust estimator (AIPW) with random forests proved to be the most reliable estimator. Results show that these steps are all important for valid causal estimates. A valid causal model can then be used to individualize decision making: subgroup analyses showed that treatment efficacy of albumin was better for patients >60 years old, males, and patients with septic shock. Without causal thinking, machine learning is not enough for optimal clinical decision on an individual patient level. Our step-by-step analytic framework helps avoiding many pitfalls of applying machine learning to EHR data, building models that avoid shortcuts and extract the best decision-making evidence., Competing Interests: Leo Anthony Celi is part of the editor board of PLOS Digital Health., (Copyright: © 2025 Doutreligne et al. This is an open access article distributed under the terms of the CreativeCommonsAttributionLicense, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2025
- Full Text
- View/download PDF
25. Clinical Notes as Narratives: Implications for Large Language Models in Healthcare.
- Author
-
Brender TD, Celi LA, and Cobert JM
- Abstract
Competing Interests: Declarations:. Ethics Approval:: Not applicable. Conflict of Interest:: The authors declare that they do not have a conflict of interest.
- Published
- 2025
- Full Text
- View/download PDF
26. Deep learning generalization for diabetic retinopathy staging from fundus images.
- Author
-
Men Y, Fhima J, Celi LA, Ribeiro LZ, Nakayama LF, and Behar JA
- Subjects
- Humans, Image Processing, Computer-Assisted methods, Female, Male, Middle Aged, Diabetic Retinopathy diagnostic imaging, Deep Learning, Fundus Oculi
- Abstract
Objective . Diabetic retinopathy (DR) is a serious diabetes complication that can lead to vision loss, making timely identification crucial. Existing data-driven algorithms for DR staging from digital fundus images (DFIs) often struggle with generalization due to distribution shifts between training and target domains. Approach . To address this, DRStageNet, a deep learning model, was developed using six public and independent datasets with 91 984 DFIs from diverse demographics. Five pretrained self-supervised vision transformers (ViTs) were benchmarked, with the best further trained using a multi-source domain (MSD) fine-tuning strategy. Main results . DINOv2 showed a 27.4% improvement in L-Kappa versus other pretrained ViT. MSD fine-tuning improved performance in four of five target domains. The error analysis revealing 60% of errors due to incorrect labels, 77.5% of which were correctly classified by DRStageNet. Significance . We developed DRStageNet, a DL model for DR, designed to accurately stage the condition while addressing the challenge of generalizing performance across target domains. The model and explainability heatmaps are available atwww.aimlab-technion.com/lirot-ai., (Creative Commons Attribution license.)
- Published
- 2025
- Full Text
- View/download PDF
27. Diversity in the medical research ecosystem: a descriptive scientometric analysis of over 49 000 studies and 150 000 authors published in high-impact medical journals between 2007 and 2022.
- Author
-
Charpignon ML, Matos J, Nakayama LF, Gallifant J, Alfonso PGI, Cobanaj M, Fiske AM, Gates AJ, Ho FDV, Jain U, Kashkooli M, Link N, McCoy LG, Shaffer J, and Celi LA
- Subjects
- Humans, Female, Male, Authorship, Developing Countries, Journal Impact Factor, Periodicals as Topic statistics & numerical data, Biomedical Research statistics & numerical data, Bibliometrics
- Abstract
Objectives: Health research that significantly impacts global clinical practice and policy is often published in high-impact factor (IF) medical journals. These outlets play a pivotal role in the worldwide dissemination of novel medical knowledge. However, researchers identifying as women and those affiliated with institutions in low- and middle-income countries (LMICs) have been largely under-represented in high-IF journals across multiple fields of medicine. To evaluate disparities in gender and geographical representation among authors who have published in any of five top general medical journals, we conducted scientometric analyses using a large-scale dataset extracted from the New England Journal of Medicine , Journal of the American Medical Association , The BMJ , The Lancet and Nature Medicine ., Methods: Author metadata from all articles published in the selected journals between 2007 and 2022 were collected using the DimensionsAI platform. The Genderize.io Application Programming Interface was then used to infer each author's likely gender based on their extracted first name. The World Bank country classification was used to map countries associated with researcher affiliations to the LMIC or the high-income country (HIC) category. We characterised the overall gender and country income category representation across the five medical journals. In addition, we computed article-level diversity metrics and contrasted their distributions across the journals., Results: We studied 151 536 authors across 49 764 articles published in five top medical journals, over a period spanning 15 years. On average, approximately one-third (33.1%) of the authors of a given paper were inferred to be women; this result was consistent across the journals we studied. Further, 86.6% of the teams were exclusively composed of HIC authors; in contrast, only 3.9% were exclusively composed of LMIC authors. The probability of serving as the first or last author was significantly higher if the author was inferred to be a man (18.1% vs 16.8%, p<0.01) or was affiliated with an institution in a HIC (16.9% vs 15.5%, p<0.01). Our primary finding reveals that having a diverse team promotes further diversity, within the same dimension (ie, gender or geography) and across dimensions. Notably, papers with at least one woman among the authors were more likely to also involve at least two LMIC authors (11.7% vs 10.4% in baseline, p<0.001; based on inferred gender); conversely, papers with at least one LMIC author were more likely to also involve at least two women (49.4% vs 37.6%, p<0.001; based on inferred gender)., Conclusion: We provide a scientometric framework to assess authorship diversity. Our research suggests that the inclusiveness of high-impact medical journals is limited in terms of both gender and geography. We advocate for medical journals to adopt policies and practices that promote greater diversity and collaborative research. In addition, our findings offer a first step towards understanding the composition of teams conducting medical research globally and an opportunity for individual authors to reflect on their own collaborative research practices and possibilities to cultivate more diverse partnerships in their work., Competing Interests: Competing interests: None declared., (© Author(s) (or their employer(s)) 2025. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ Group.)
- Published
- 2025
- Full Text
- View/download PDF
28. Preventing unrestricted and unmonitored AI experimentation in healthcare through transparency and accountability.
- Author
-
Comeau DS, Bitterman DS, and Celi LA
- Abstract
The integration of large language models (LLMs) into electronic health records offers potential benefits but raises significant ethical, legal, and operational concerns, including unconsented data use, lack of governance, and AI-related malpractice accountability. Sycophancy, feedback loop bias, and data reuse risk amplifying errors without proper oversight. To safeguard patients, especially the vulnerable, clinicians must advocate for patient-centered education, ethical practices, and robust oversight to prevent harm., Competing Interests: Competing interests: The authors declare no competing interests., (© 2025. The Author(s).)
- Published
- 2025
- Full Text
- View/download PDF
29. Rebooting artificial intelligence for health.
- Author
-
Mitchell WG, Wawira JG, and Celi LA
- Abstract
Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2025
- Full Text
- View/download PDF
30. The TRIPOD-LLM reporting guideline for studies using large language models.
- Author
-
Gallifant J, Afshar M, Ameen S, Aphinyanaphongs Y, Chen S, Cacciamani G, Demner-Fushman D, Dligach D, Daneshjou R, Fernandes C, Hansen LH, Landman A, Lehmann L, McCoy LG, Miller T, Moreno A, Munch N, Restrepo D, Savova G, Umeton R, Gichoya JW, Collins GS, Moons KGM, Celi LA, and Bitterman DS
- Subjects
- Humans, Artificial Intelligence, Research Design standards, Delphi Technique, Prognosis, Checklist, Guidelines as Topic
- Abstract
Large language models (LLMs) are rapidly being adopted in healthcare, necessitating standardized reporting guidelines. We present transparent reporting of a multivariable model for individual prognosis or diagnosis (TRIPOD)-LLM, an extension of the TRIPOD + artificial intelligence statement, addressing the unique challenges of LLMs in biomedical applications. TRIPOD-LLM provides a comprehensive checklist of 19 main items and 50 subitems, covering key aspects from title to discussion. The guidelines introduce a modular format accommodating various LLM research designs and tasks, with 14 main items and 32 subitems applicable across all categories. Developed through an expedited Delphi process and expert consensus, TRIPOD-LLM emphasizes transparency, human oversight and task-specific performance reporting. We also introduce an interactive website ( https://tripod-llm.vercel.app/ ) facilitating easy guideline completion and PDF generation for submission. As a living document, TRIPOD-LLM will evolve with the field, aiming to enhance the quality, reproducibility and clinical applicability of LLM research in healthcare through comprehensive reporting., Competing Interests: Competing interests: D.S.B. is an associate editor at Radiation Oncology and HemOnc.org, receives research funding from the American Association for Cancer Research, and provides advisory and consulting services for MercurialAI. D.D.F. is an associate editor at the Journal of the American Medical Informatics Association, is a member of the editorial board of Scientific Data, and receives funding from the intramural research program at the US National Library of Medicine, NIH. J.W.G. is a member of the editorial board of Radiology: Artificial Intelligence, BJR Artificial Intelligence and NEJM AI. All other authors declare no competing interests., (© 2025. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2025
- Full Text
- View/download PDF
31. Evaluation of a Non-Enzymatic Electrochemical Sensor Based on Co(OH) 2 -Functionalized Carbon Nanotubes for Glucose Detection.
- Author
-
Bolaños-Mendez D, Fernández L, Uribe R, Cunalata-Castro A, González G, Rojas I, Chico-Proano A, Debut A, Celi LA, and Espinoza-Montero P
- Subjects
- Electrodes, Limit of Detection, Nanocomposites chemistry, Reproducibility of Results, Hydroxides chemistry, Nanotubes, Carbon chemistry, Glucose analysis, Glucose chemistry, Electrochemical Techniques methods, Cobalt chemistry, Biosensing Techniques methods
- Abstract
This work reports on the assessment of a non-hydrolytic electrochemical sensor for glucose sensing that is developed using functionalized carbon nanotubes (fCNTs)/Co(OH)
2 . The morphology of the nanocomposite was investigated by scanning electron microscopy, which revealed that the CNTs interacted with Co(OH)2 . This content formed a nanocomposite that improved the electrochemical characterizations of the electrode, including the electrochemical active surface area and capacitance, thus improving sensitivity to glucose. In the electrochemical characterization by cyclic voltammetry and chronoamperometry, the increase in catalytic activity by Co(OH)2 improved the stability and reproducibility of the glucose sensor without the use of enzymes, and its concentration range was between 50 and 700 μmol L-1 . The sensor exhibited good linearity towards glucose with LOD value of 43.200 µmol L-1 , which proved that the Co(OH)2 -fCNTs composite is judicious for constructing cost effective and feasible sensor for glucose detection.- Published
- 2024
- Full Text
- View/download PDF
32. A toolbox for surfacing health equity harms and biases in large language models.
- Author
-
Pfohl SR, Cole-Lewis H, Sayres R, Neal D, Asiedu M, Dieng A, Tomasev N, Rashid QM, Azizi S, Rostamzadeh N, McCoy LG, Celi LA, Liu Y, Schaekermann M, Walton A, Parrish A, Nagpal C, Singh P, Dewitt A, Mansfield P, Prakash S, Heller K, Karthikesalingam A, Semturs C, Barral J, Corrado G, Matias Y, Smith-Loud J, Horn I, and Singhal K
- Subjects
- Humans, Language, Bias, Artificial Intelligence, Healthcare Disparities, Health Equity
- Abstract
Large language models (LLMs) hold promise to serve complex health information needs but also have the potential to introduce harm and exacerbate health disparities. Reliably evaluating equity-related model failures is a critical step toward developing systems that promote health equity. We present resources and methodologies for surfacing biases with potential to precipitate equity-related harms in long-form, LLM-generated answers to medical questions and conduct a large-scale empirical case study with the Med-PaLM 2 LLM. Our contributions include a multifactorial framework for human assessment of LLM-generated answers for biases and EquityMedQA, a collection of seven datasets enriched for adversarial queries. Both our human assessment framework and our dataset design process are grounded in an iterative participatory approach and review of Med-PaLM 2 answers. Through our empirical study, we find that our approach surfaces biases that may be missed by narrower evaluation approaches. Our experience underscores the importance of using diverse assessment methodologies and involving raters of varying backgrounds and expertise. While our approach is not sufficient to holistically assess whether the deployment of an artificial intelligence (AI) system promotes equitable health outcomes, we hope that it can be leveraged and built upon toward a shared goal of LLMs that promote accessible and equitable healthcare., Competing Interests: Competing interests: This study was funded by Google LLC and/or its subsidiary (Google). S.R.P., H.C.-L., R.S., D.N., M.A., A. Dieng, N.T., Q.M.R., S.A., N.R., Y.L., M.S., A.W., A.P., C.N., P.S., A. Dewitt, P.M., S.P., K.H., A.K., C.S., J.B., G.C., Y.M., J.S.-L., I.H. and K.S. are employees of Google and may own stock as a part of a standard compensation package. The other authors declare no competing interests., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
33. Detecting and Mitigating the Clever Hans Effect in Medical Imaging: A Scoping Review.
- Author
-
Vásquez-Venegas C, Wu C, Sundar S, Prôa R, Beloy FJ, Medina JR, McNichol M, Parvataneni K, Kurtzman N, Mirshawka F, Aguirre-Jerez M, Ebner DK, and Celi LA
- Abstract
The Clever Hans effect occurs when machine learning models rely on spurious correlations instead of clinically relevant features and poses significant challenges to the development of reliable artificial intelligence (AI) systems in medical imaging. This scoping review provides an overview of methods for identifying and addressing the Clever Hans effect in medical imaging AI algorithms. A total of 173 papers published between 2010 and 2024 were reviewed, and 37 articles were selected for detailed analysis, with classification into two categories: detection and mitigation approaches. Detection methods include model-centric, data-centric, and uncertainty and bias-based approaches, while mitigation strategies encompass data manipulation techniques, feature disentanglement and suppression, and domain knowledge-driven approaches. Despite the progress in detecting and mitigating the Clever Hans effect, the majority of current machine learning studies in medical imaging do not report or test for shortcut learning, highlighting the need for more rigorous validation and transparency in AI research. Future research should focus on creating standardized benchmarks, developing automated detection tools, and exploring the integration of detection and mitigation strategies to comprehensively address shortcut learning. Establishing community-driven best practices and leveraging interdisciplinary collaboration will be crucial for ensuring more reliable, generalizable, and equitable AI systems in healthcare., Competing Interests: Declarations. Ethics Approval: This study is a literature review and does not involve human or animal participants, nor does it include any original data collection. Therefore, ethics committee approval was not required. Consent to Participate: As this study is a literature review and does not involve the collection of new data from human or animal subjects, obtaining consent to participate was not applicable. Consent for Publication: This study is a literature review and does not involve individual data or images from participants. Therefore, obtaining consent to publish is not applicable. Conflict of Interest: The authors declare no competing interests., (© 2024. The Author(s) under exclusive licence to Society for Imaging Informatics in Medicine.)
- Published
- 2024
- Full Text
- View/download PDF
34. Analyzing how the components of the SOFA score change over time in their contribution to mortality.
- Author
-
Lam BD, Struja T, Li Y, Matos J, Chen Z, Liu X, Celi LA, Jia Y, and Raffa J
- Subjects
- Time Factors, Humans, Male, Female, Middle Aged, Aged, Intensive Care Units, Databases as Topic, Cohort Studies, Mortality, Organ Dysfunction Scores
- Abstract
Objective: Determine how each organ component of the SOFA score differs in its contribution to mortality risk and how that contribution may change over time., Methods: We performed multivariate logistic regression analysis to assess the contribution of each organ component to mortality risk on Days 1 and 7 of an intensive care unit stay. We used data from two publicly available datasets, eICU Collaborative Research Database (eICU-CRD) (208 hospitals) and Medical Information Mart for Intensive Care IV (MIMIC-IV) (1 hospital). The odds ratio of each SOFA component that contributed to mortality was calculated. Mortality was defined as death either in the intensive care unit or within 72 hours of discharge from the intensive care unit., Results: A total of 7,871 intensive care unit stays from eICU-CRD and 4,926 intensive care unit stays from MIMIC-IV were included. Liver dysfunction was most predictive of mortality on Day 1 in both cohorts (OR 1.3; 95%CI 1.2 - 1.4; OR 1.3; 95%CI 1.2 - 1.4, respectively). In the eICU-CRD cohort, central nervous system dysfunction was most predictive of mortality on Day 7 (OR 1.4; 95%CI 1.4 - 1.5). In the MIMIC-IV cohort, respiratory dysfunction (OR 1.4; 95%CI 1.3 - 1.5) and cardiovascular dysfunction (OR 1.4; 95%CI 1.3 - 1.5) were most predictive of mortality on Day 7., Conclusion: The SOFA score may be an oversimplification of how dysfunction of different organ systems contributes to mortality over time. Further research at a more granular timescale is needed to explore how the SOFA score can evolve and be ameliorated.
- Published
- 2024
- Full Text
- View/download PDF
35. Economics and Equity of Large Language Models: Health Care Perspective.
- Author
-
Nagarajan R, Kondo M, Salas F, Sezgin E, Yao Y, Klotzman V, Godambe SA, Khan N, Limon A, Stephenson G, Taraman S, Walton N, Ehwerhemuepha L, Pandit J, Pandita D, Weiss M, Golden C, Gold A, Henderson J, Shippy A, Celi LA, Hogan WR, Oermann EK, Sanger T, and Martel S
- Subjects
- Humans, Language, Delivery of Health Care
- Abstract
Large language models (LLMs) continue to exhibit noteworthy capabilities across a spectrum of areas, including emerging proficiencies across the health care continuum. Successful LLM implementation and adoption depend on digital readiness, modern infrastructure, a trained workforce, privacy, and an ethical regulatory landscape. These factors can vary significantly across health care ecosystems, dictating the choice of a particular LLM implementation pathway. This perspective discusses 3 LLM implementation pathways-training from scratch pathway (TSP), fine-tuned pathway (FTP), and out-of-the-box pathway (OBP)-as potential onboarding points for health systems while facilitating equitable adoption. The choice of a particular pathway is governed by needs as well as affordability. Therefore, the risks, benefits, and economics of these pathways across 4 major cloud service providers (Amazon, Microsoft, Google, and Oracle) are presented. While cost comparisons, such as on-demand and spot pricing across the cloud service providers for the 3 pathways, are presented for completeness, the usefulness of managed services and cloud enterprise tools is elucidated. Managed services can complement the traditional workforce and expertise, while enterprise tools, such as federated learning, can overcome sample size challenges when implementing LLMs using health care data. Of the 3 pathways, TSP is expected to be the most resource-intensive regarding infrastructure and workforce while providing maximum customization, enhanced transparency, and performance. Because TSP trains the LLM using enterprise health care data, it is expected to harness the digital signatures of the population served by the health care system with the potential to impact outcomes. The use of pretrained models in FTP is a limitation. It may impact its performance because the training data used in the pretrained model may have hidden bias and may not necessarily be health care-related. However, FTP provides a balance between customization, cost, and performance. While OBP can be rapidly deployed, it provides minimal customization and transparency without guaranteeing long-term availability. OBP may also present challenges in interfacing seamlessly with downstream applications in health care settings with variations in pricing and use over time. Lack of customization in OBP can significantly limit its ability to impact outcomes. Finally, potential applications of LLMs in health care, including conversational artificial intelligence, chatbots, summarization, and machine translation, are highlighted. While the 3 implementation pathways discussed in this perspective have the potential to facilitate equitable adoption and democratization of LLMs, transitions between them may be necessary as the needs of health systems evolve. Understanding the economics and trade-offs of these onboarding pathways can guide their strategic adoption and demonstrate value while impacting health care outcomes favorably., (©Radha Nagarajan, Midori Kondo, Franz Salas, Emre Sezgin, Yuan Yao, Vanessa Klotzman, Sandip A Godambe, Naqi Khan, Alfonso Limon, Graham Stephenson, Sharief Taraman, Nephi Walton, Louis Ehwerhemuepha, Jay Pandit, Deepti Pandita, Michael Weiss, Charles Golden, Adam Gold, John Henderson, Angela Shippy, Leo Anthony Celi, William R Hogan, Eric K Oermann, Terence Sanger, Steven Martel. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 14.11.2024.)
- Published
- 2024
- Full Text
- View/download PDF
36. Septic Shock Requiring Three Vasopressors: Patient Demographics and Outcomes.
- Author
-
Kwak GH, Madushani RWMA, Adhikari L, Yan AY, Rosenthal ES, Sebbane K, Yanes Z, Restrepo D, Wong A, Celi LA, and Kistler EA
- Subjects
- Humans, Male, Female, Aged, Retrospective Studies, Middle Aged, Intensive Care Units, Cohort Studies, Aged, 80 and over, Demography, Shock, Septic drug therapy, Shock, Septic mortality, Vasoconstrictor Agents therapeutic use, Hospital Mortality
- Abstract
Objectives: Septic shock is a common condition necessitating timely management including hemodynamic support with vasopressors. Despite the high prevalence and mortality, there is limited data characterizing patients who require three or more vasopressors. We sought to define the demographics, outcomes, and prognostic determinants associated with septic shock requiring three or more vasopressors., Design: This is a multicenter retrospective cohort of two ICU databases, Medical Information Mart for Intensive Care IV (MIMIC-IV) and electronic ICU-Clinical Research Database, which include over 400,000 patients admitted to 342 ICUs., Patients: Inclusion criteria entailed patients who were: 1) age 18 years old and older, 2) admitted to any ICU, 3) administered at least three vasopressors for at least 2 hours at any time during their ICU stay, and 4) identified to have sepsis based on the Sepsis-3 criteria., Interventions: None., Measurements and Main Results: A total of 3447 patients met inclusion criteria. The median age was 67 years, 60.5% were male, and 96.6% had full code orders at the time of the third vasopressor initiation. Septic shock requiring three or more vasopressors was associated with 57.6% in-hospital mortality. Code status changes occurred in 23.9% of patients following initiation of a third vasopressor. Elevated lactate upon ICU admission (odds ratio [95% CI], 2.79 [2.73-2.85]), increased duration of time between ICU admission and third vasopressor initiation (1.78 [1.69-1.87]), increased serum creatinine (1.61 [1.59-1.62]), and age above 60 years (1.47 [1.41-1.54]) were independently associated with an increased risk of mortality based on analysis of the MIMIC-IV database. Non-White race and Richmond Agitation-Sedation Scale scores were not associated with mortality., Conclusions: Septic shock requiring three vasopressors is associated with exceptionally high mortality. Knowledge of patients at highest risk of mortality in this population may inform management and expectations conveyed in shared decision-making., Competing Interests: The authors have disclosed that they do not have any potential conflicts of interest., (Copyright © 2024 The Authors. Published by Wolters Kluwer Health, Inc. on behalf of the Society of Critical Care Medicine.)
- Published
- 2024
- Full Text
- View/download PDF
37. Raising awareness of potential biases in medical machine learning: Experience from a Datathon.
- Author
-
Hochheiser H, Klug J, Mathie T, Pollard TJ, Raffa JD, Ballard SL, Conrad EA, Edakalavan S, Joseph A, Alnomasy N, Nutman S, Hill V, Kapoor S, Claudio EP, Kravchenko OV, Li R, Nourelahi M, Diaz J, Taylor WM, Rooney SR, Woeltje M, Celi LA, and Horvat CM
- Abstract
Objective: To challenge clinicians and informaticians to learn about potential sources of bias in medical machine learning models through investigation of data and predictions from an open-source severity of illness score., Methods: Over a two-day period (total elapsed time approximately 28 hours), we conducted a datathon that challenged interdisciplinary teams to investigate potential sources of bias in the Global Open Source Severity of Illness Score. Teams were invited to develop hypotheses, to use tools of their choosing to identify potential sources of bias, and to provide a final report., Results: Five teams participated, three of which included both informaticians and clinicians. Most (4/5) used Python for analyses, the remaining team used R. Common analysis themes included relationship of the GOSSIS-1 prediction score with demographics and care related variables; relationships between demographics and outcomes; calibration and factors related to the context of care; and the impact of missingness. Representativeness of the population, differences in calibration and model performance among groups, and differences in performance across hospital settings were identified as possible sources of bias., Discussion: Datathons are a promising approach for challenging developers and users to explore questions relating to unrecognized biases in medical machine learning algorithms.
- Published
- 2024
- Full Text
- View/download PDF
38. Artificial intelligence and global health equity.
- Author
-
Dychiao RG, Nazer L, Mlombwa D, and Celi LA
- Subjects
- Humans, Artificial Intelligence, Health Equity, Global Health
- Abstract
Competing Interests: Competing interests: The BMJ has judged that there are no disqualifying financial ties to commercial companies. The authors declare the following other interests: DM declares receiving travel expenses, meals, and accommodation from Operation Smile and St Luke’s College of Health Sciences. Further details of The BMJ policy on financial interests are here: https://www.bmj.com/sites/default/files/attachments/resources/2016/03/16-current-bmj-education-coi-form.pdf.
- Published
- 2024
- Full Text
- View/download PDF
39. Unmasking biases and navigating pitfalls in the ophthalmic artificial intelligence lifecycle: A narrative review.
- Author
-
Nakayama LF, Matos J, Quion J, Novaes F, Mitchell WG, Mwavu R, Hung CJJ, Santiago APD, Phanphruk W, Cardoso JS, and Celi LA
- Abstract
Over the past 2 decades, exponential growth in data availability, computational power, and newly available modeling techniques has led to an expansion in interest, investment, and research in Artificial Intelligence (AI) applications. Ophthalmology is one of many fields that seek to benefit from AI given the advent of telemedicine screening programs and the use of ancillary imaging. However, before AI can be widely deployed, further work must be done to avoid the pitfalls within the AI lifecycle. This review article breaks down the AI lifecycle into seven steps-data collection; defining the model task; data preprocessing and labeling; model development; model evaluation and validation; deployment; and finally, post-deployment evaluation, monitoring, and system recalibration-and delves into the risks for harm at each step and strategies for mitigating them., Competing Interests: LAC is the Editor-In-Chief of PLOS Digital Health., (Copyright: © 2024 Nakayama et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2024
- Full Text
- View/download PDF
40. Understanding and training for the impact of large language models and artificial intelligence in healthcare practice: a narrative review.
- Author
-
McCoy LG, Ci Ng FY, Sauer CM, Yap Legaspi KE, Jain B, Gallifant J, McClurkin M, Hammond A, Goode D, Gichoya J, and Celi LA
- Subjects
- Humans, Education, Medical methods, Clinical Competence, Language, Delivery of Health Care, Artificial Intelligence
- Abstract
Reports of Large Language Models (LLMs) passing board examinations have spurred medical enthusiasm for their clinical integration. Through a narrative review, we reflect upon the skill shifts necessary for clinicians to succeed in an LLM-enabled world, achieving benefits while minimizing risks. We suggest how medical education must evolve to prepare clinicians capable of navigating human-AI systems., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
41. Towards long-tailed, multi-label disease classification from chest X-ray: Overview of the CXR-LT challenge.
- Author
-
Holste G, Zhou Y, Wang S, Jaiswal A, Lin M, Zhuge S, Yang Y, Kim D, Nguyen-Mau TH, Tran MT, Jeong J, Park W, Ryu J, Hong F, Verma A, Yamagishi Y, Kim C, Seo H, Kang M, Celi LA, Lu Z, Summers RM, Shih G, Wang Z, and Peng Y
- Subjects
- Humans, Radiographic Image Interpretation, Computer-Assisted methods, Thoracic Diseases diagnostic imaging, Thoracic Diseases classification, Algorithms, Radiography, Thoracic methods
- Abstract
Many real-world image recognition problems, such as diagnostic medical imaging exams, are "long-tailed" - there are a few common findings followed by many more relatively rare conditions. In chest radiography, diagnosis is both a long-tailed and multi-label problem, as patients often present with multiple findings simultaneously. While researchers have begun to study the problem of long-tailed learning in medical image recognition, few have studied the interaction of label imbalance and label co-occurrence posed by long-tailed, multi-label disease classification. To engage with the research community on this emerging topic, we conducted an open challenge, CXR-LT, on long-tailed, multi-label thorax disease classification from chest X-rays (CXRs). We publicly release a large-scale benchmark dataset of over 350,000 CXRs, each labeled with at least one of 26 clinical findings following a long-tailed distribution. We synthesize common themes of top-performing solutions, providing practical recommendations for long-tailed, multi-label medical image classification. Finally, we use these insights to propose a path forward involving vision-language foundation models for few- and zero-shot disease classification., Competing Interests: Declaration of competing interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: R.M.S has received royalties for patent or software licenses from iCAD, Philips, PingAn, ScanMed, Translation Holdings, and MGB as well as research support form CRADA with PingAn. The remaining authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2024 Elsevier B.V. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
42. A Clinician's Guide to Understanding Bias in Critical Clinical Prediction Models.
- Author
-
Matos J, Gallifant J, Chowdhury A, Economou-Zavlanos N, Charpignon ML, Gichoya J, Celi LA, Nazer L, King H, and Wong AI
- Subjects
- Humans, Bias, Clinical Decision-Making, Critical Care standards, Artificial Intelligence
- Abstract
This narrative review focuses on the role of clinical prediction models in supporting informed decision-making in critical care, emphasizing their 2 forms: traditional scores and artificial intelligence (AI)-based models. Acknowledging the potential for both types to embed biases, the authors underscore the importance of critical appraisal to increase our trust in models. The authors outline recommendations and critical care examples to manage risk of bias in AI models. The authors advocate for enhanced interdisciplinary training for clinicians, who are encouraged to explore various resources (books, journals, news Web sites, and social media) and events (Datathons) to deepen their understanding of risk of bias., Competing Interests: Disclosure A.I. Wong holds equity and management roles in Ataia Medical. A.I. Wong is supported by the Duke CTSI by the National Center for Advancing Translational Sciences, United States (NCATS) of the National Institutes of Health, United States under UL1TR002553 and REACH Equity under the National Institute on Minority Health and Health Disparities, United States (NIMHD) of the National Institutes of Health under U54MD012530. All other authors have no conflicts to disclose., (Copyright © 2024 Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
43. Echocardiogram Vector Embeddings Via R3D Transformer for the Advancement of Automated Echocardiography.
- Author
-
Chung DJ, Lee SM, Kaker V, Zhao Y, Bin I, Perera S, Sasankan P, Tang G, Kazzi B, Kuo PC, Celi LA, and Kpodonu J
- Abstract
Background: Ejection fraction (EF) estimation informs patient plans in the ICU, and low EF can indicate ventricular systolic dysfunction, which increases the risk of adverse events including heart failure. Automated echocardiography models are an attractive solution for high-variance human EF estimation, and key to this goal are echocardiogram vector embeddings, which are a critical resource for computational researchers., Objectives: The authors aimed to extract the vector embeddings from each echocardiogram in the EchoNet dataset using a classifier trained to classify EF as healthy (>50%) or unhealthy (<= 50%) to create an embeddings dataset for computational researchers., Methods: We repurposed an R3D transformer to classify whether patient EF is below or above 50%. Training, validation, and testing were done on the EchoNet dataset of 10,030 echocardiograms, and the resulting model generated embeddings for each of these videos., Results: We extracted 400-dimensional vector embeddings for each of the 10,030 EchoNet echocardiograms using the trained R3D model, which achieved a test AUC of 0.916 and 87.5% accuracy, approaching the performance of comparable studies., Conclusions: We present 10,030 vector embeddings learned by this model as a resource to the cardiology research community, as well as the trained model itself. These vectors enable algorithmic improvements and multimodal applications within automated echocardiography, benefitting the research community and those with ventricular systolic dysfunction (https://github.com/Team-Echo-MIT/r3d-v0-embeddings)., Competing Interests: The authors have reported that they have no relationships relevant to the contents of this paper to disclose., (© 2024 The Authors.)
- Published
- 2024
- Full Text
- View/download PDF
44. Can we ensure a safe and effective integration of language models in oncology?
- Author
-
Corti C and Celi LA
- Abstract
Competing Interests: CCo reports reimbursement for travel and lodging by Veracyte. The competing interests were outside the submitted work. LAC is funded by the National Institute of Health through R01 EB017205, DS-I Africa U54 TW012043-01 and Bridge2AI OT2OD032701, and the National Science Foundation through ITEST #2148451. The competing interests were outside the submitted work.
- Published
- 2024
- Full Text
- View/download PDF
45. Low-cost and convenient screening of disease using analysis of physical measurements and recordings.
- Author
-
Chandra J, Lin R, Kancherla D, Scott S, Sul D, Andrade D, Marzouk S, Iyer JM, Wasswa W, Villanueva C, and Celi LA
- Abstract
In recent years, there has been substantial work in low-cost medical diagnostics based on the physical manifestations of disease. This is due to advancements in data analysis techniques and classification algorithms and the increased availability of computing power through smart devices. Smartphones and their ability to interface with simple sensors such as inertial measurement units (IMUs), microphones, piezoelectric sensors, etc., or with convenient attachments such as lenses have revolutionized the ability collect medically relevant data easily. Even if the data has relatively low resolution or signal to noise ratio, newer algorithms have made it possible to identify disease with this data. Many low-cost diagnostic tools have been created in medical fields spanning from neurology to dermatology to obstetrics. These tools are particularly useful in low-resource areas where access to expensive diagnostic equipment may not be possible. The ultimate goal would be the creation of a "diagnostic toolkit" consisting of a smartphone and a set of sensors and attachments that can be used to screen for a wide set of diseases in a community healthcare setting. However, there are a few concerns that still need to be overcome in low-cost diagnostics: lack of incentives to bring these devices to market, algorithmic bias, "black box" nature of the algorithms, and data storage/transfer concerns., Competing Interests: Leo Anthony Celi is the Editor-in Chief of PLOS Digital Health. Leo Anthony Celi is also funded by the National Institute of Health through R01 EB017205, DS-I Africa U54 TW012043-01 and Bridge2AI OT2OD032701, and the National Science Foundation through ITEST #2148451., (Copyright: © 2024 Chandra et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2024
- Full Text
- View/download PDF
46. Assessment of Clinical Metadata on the Accuracy of Retinal Fundus Image Labels in Diabetic Retinopathy in Uganda: Case-Crossover Study Using the Multimodal Database of Retinal Images in Africa.
- Author
-
Arunga S, Morley KE, Kwaga T, Morley MG, Nakayama LF, Mwavu R, Kaggwa F, Ssempiira J, Celi LA, Haberer JE, and Obua C
- Subjects
- Humans, Uganda, Female, Male, Cross-Over Studies, Databases, Factual, Middle Aged, Fundus Oculi, Adult, Sensitivity and Specificity, Retina diagnostic imaging, Retina pathology, Diabetic Retinopathy diagnostic imaging, Diabetic Retinopathy diagnosis, Metadata
- Abstract
Background: Labeling color fundus photos (CFP) is an important step in the development of artificial intelligence screening algorithms for the detection of diabetic retinopathy (DR). Most studies use the International Classification of Diabetic Retinopathy (ICDR) to assign labels to CFP, plus the presence or absence of macular edema (ME). Images can be grouped as referrable or nonreferrable according to these classifications. There is little guidance in the literature about how to collect and use metadata as a part of the CFP labeling process., Objective: This study aimed to improve the quality of the Multimodal Database of Retinal Images in Africa (MoDRIA) by determining whether the availability of metadata during the image labeling process influences the accuracy, sensitivity, and specificity of image labels. MoDRIA was developed as one of the inaugural research projects of the Mbarara University Data Science Research Hub, part of the Data Science for Health Discovery and Innovation in Africa (DS-I Africa) initiative., Methods: This is a crossover assessment with 2 groups and 2 phases. Each group had 10 randomly assigned labelers who provided an ICDR score and the presence or absence of ME for each of the 50 CFP in a test image with and without metadata including blood pressure, visual acuity, glucose, and medical history. Specificity and sensitivity of referable retinopathy were based on ICDR scores, and ME was calculated using a 2-sided t test. Comparison of sensitivity and specificity for ICDR scores and ME with and without metadata for each participant was calculated using the Wilcoxon signed rank test. Statistical significance was set at P<.05., Results: The sensitivity for identifying referrable DR with metadata was 92.8% (95% CI 87.6-98.0) compared with 93.3% (95% CI 87.6-98.9) without metadata, and the specificity was 84.9% (95% CI 75.1-94.6) with metadata compared with 88.2% (95% CI 79.5-96.8) without metadata. The sensitivity for identifying the presence of ME was 64.3% (95% CI 57.6-71.0) with metadata, compared with 63.1% (95% CI 53.4-73.0) without metadata, and the specificity was 86.5% (95% CI 81.4-91.5) with metadata compared with 87.7% (95% CI 83.9-91.5) without metadata. The sensitivity and specificity of the ICDR score and the presence or absence of ME were calculated for each labeler with and without metadata. No findings were statistically significant., Conclusions: The sensitivity and specificity scores for the detection of referrable DR were slightly better without metadata, but the difference was not statistically significant. We cannot make definitive conclusions about the impact of metadata on the sensitivity and specificity of image labels in our study. Given the importance of metadata in clinical situations, we believe that metadata may benefit labeling quality. A more rigorous study to determine the sensitivity and specificity of CFP labels with and without metadata is recommended., (©Simon Arunga, Katharine Elise Morley, Teddy Kwaga, Michael Gerard Morley, Luis Filipe Nakayama, Rogers Mwavu, Fred Kaggwa, Julius Ssempiira, Leo Anthony Celi, Jessica E Haberer, Celestino Obua. Originally published in JMIR Formative Research (https://formative.jmir.org), 18.09.2024.)
- Published
- 2024
- Full Text
- View/download PDF
47. The role of Open Access Data in democratizing healthcare AI: A pathway to research enhancement, patient well-being and treatment equity in Andalusia, Spain.
- Author
-
Ritoré Á, Jiménez CM, González JL, Rejón-Parrilla JC, Hervás P, Toro E, Parra-Calderón CL, Celi LA, Túnez I, and Armengol de la Hoz MÁ
- Abstract
Competing Interests: Leo Anthony Celi is the Editor-in-Chief of PLOS Digital Health and Miguel Ángel Armengol de la Hoz is a Section Editor of PLOS Digital Health. AR, CMJ, JLG, JCR, PH, ET, IT and MAA are associated with the Andalusian Regional Ministry of Health and Consumer Affairs.
- Published
- 2024
- Full Text
- View/download PDF
48. Ethical debates amidst flawed healthcare artificial intelligence metrics.
- Author
-
Gallifant J, Bitterman DS, Celi LA, Gichoya JW, Matos J, McCoy LG, and Pierce RL
- Published
- 2024
- Full Text
- View/download PDF
49. Smartphone Imaging and AI: A Commentary on Cardiac Device Classification.
- Author
-
Júdice de Mattos Farina EM and Celi LA
- Subjects
- Humans, Smartphone, Artificial Intelligence
- Published
- 2024
- Full Text
- View/download PDF
50. Assessment of fluid responsiveness using pulse pressure variation, stroke volume variation, plethysmographic variability index, central venous pressure, and inferior vena cava variation in patients undergoing mechanical ventilation: a systematic review and meta-analysis.
- Author
-
Chaves RCF, Barbas CSV, Queiroz VNF, Serpa Neto A, Deliberato RO, Pereira AJ, Timenetsky KT, Silva Júnior JM, Takaoka F, de Backer D, Celi LA, and Corrêa TD
- Subjects
- Humans, Blood Pressure physiology, Respiration, Artificial methods, Respiration, Artificial statistics & numerical data, Central Venous Pressure physiology, Fluid Therapy methods, Fluid Therapy standards, Fluid Therapy statistics & numerical data, Vena Cava, Inferior physiology, Stroke Volume physiology, Plethysmography methods
- Abstract
Importance: Maneuvers assessing fluid responsiveness before an intravascular volume expansion may limit useless fluid administration, which in turn may improve outcomes., Objective: To describe maneuvers for assessing fluid responsiveness in mechanically ventilated patients., Registration: The protocol was registered at PROSPERO: CRD42019146781., Information Sources and Search: PubMed, EMBASE, CINAHL, SCOPUS, and Web of Science were search from inception to 08/08/2023., Study Selection and Data Collection: Prospective and intervention studies were selected., Statistical Analysis: Data for each maneuver were reported individually and data from the five most employed maneuvers were aggregated. A traditional and a Bayesian meta-analysis approach were performed., Results: A total of 69 studies, encompassing 3185 fluid challenges and 2711 patients were analyzed. The prevalence of fluid responsiveness was 49.9%. Pulse pressure variation (PPV) was studied in 40 studies, mean threshold with 95% confidence intervals (95% CI) = 11.5 (10.5-12.4)%, and area under the receiver operating characteristics curve (AUC) with 95% CI was 0.87 (0.84-0.90). Stroke volume variation (SVV) was studied in 24 studies, mean threshold with 95% CI = 12.1 (10.9-13.3)%, and AUC with 95% CI was 0.87 (0.84-0.91). The plethysmographic variability index (PVI) was studied in 17 studies, mean threshold = 13.8 (12.3-15.3)%, and AUC was 0.88 (0.82-0.94). Central venous pressure (CVP) was studied in 12 studies, mean threshold with 95% CI = 9.0 (7.7-10.1) mmHg, and AUC with 95% CI was 0.77 (0.69-0.87). Inferior vena cava variation (∆IVC) was studied in 8 studies, mean threshold = 15.4 (13.3-17.6)%, and AUC with 95% CI was 0.83 (0.78-0.89)., Conclusions: Fluid responsiveness can be reliably assessed in adult patients under mechanical ventilation. Among the five maneuvers compared in predicting fluid responsiveness, PPV, SVV, and PVI were superior to CVP and ∆IVC. However, there is no data supporting any of the above mentioned as being the best maneuver. Additionally, other well-established tests, such as the passive leg raising test, end-expiratory occlusion test, and tidal volume challenge, are also reliable., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.