Back to Search
Start Over
Evaluating the performance of large language models in haematopoietic stem cell transplantation decision‐making.
- Source :
-
British Journal of Haematology . Apr2024, Vol. 204 Issue 4, p1523-1528. 6p. - Publication Year :
- 2024
-
Abstract
- Summary: In a first‐of‐its‐kind study, we assessed the capabilities of large language models (LLMs) in making complex decisions in haematopoietic stem cell transplantation. The evaluation was conducted not only for Generative Pre‐trained Transformer 4 (GPT‐4) but also conducted on other artificial intelligence models: PaLm 2 and Llama‐2. Using detailed haematological histories that include both clinical, molecular and donor data, we conducted a triple‐blind survey to compare LLMs to haematology residents. We found that residents significantly outperformed LLMs (p = 0.02), particularly in transplant eligibility assessment (p = 0.01). Our triple‐blind methodology aimed to mitigate potential biases in evaluating LLMs and revealed both their promise and limitations in deciphering complex haematological clinical scenarios. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 00071048
- Volume :
- 204
- Issue :
- 4
- Database :
- Academic Search Index
- Journal :
- British Journal of Haematology
- Publication Type :
- Academic Journal
- Accession number :
- 176535359
- Full Text :
- https://doi.org/10.1111/bjh.19200