Back to Search Start Over

Development and testing of a multi-lingual Natural Language Processing-based deep learning system in 10 languages for COVID-19 pandemic crisis: A multi-center study.

Authors :
Yang LWY
Ng WY
Lei X
Tan SCY
Wang Z
Yan M
Pargi MK
Zhang X
Lim JS
Gunasekeran DV
Tan FCP
Lee CE
Yeo KK
Tan HK
Ho HSS
Tan BWB
Wong TY
Kwek KYC
Goh RSM
Liu Y
Ting DSW
Source :
Frontiers in public health [Front Public Health] 2023 Feb 13; Vol. 11, pp. 1063466. Date of Electronic Publication: 2023 Feb 13 (Print Publication: 2023).
Publication Year :
2023

Abstract

Purpose: The COVID-19 pandemic has drastically disrupted global healthcare systems. With the higher demand for healthcare and misinformation related to COVID-19, there is a need to explore alternative models to improve communication. Artificial Intelligence (AI) and Natural Language Processing (NLP) have emerged as promising solutions to improve healthcare delivery. Chatbots could fill a pivotal role in the dissemination and easy accessibility of accurate information in a pandemic. In this study, we developed a multi-lingual NLP-based AI chatbot, DR-COVID, which responds accurately to open-ended, COVID-19 related questions. This was used to facilitate pandemic education and healthcare delivery.<br />Methods: First, we developed DR-COVID with an ensemble NLP model on the Telegram platform (https://t.me/drcovid_nlp_chatbot). Second, we evaluated various performance metrics. Third, we evaluated multi-lingual text-to-text translation to Chinese, Malay, Tamil, Filipino, Thai, Japanese, French, Spanish, and Portuguese. We utilized 2,728 training questions and 821 test questions in English. Primary outcome measurements were (A) overall and top 3 accuracies; (B) Area Under the Curve (AUC), precision, recall, and F1 score. Overall accuracy referred to a correct response for the top answer, whereas top 3 accuracy referred to an appropriate response for any one answer amongst the top 3 answers. AUC and its relevant matrices were obtained from the Receiver Operation Characteristics (ROC) curve. Secondary outcomes were (A) multi-lingual accuracy; (B) comparison to enterprise-grade chatbot systems. The sharing of training and testing datasets on an open-source platform will also contribute to existing data.<br />Results: Our NLP model, utilizing the ensemble architecture, achieved overall and top 3 accuracies of 0.838 [95% confidence interval (CI): 0.826-0.851] and 0.922 [95% CI: 0.913-0.932] respectively. For overall and top 3 results, AUC scores of 0.917 [95% CI: 0.911-0.925] and 0.960 [95% CI: 0.955-0.964] were achieved respectively. We achieved multi-linguicism with nine non-English languages, with Portuguese performing the best overall at 0.900. Lastly, DR-COVID generated answers more accurately and quickly than other chatbots, within 1.12-2.15 s across three devices tested.<br />Conclusion: DR-COVID is a clinically effective NLP-based conversational AI chatbot, and a promising solution for healthcare delivery in the pandemic era.<br />Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.<br /> (Copyright © 2023 Yang, Ng, Lei, Tan, Wang, Yan, Pargi, Zhang, Lim, Gunasekeran, Tan, Lee, Yeo, Tan, Ho, Tan, Wong, Kwek, Goh, Liu and Ting.)

Details

Language :
English
ISSN :
2296-2565
Volume :
11
Database :
MEDLINE
Journal :
Frontiers in public health
Publication Type :
Academic Journal
Accession number :
36860378
Full Text :
https://doi.org/10.3389/fpubh.2023.1063466