Text mining approaches for dealing with the rapidly expanding literature on COVID-19
- Source :
- Briefings in Bioinformatics
- Publication Year :
- 2020
- Publisher :
- Oxford University Press, 2020.
Abstract
- More than 50 000 papers have been published about COVID-19 since the beginning of 2020 and several hundred new papers continue to be published every day. This incredible rate of scientific productivity leads to information overload, making it difficult for researchers, clinicians and public health officials to keep up with the latest findings. Automated text mining techniques for searching, reading and summarizing papers are helpful for addressing information overload. In this review, we describe the many resources that have been introduced to support text mining applications over the COVID-19 literature; specifically, we discuss the corpora, modeling resources, systems and shared tasks that have been introduced for COVID-19. We compile a list of 39 systems that provide functionality such as search, discovery, visualization and summarization over the COVID-19 literature. For each system, we provide a qualitative description and assessment of the system’s performance, unique data or user interface features and modeling decisions. Many systems focus on search and discovery, though several systems provide novel features, such as the ability to summarize findings over multiple documents or linking between scientific articles and clinical trials. We also describe the public corpora, models and shared tasks that have been introduced to help reduce repeated effort among community members; some of these resources (especially shared tasks) can provide a basis for comparing the performance of different systems. Finally, we summarize promising results and open challenges for text mining the COVID-19 literature.
- Subjects :
- AcademicSubjects/SCI01060
Computer science
text mining
CORD-19
03 medical and health sciences
0302 clinical medicine
Reading (process)
Question answering
Data Mining
Humans
030212 general & internal medicine
information retrieval
information extraction
natural language processing
shared tasks
Molecular Biology
030304 developmental biology
0303 health sciences
Focus (computing)
SARS-CoV-2
summarization
COVID-19
Articles
Data science
Automatic summarization
Information overload
Visualization
User interface
Information Systems
Details
- Language :
- English
- ISSN :
- 1477-4054 and 1467-5463
- Database :
- OpenAIRE
- Journal :
- Briefings in Bioinformatics
- Accession number :
- edsair.doi.dedup.....6aae3c75bd98f1d47014c874ed529e0f