Performance of active learning models for screening prioritization in systematic reviews: a simulation study into the Average Time to Discover relevant records.

Authors :: Ferdinands G
Schram R
de Bruin J
Bagheri A
Oberski DL
Tummers L
Teijema JJ
van de Schoot R
Source :: Systematic reviews [Syst Rev] 2023 Jun 20; Vol. 12 (1), pp. 100. Date of Electronic Publication: 2023 Jun 20.
Publication Year :: 2023
Abstract: Background: Conducting a systematic review demands a significant amount of effort in screening titles and abstracts. To accelerate this process, various tools that utilize active learning have been proposed. These tools allow the reviewer to interact with machine learning software to identify relevant publications as early as possible. The goal of this study is to gain a comprehensive understanding of active learning models for reducing the workload in systematic reviews through a simulation study.
Methods: The simulation study mimics the process of a human reviewer screening records while interacting with an active learning model. Different active learning models were compared based on four classification techniques (naive Bayes, logistic regression, support vector machines, and random forest) and two feature extraction strategies (TF-IDF and doc2vec). The performance of the models was compared for six systematic review datasets from different research areas. The evaluation of the models was based on the Work Saved over Sampling (WSS) and recall. Additionally, this study introduces two new statistics, Time to Discovery (TD) and Average Time to Discovery (ATD).
Results: The models reduce the number of publications needed to screen by 63.9 to 91.7% while still finding 95% of all relevant records (WSS@95). Recall of the models was defined as the proportion of relevant records found after screening 10% of all records and ranges from 53.6 to 99.8%. The ATD values range from 1.4 to 11.7% and indicate the average proportion of labeling decisions the researcher needs to make to detect a relevant record. The ATD values display a similar ranking across the simulations as the recall and WSS values.
Conclusions: Active learning models for screening prioritization demonstrate significant potential for reducing the workload in systematic reviews. The naive Bayes + TF-IDF model yielded the best results overall. The Average Time to Discovery (ATD) measures performance of active learning models throughout the entire screening process without the need for an arbitrary cut-off point. This makes the ATD a promising metric for comparing the performance of different models across different datasets.
(© 2023. The Author(s).)
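
The three metrics named in the abstract can be illustrated with a minimal sketch. It assumes that a simulation run is summarized as a list of 0/1 labels in the order the records were screened (1 = relevant) and that at least one record is relevant. The function names are hypothetical, the WSS formula is the commonly used definition (share of records left unscreened at the target recall, minus one minus that recall), and the ATD is taken as the mean screening position of the relevant records divided by the total number of records. The paper's exact computation, for example its handling of prior-knowledge records, may differ.

```python
import math
from typing import Sequence


def recall_at(labels: Sequence[int], fraction: float) -> float:
    """Proportion of relevant records found after screening `fraction` of all records."""
    n_screened = math.ceil(fraction * len(labels))
    return sum(labels[:n_screened]) / sum(labels)


def wss_at(labels: Sequence[int], recall: float = 0.95) -> float:
    """Work Saved over Sampling at the given recall level (assumed standard definition)."""
    target = math.ceil(recall * sum(labels))  # relevant records that must be found
    found = 0
    for n_screened, label in enumerate(labels, start=1):
        found += label
        if found >= target:
            unscreened_share = (len(labels) - n_screened) / len(labels)
            return unscreened_share - (1 - recall)
    raise ValueError("target recall is never reached")


def average_time_to_discovery(labels: Sequence[int]) -> float:
    """Mean screening position of the relevant records, as a proportion of all records."""
    positions = [i for i, label in enumerate(labels, start=1) if label == 1]
    return sum(positions) / len(positions) / len(labels)


# Toy screening order: 10 records, 4 of them relevant (illustrative data only).
order = [1, 1, 0, 1, 0, 0, 0, 1, 0, 0]
print(recall_at(order, 0.5))
print(wss_at(order, 0.95))
print(average_time_to_discovery(order))
```

Unlike recall and WSS, the ATD function takes no cut-off argument, which reflects the abstract's point that it summarizes the whole screening process.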

Subjects :: Humans
Bayes Theorem
Systematic Reviews as Topic
Computer Simulation
Software
Machine Learning

Details

Language :: English
ISSN :: 2046-4053
Volume :: 12
Issue :: 1
Database :: MEDLINE
Journal :: Systematic reviews
Publication Type :: Academic Journal
Accession number :: 37340494
Full Text :: https://doi.org/10.1186/s13643-023-02257-7
