Start Over

Sampling Query Variations for Learning to Rank to Improve Automatic Boolean Query Generation in Systematic Reviews

Authors :: Bevan Koopman
Harrisen Scells
Guido Zuccon
Mohamed A. Sharaf
Source :: WWW
Publication Year :: 2020
Publisher :: ACM, 2020.
Abstract: Searching medical literature for synthesis in a systematic review is a complex and labour intensive task. In this context, expert searchers construct lengthy Boolean queries. The universe of possible query variations can be massive: a single query can be composed of hundreds of field-restricted search terms/phrases or ontological concepts, each grouped by a logical operator nested to depths of sometimes five or more levels deep. With the many choices about how to construct a query, it is difficult to both formulate and recognise effective queries. To address this challenge, automatic methods have recently been explored for generating and selecting effective Boolean query variations for systematic reviews. The limiting factor of these methods is that it is computationally infeasible to process all query variations for training the methods. To overcome this, we propose novel query variation sampling methods for training Learning to Rank models to rank queries. Our results show that query sampling methods do directly impact the ability of a Learning to Rank model to effectively identify good query variations. Thus, selecting appropriate query sampling methods is a key problem for the automatic reformulation of effective Boolean queries for systematic review literature search. We find that the best sampling strategies are those which balance the diversity of queries with the quantity of queries.

Subjects :: Information retrieval
Computer science
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
Rank (computer programming)
Sampling (statistics)
Context (language use)
02 engineering and technology
Construct (python library)
03 medical and health sciences
Task (computing)
0302 clinical medicine
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Ontology
Learning to rank
030212 general & internal medicine
Boolean conjunctive query

Details

Database :: OpenAIRE
Journal :: Proceedings of The Web Conference 2020
Accession number :: edsair.doi...........10940dcf3ffeca28dcb423c58a7e1ea8
Full Text :: https://doi.org/10.1145/3366423.3380075

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Sampling Query Variations for Learning to Rank to Improve Automatic Boolean Query Generation in Systematic Reviews

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Sampling Query Variations for Learning to Rank to Improve Automatic Boolean Query Generation in Systematic Reviews

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources