Back to Search Start Over

Multi-stage enhanced representation learning for document reranking based on query view.

Authors :
Liu, Hai
Zhu, Xiaozhi
Tang, Yong
He, Chaobo
Hao, Tianyong
Source :
World Wide Web. Sep2024, Vol. 27 Issue 5, p1-17. 17p.
Publication Year :
2024

Abstract

The large-size language model is able to implicitly extract informative semantic features from queries and candidate documents to achieve impressive reranking performance. However, the large model relies on its own large number of parameters to achieve it and it is not known exactly what semantic information has been learned. In this paper, we propose a multi-stage enhanced representation learning method based on Query-View (MERL) with Intra-query stage and Inter-query stage to guide the model to explicitly learn the semantic relationship between the query and documents. In the Intra-query training stage, a content-based contrastive learning module without considering the special token [CLS] of BERT is utilized to optimize the semantic similarity of query and relevant documents. In the Inter-query training stage, an entity-oriented masked query prediction for establish a semantic relation of query-document pairs and an Inter-query contrastive learning module for extracting similar matching pattern of query-relevant documents are employed. Extensive experiments on MS MARCO passage ranking and TREC DL datasets show that the MERL method obtain significant improvements with a low number of parameters compared to the baseline models. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1386145X
Volume :
27
Issue :
5
Database :
Academic Search Index
Journal :
World Wide Web
Publication Type :
Academic Journal
Accession number :
179151496
Full Text :
https://doi.org/10.1007/s11280-024-01296-x