Back to Search Start Over

Regularizing ad hoc retrieval scores

Authors :
Fernando Diaz
Source :
CIKM
Publication Year :
2005
Publisher :
ACM, 2005.

Abstract

The cluster hypothesis states: closely related documents tend to be relevant to the same request. We exploit this hypothesis directly by adjusting ad hoc retrieval scores from an initial retrieval so that topically related documents receive similar scores. We refer to this process as score regularization. Score regularization can be presented as an optimization problem, allowing the use of results from semi-supervised learning. We demonstrate that regularized scores consistently and significantly rank documents better than unregularized scores, given a variety of initial retrieval algorithms. We evaluate our method on two large corpora across a substantial number of topics.

Details

Database :
OpenAIRE
Journal :
Proceedings of the 14th ACM international conference on Information and knowledge management
Accession number :
edsair.doi...........7d8492b30ab8221cede93cadb77655b2
Full Text :
https://doi.org/10.1145/1099554.1099722