1. Optimization of an integrated model for automatic reduction and expansion of long queries
- Author
-
Song, Dawei, Shi, Yanjie, Zhang, Peng, Hou, Yuexian, Hu, Bin, Jia, Yuan, Huang, Qiang, Kruschwitz, Udo, De Roeck, Anne, and Bruza, Peter
- Subjects
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL - Abstract
A long query provides more useful hints for searching relevant documents, but it is likely to introduce noise which affects retrieval performance. In order to smooth such adverse effect, it is important to reduce noisy terms, introduce and boost additional relevant terms. This paper presents a comprehensive framework, called Aspect Hidden Markov Model (AHMM), which integrates query reduction and expansion, for retrieval with long queries. It optimizes the probability distribution of query terms by utilizing intra-query term dependencies as well as the relationships between query terms and words observed in relevance feedback documents. Empirical evaluation on three large-scale TREC collections demonstrates that our approach, which is automatic, achieves salient improvements over various strong baselines, and also reaches a comparable performance to a state of the art method based on user’s interactive query term reduction and expansion.
- Published
- 2013