1. Characterizing, predicting, and handling web search queries that match very few or no results
- Author
-
Roi Blanco, Rifat Ozcan, B. Barla Cambazoglu, Erdem Sarigil, Özgür Ulusoy, Ismail Sengor Altingovde, Ulusoy, Özgür, and Sarıgil, Erdem
- Subjects
Information Systems and Management ,Information retrieval ,Web search query ,Computer Networks and Communications ,Computer science ,05 social sciences ,02 engineering and technology ,Range query (database) ,Library and Information Sciences ,computer.software_genre ,Query language ,Spatial query ,Query expansion ,Search engine ,Web query classification ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Queries per second ,Data mining ,0509 other social sciences ,050904 information & library sciences ,computer ,Information Systems - Abstract
A non‐negligible fraction of user queries end up with very few or even no matching results in leading commercial web search engines. In this work, we provide a detailed characterization of such queries and show that search engines try to improve such queries by showing the results of related queries. Through a user study, we show that these query suggestions are usually perceived as relevant. Also, through a query log analysis, we show that the users are dissatisfied after submitting a query that match no results at least 88.5% of the time. As a first step towards solving these no‐answer queries, we devised a large number of features that can be used to identify such queries and built machine‐learning models. These models can be useful for scenarios such as the mobile‐ or meta‐search, where identifying a query that will retrieve no results at the client device (i.e., even before submitting it to the search engine) may yield gains in terms of the bandwidth usage, power consumption, and/or monetary costs. Experiments over query logs indicate that, despite the heavy skew in class sizes, our models achieve good prediction quality, with accuracy (in terms of area under the curve) up to 0.95.
- Published
- 2017