Aggregate Estimation in Hidden Databases with Checkbox Interfaces.

Authors :
Yan, Hui
Gong, Zhiguo
Zhang, Nan
Huang, Tao
Zhong, Hua
Wei, Jun
Source :
IEEE Transactions on Knowledge & Data Engineering. May 2015, Vol. 27 Issue 5, p1192-1204. 13p.
Publication Year :
2015

Abstract

A large number of web data repositories are hidden behind restrictive web interfaces, making it an important challenge to enable data analytics over these hidden web databases. Most existing techniques assume a form-like web interface which consists solely of categorical attributes (or numeric ones that can be discretized). Nonetheless, many real-world web interfaces (of hidden databases) also feature checkbox interfaces—e.g., the specification of a set of desired features, such as A/C, navigation, etc., for a car-search website like Yahoo! Autos. We find that, for the purpose of data analytics, such checkbox-represented attributes differ fundamentally from the categorical/numerical ones that were traditionally studied. In this paper, we address the problem of data analytics over hidden databases with checkbox interfaces. Extensive experiments on both synthetic and real datasets demonstrate the accuracy and efficiency of our proposed algorithms. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
1041-4347
Volume :
27
Issue :
5
Database :
Academic Search Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
101862718
Full Text :
https://doi.org/10.1109/TKDE.2014.2365800