Back to Search
Start Over
TKAP: Efficiently processing top-k query on massive data by adaptive pruning
- Source :
- Knowledge and Information Systems. 47:301-328
- Publication Year :
- 2015
- Publisher :
- Springer Science and Business Media LLC, 2015.
-
Abstract
- In many applications, top-k query is an important operation to return a set of interesting points in a potentially huge data space. The existing algorithms, either maintaining too many candidates, or requiring assistant structures built on the specific attribute subset, or returning results with probabilistic guarantee, cannot process top-k query on massive data efficiently. This paper proposes a sorted-list-based TKAP algorithm, which utilizes some data structures of low space overhead, to efficiently compute top-k results on massive data. In round-robin retrieval on sorted lists, TKAP performs adaptive pruning operation and maintains the required candidates until the stop condition is satisfied. The adaptive pruning operation can be adjusted by the information obtained in round-robin retrieval to achieve a better pruning effect. The adaptive pruning rule is developed in this paper, along with its theoretical analysis. The extensive experimental results, conducted on synthetic and real-life data sets, show the significant advantage of TKAP over the existing algorithms.
- Subjects :
- Computer science
Probabilistic logic
Process (computing)
0102 computer and information sciences
02 engineering and technology
Space (commercial competition)
Data structure
computer.software_genre
01 natural sciences
Human-Computer Interaction
Set (abstract data type)
010201 computation theory & mathematics
Artificial Intelligence
Hardware and Architecture
Principal variation search
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Overhead (computing)
Pruning (decision trees)
Data mining
computer
Software
Information Systems
Subjects
Details
- ISSN :
- 02193116 and 02191377
- Volume :
- 47
- Database :
- OpenAIRE
- Journal :
- Knowledge and Information Systems
- Accession number :
- edsair.doi...........4ca9b288b7a16e17605b2855415d189c
- Full Text :
- https://doi.org/10.1007/s10115-015-0836-5