Back to Search Start Over

An architecture for recycling intermediates in a column-store

Authors :
Ivanova, Milena
Kersten, Martin
Nes, Niels
Pereira Goncalves, Romulo Antonio
Database Architectures
Standardization and Knowledge Transfer
name unknown
Source :
ACM Transactions on Database Systems, 35(4), 1-41
Publication Year :
2009
Publisher :
ACM, 2009.

Abstract

Automatically recycling (intermediate) results is a grand challenge for state-of-the-art databases to improve both query response time and throughput. Tuples are loaded and streamed through a tuple-at-a-time processing pipeline avoiding materialization of intermediates as much as possible. This limits the opportunities for reuse of overlapping computations to DBA-defined materialized views and function/result cache tuning. In contrast, the operator-at-a-time execution paradigm produces fully materialized results in each step of the query plan. To avoid resource contention, these intermediates are evicted as soon as possible. In this paper we study an architecture that harvests the by-products of the operator-at-a-time paradigm in a column store system using a lightweight mechanism, the recycler. The key challenge then becomes selection of the policies to admit intermediates to the resource pool, their retention period, and the eviction strategy when facing resource limitations. The proposed recycling architecture has been implemented in an open-source system. An experimental analysis against the TPC-H ad-hoc decision support benchmark and a complex, real-world application (SkyServer) demonstrates its effectiveness in terms of self-organizing behavior and its significant performance gains. The results indicate the potentials of recycling intermediates and charters a route for further development of database kernels.

Details

Language :
English
ISSN :
03625915
Database :
OpenAIRE
Journal :
ACM Transactions on Database Systems, 35(4), 1-41
Accession number :
edsair.dedup.wf.001..2e0414b280be9cca4d3a85aef89244b2