Back to Search Start Over

NeuroCard

Authors :
Sifei Luan
Zongheng Yang
Xi Chen
Ion Stoica
Yan Duan
Amog Kamsetty
Eric Liang
Source :
Proceedings of the VLDB Endowment. 14:61-73
Publication Year :
2020
Publisher :
Association for Computing Machinery (ACM), 2020.

Abstract

Query optimizers rely on accurate cardinality estimates to produce good execution plans. Despite decades of research, existing cardinality estimators are inaccurate for complex queries, due to making lossy modeling assumptions and not capturing inter-table correlations. In this work, we show that it is possible to learn the correlations across all tables in a database without any independence assumptions. We present NeuroCard, a join cardinality estimator that builds a single neural density estimator over an entire database. Leveraging join sampling and modern deep autoregressive models, NeuroCard makes no inter-table or inter-column independence assumptions in its probabilistic modeling. NeuroCard achieves orders of magnitude higher accuracy than the best prior methods (a new state-of-the-art result of 8.5$\times$ maximum error on JOB-light), scales to dozens of tables, while being compact in space (several MBs) and efficient to construct or update (seconds to minutes).<br />Comment: VLDB 2021

Details

ISSN :
21508097
Volume :
14
Database :
OpenAIRE
Journal :
Proceedings of the VLDB Endowment
Accession number :
edsair.doi.dedup.....9c2baa6ff6e280d868a4ead2a0c6c989
Full Text :
https://doi.org/10.14778/3421424.3421432