Back to Search
Start Over
NeuroCard
- Source :
- Proceedings of the VLDB Endowment. 14:61-73
- Publication Year :
- 2020
- Publisher :
- Association for Computing Machinery (ACM), 2020.
-
Abstract
- Query optimizers rely on accurate cardinality estimates to produce good execution plans. Despite decades of research, existing cardinality estimators are inaccurate for complex queries, due to making lossy modeling assumptions and not capturing inter-table correlations. In this work, we show that it is possible to learn the correlations across all tables in a database without any independence assumptions. We present NeuroCard, a join cardinality estimator that builds a single neural density estimator over an entire database. Leveraging join sampling and modern deep autoregressive models, NeuroCard makes no inter-table or inter-column independence assumptions in its probabilistic modeling. NeuroCard achieves orders of magnitude higher accuracy than the best prior methods (a new state-of-the-art result of 8.5$\times$ maximum error on JOB-light), scales to dozens of tables, while being compact in space (several MBs) and efficient to construct or update (seconds to minutes).<br />Comment: VLDB 2021
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
Computer science
General Engineering
Probabilistic logic
Sampling (statistics)
Estimator
Databases (cs.DB)
02 engineering and technology
Lossy compression
Machine Learning (cs.LG)
Computer Science - Databases
Autoregressive model
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Join (sigma algebra)
020201 artificial intelligence & image processing
Cardinality (SQL statements)
Algorithm
Independence (probability theory)
Subjects
Details
- ISSN :
- 21508097
- Volume :
- 14
- Database :
- OpenAIRE
- Journal :
- Proceedings of the VLDB Endowment
- Accession number :
- edsair.doi.dedup.....9c2baa6ff6e280d868a4ead2a0c6c989
- Full Text :
- https://doi.org/10.14778/3421424.3421432