Database Paper Browser

Database Learning: Toward a Database that Becomes Smarter Every Time

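Summary: Database learning is an approach to approximate query processing (AQP) in which the system improves with each query by learning from past answers and exploiting the underlying distribution those answers share. The prototype, Verdict, runs on Spark SQL and uses the principle of maximum entropy to yield tighter estimates; on real query traces it covers 73.7% of queries and delivers speedups of up to 23x. (summarized by gpt-5-nano on Feb 09 2026)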

Paper ID: 5389
Venue: SIGMOD
Year: 2017
Pagerank: 8.4909562e-05
Overall Rank: 2,588 | 82.00%
DOI: 10.1145/3035918.3064013

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
608 | DeepDB: Learn from Data, not from Queries! | 2020 | VLDB | 0.00019235898
801 | SageDB: A Learned Database System | 2019 | CIDR | 0.00016505496
1,204 | VerdictDB: Universalizing Approximate Query Processing | 2018 | SIGMOD | 0.00013319541
1,350 | Northstar: An Interactive Data Science System | 2018 | VLDB | 0.00012431059
1,737 | QuickSel: Quick Selectivity Learning with Mixture Models | 2020 | SIGMOD | 0.00010720294
2,501 | DBEst: Revisiting Approximate Query Processing Engines with Machine Learning Models | 2019 | SIGMOD | 8.6453446e-05
2,762 | FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation | 2021 | VLDB | 8.1585394e-05
3,499 | Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation | 2021 | VLDB | 7.0376445e-05
3,944 | AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics | 2018 | SIGMOD | 6.6078243e-05
4,030 | Revisiting Reuse for Approximate Query Processing | 2017 | VLDB | 6.5129665e-05
4,375 | Sample Debiasing in the Themis Open World Database System | 2020 | SIGMOD | 6.2427076e-05
5,719 | Survivability of Cloud Databases - Factors and Prediction | 2018 | SIGMOD | 5.3550742e-05
5,806 | BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees | 2019 | SIGMOD | 5.3200643e-05
5,951 | PGMJoins: Random Join Sampling with Graphical Models | 2021 | SIGMOD | 5.2592385e-05
6,233 | Mosaic: A Sample-Based Database System for Open World Query Processing | 2020 | CIDR | 5.1451876e-05
6,411 | Approximate Query Engines: Commercial Challenges and Research Opportunities | 2017 | SIGMOD | 5.0752468e-05
6,493 | Joins on Samples: A Theoretical Guide for Practitioners | 2020 | VLDB | 5.0424713e-05
6,740 | Combining Aggregation and Sampling (Nearly) Optimally for Approximate Query Processing | 2021 | SIGMOD | 4.944395e-05
8,080 | Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines | 2024 | VLDB | 4.5911668e-05
8,643 | One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees | 2022 | SIGMOD | 4.4777916e-05
10,481 | FAAQP: Fast and Accurate Approximate Query Processing based on Bitmap-augmented Sum-Product Network | 2025 | SIGMOD | 4.1945683e-05
11,194 | A Step Toward Deep Online Aggregation | 2023 | SIGMOD | 4.1945683e-05
11,453 | XLJoins | 2021 | SIGMOD | 4.1945683e-05
11,487 | Toto - Benchmarking the Efficiency of a Cloud Service | 2021 | SIGMOD | 4.1945683e-05
11,552 | BitGourmet: Deterministic Approximation via Optimized Bit Selection | 2020 | CIDR | 4.1945683e-05
11,585 | Demonstration of BitGourmet: Data Analysis via Deterministic Approximation | 2020 | SIGMOD | 4.1945683e-05
11,650 | Query-Driven Learning for Next Generation Predictive Modeling & Analytics | 2019 | SIGMOD | 4.1945683e-05
11,711 | Demonstration of VerdictDB, the Platform-Independent AQP System | 2018 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 40 of 40 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
14 | Online Aggregation | 1997 | SIGMOD | 0.0010801504
66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801
158 | Automated Selection of Materialized Views and Indexes for SQL Databases | 2000 | VLDB | 0.00040071492
211 | Join Synopses for Approximate Query Answering | 1999 | SIGMOD | 0.00033981214
408 | Database Cracking | 2007 | CIDR | 0.00023953844
429 | The Aqua Approximate Query Answering System | 1999 | SIGMOD | 0.00023476494
460 | SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics | 2015 | VLDB | 0.00022516069
848 | Approximate Counts and Quantiles over Sliding Windows | 2004 | PODS | 0.0001597308
967 | Aqua: A Fast Decision Support System Using Approximate Query Answers | 1999 | VLDB | 0.00014959939
1,137 | User-adaptive exploration of multidimensional data | 2000 | VLDB | 0.00013730532
1,152 | Blink and It's Done: Interactive Queries on Very Large Data | 2012 | VLDB | 0.00013645792
1,260 | Dynamic Sample Selection for Approximate Query Processing | 2003 | SIGMOD | 0.00012993347
1,323 | Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters | 2016 | SIGMOD | 0.00012601997
1,335 | ICICLES: Self-tuning Samples for Approximate Query Answering | 2000 | VLDB | 0.00012502131
1,464 | Online Aggregation for Large MapReduce Jobs | 2011 | VLDB | 0.00011865546
1,587 | Dynamic Prefetching of Data Tiles for Interactive Visualization | 2016 | SIGMOD | 0.00011245116
1,874 | Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems | 2014 | SIGMOD | 0.00010244443
1,909 | SciBORQ: Scientific data management with Bounds On Runtime and Quality | 2011 | CIDR | 0.00010121304
2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05
2,229 | Self-organizing Tuple Reconstruction in Column-stores | 2009 | SIGMOD | 9.2350274e-05
2,355 | G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data | 2015 | SIGMOD | 8.9677847e-05
2,365 | The Analytical Bootstrap: a New Method for Fast Error Estimation in Approximate Query Processing | 2014 | SIGMOD | 8.9551432e-05
2,616 | DAQ: A New Paradigm for Approximate Query Processing | 2015 | VLDB | 8.4471955e-05
2,733 | The Case for Data Visualization Management Systems [Vision Paper] | 2014 | VLDB | 8.2078862e-05
3,167 | Relational Confidence Bounds Are Easy With The Bootstrap* | 2005 | SIGMOD | 7.4523397e-05
3,333 | SnappyData: A Unified Cluster for Streaming, Transactions, and Interactive Analytics | 2017 | CIDR | 7.2093479e-05
3,594 | Continuous Sampling for Online Aggregation Over Multiple Queries | 2010 | SIGMOD | 6.9381343e-05
3,842 | Turbo-Charging Estimate Convergence in DBO | 2009 | VLDB | 6.7102374e-05
4,052 | Interactive Analysis of Web-Scale Data | 2009 | CIDR | 6.4936745e-05
5,224 | Neighbor-Sensitive Hashing | 2016 | VLDB | 5.6197981e-05
5,262 | SnappyData: A Hybrid Transactional Analytical Store Built On Spark | 2016 | SIGMOD | 5.5977349e-05
5,376 | Holistic Indexing in Main-memory Column-stores | 2015 | SIGMOD | 5.5417421e-05
5,581 | CliffGuard: A Principled Framework for Finding Robust Database Designs | 2015 | SIGMOD | 5.424205e-05
5,815 | StatAdvisor: Recommending Statistical Views | 2009 | VLDB | 5.3165295e-05
5,868 | ABS: a System for Scalable Approximate Queries with Accuracy Guarantees | 2014 | SIGMOD | 5.2959352e-05
6,169 | Approximate Lifted Inference with Probabilistic Databases | 2015 | VLDB | 5.1716068e-05
6,400 | iOLAP: Managing Uncertainty for Efficient Incremental OLAP | 2016 | SIGMOD | 5.0803518e-05
6,411 | Approximate Query Engines: Commercial Challenges and Research Opportunities | 2017 | SIGMOD | 5.0752468e-05
7,085 | Querying Big Data by Accessing Small Data | 2015 | PODS | 4.8388174e-05
13,354 | Verdict: A System for Stochastic Query Planning | 2015 | CIDR | -

Semantically Similar Papers