Database Paper Browser

Towards a Learning Optimizer for Shared Clouds

Summary: CARDLEARNER learns cardinality models from past job runs in a shared cloud, using subgraph templates to produce accurate estimates. It explores join-order variations to curb learning bias, trains many small models, and feeds predictions back into future runs, achieving a 5x reduction in estimation error and 2-3x speedups. (summarized by gpt-5-nano on Feb 09 2026)
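
The idea of keeping many small models, one per subgraph template, can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class name, the string template keys, and the log-linear fit are hypothetical stand-ins and are not taken from the paper, which may use different features and model families.

```python
import math
from collections import defaultdict


class TemplateCardinalityModel:
    """Hypothetical sketch: one tiny regression model per subgraph template.

    Each model fits log(actual_rows) as a linear function of log(input_rows)
    using ordinary least squares. Row counts must be positive.
    """

    def __init__(self, min_samples=3):
        self.min_samples = min_samples
        self.history = defaultdict(list)  # template -> [(log_in, log_out)]
        self.params = {}                  # template -> (slope, intercept)

    def observe(self, template, input_rows, actual_rows):
        # Record the outcome of a past run for this template.
        self.history[template].append(
            (math.log(input_rows), math.log(actual_rows))
        )

    def fit(self):
        # Train a separate small model for every template with enough history.
        for template, points in self.history.items():
            if len(points) < self.min_samples:
                continue
            n = len(points)
            mean_x = sum(x for x, _ in points) / n
            mean_y = sum(y for _, y in points) / n
            var_x = sum((x - mean_x) ** 2 for x, _ in points)
            if var_x == 0:
                # All inputs had the same size: predict the mean output.
                slope, intercept = 0.0, mean_y
            else:
                cov = sum((x - mean_x) * (y - mean_y) for x, y in points)
                slope = cov / var_x
                intercept = mean_y - slope * mean_x
            self.params[template] = (slope, intercept)

    def estimate(self, template, input_rows, default_estimate):
        # Fall back to the optimizer's default when no model was learned.
        if template not in self.params:
            return default_estimate
        slope, intercept = self.params[template]
        return math.exp(intercept + slope * math.log(input_rows))


# Usage: a filter template that keeps one tenth of its input rows.
model = TemplateCardinalityModel()
for rows in (1000, 2000, 4000):
    model.observe("filter(region='EU')", rows, rows // 10)
model.fit()
estimate = model.estimate("filter(region='EU')", 8000, default_estimate=8000.0)
fallback = model.estimate("unseen-template", 500, default_estimate=123.0)
```

In this sketch a template with too little history simply keeps the default estimate, so a learned prediction is used only where past runs support it.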

Paper ID: 11930
Venue: VLDB
Year: 2019
Pagerank: 9.5834572e-05
Overall Rank: 2,083 | 85.52%
DOI: 10.14778/3291264.3291267

Incoming Citations (Sorted by Pagerank)

Showing 44 of 44 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
608 | DeepDB: Learn from Data, not from Queries! | 2020 | VLDB | 0.00019235898
758 | Deep Unsupervised Cardinality Estimation | 2020 | VLDB | 0.0001706608
910 | NeuroCard: One Cardinality Estimator for All Tables | 2021 | VLDB | 0.00015423056
1,638 | Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation | 2022 | VLDB | 0.00011049779
2,121 | Balsa: Learning a Query Optimizer Without Expert Demonstrations | 2022 | SIGMOD | 9.5017232e-05
2,552 | Updatable Learned Index with Precise Positions | 2021 | VLDB | 8.5530411e-05
2,762 | FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation | 2021 | VLDB | 8.1585394e-05
2,783 | Flow-Loss: Learning Cardinality Estimates That Matter | 2021 | VLDB | 8.1293383e-05
3,248 | A Learned Query Rewrite System using Monte Carlo Tree Search | 2022 | VLDB | 7.3258782e-05
3,473 | AI Meets Database: AI4DB and DB4AI | 2021 | SIGMOD | 7.062864e-05
3,625 | Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings | 2020 | SIGMOD | 6.9055212e-05
3,725 | Estimating Cardinalities with Deep Sketches | 2019 | SIGMOD | 6.8170734e-05
3,828 | Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction | 2022 | VLDB | 6.7208524e-05
3,875 | Cloudy with High Chance of DBMS: A 10-year Prediction for Enterprise-Grade ML | 2020 | CIDR | 6.675257e-05
3,924 | A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation | 2021 | SIGMOD | 6.6271553e-05
3,954 | Efficiently Approximating Selectivity Functions using Low Overhead Regression Models | 2020 | VLDB | 6.5926838e-05
3,990 | FactorJoin: A New Cardinality Estimation Framework for Join Queries | 2023 | SIGMOD | 6.5581983e-05
4,097 | The Case for a Learned Sorting Algorithm | 2020 | SIGMOD | 6.4551616e-05
4,399 | HUNTER: An Online Cloud Database Hybrid Tuning System for Personalized Requirements | 2022 | SIGMOD | 6.2225151e-05
4,446 | Stable Learned Bloom Filters for Data Streams | 2020 | VLDB | 6.1800659e-05
4,590 | MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems | 2021 | SIGMOD | 6.0620053e-05
4,690 | Deploying a Steered Query Optimizer in Production at Microsoft | 2022 | SIGMOD | 5.997226e-05
5,368 | Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing | 2022 | VLDB | 5.5457532e-05
5,401 | ALECE: An Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads | 2024 | VLDB | 5.5285035e-05
5,489 | To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams | 2021 | SIGMOD | 5.4782335e-05
5,942 | SAM: Database Generation from Query Workloads with Supervised Autoregressive Models | 2022 | SIGMOD | 5.2634242e-05
6,040 | Steering Query Optimizers: A Practical Take on Big Data Workloads | 2021 | SIGMOD | 5.2412035e-05
6,230 | Learned Approximate Query Processing: Make it Light, Accurate and Fast | 2021 | CIDR | 5.145989e-05
6,261 | The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward | 2021 | VLDB | 5.1350714e-05
7,467 | Yannakakis+: Practical Acyclic Query Evaluation with Theoretical Guarantees | 2025 | SIGMOD | 4.7218691e-05
7,655 | Machine Learning for Cloud Data Systems: the Progress so far and the Path Forward | 2021 | VLDB | 4.6872456e-05
7,684 | AutoToken: Predicting Peak Parallelism for Big Data Analytics at Microsoft | 2020 | VLDB | 4.6796855e-05
7,828 | Modeling Shifting Workloads for Learned Database Systems | 2024 | SIGMOD | 4.6407986e-05
8,131 | Sibyl: Forecasting Time-Evolving Query Workloads | 2024 | SIGMOD | 4.5784634e-05
8,197 | SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft | 2021 | VLDB | 4.5607121e-05
8,220 | PerfGuard: Deploying ML-for-Systems without Performance Regressions, Almost! | 2021 | VLDB | 4.5557328e-05
8,416 | Towards Building Autonomous Data Services on Azure | 2023 | SIGMOD | 4.5196199e-05
8,582 | Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Ecosystem: Can One QO Rule Them All? | 2025 | CIDR | 4.492033e-05
9,194 | Phoebe: A Learning-based Checkpoint Optimizer | 2021 | VLDB | 4.3761777e-05
9,213 | PACE: Poisoning Attacks on Learned Cardinality Estimation | 2024 | SIGMOD | 4.3721075e-05
9,812 | A Practical Theory of Generalization in Selectivity Learning | 2025 | VLDB | 4.2783272e-05
9,878 | PRICE: A Pretrained Model for Cross-Database Cardinality Estimation | 2025 | VLDB | 4.2656547e-05
10,859 | Graph Transformers for Query Plan Representation: Potentials and Challenges | 2025 | VLDB | 4.1945683e-05
10,995 | Understanding and Reusing Test Suites Across Database Systems | 2024 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 19 of 19 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103
22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613
70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166
71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975
99 | On the Propagation of Errors in the Size of Join Results | 1991 | SIGMOD | 0.00050022914
102 | The Case for Learned Index Structures | 2018 | SIGMOD | 0.00049545203
182 | LEO - DB2's LEarning Optimizer | 2001 | VLDB | 0.00036962631
183 | Automatic Database Management System Tuning Through Large-scale Machine Learning | 2017 | SIGMOD | 0.00036721403
394 | An Adaptive Query Execution System for Data Integration | 1999 | SIGMOD | 0.00024460855
650 | Robust Query Processing through Progressive Optimization | 2004 | SIGMOD | 0.00018659177
1,323 | Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters | 2016 | SIGMOD | 0.00012601997
1,477 | Fine-grained Partitioning for Aggressive Data Skipping | 2014 | SIGMOD | 0.00011770865
3,038 | Azure Data Lake Store: A Hyperscale Distributed File Service for Big Data Analytics | 2017 | SIGMOD | 7.6717218e-05
4,174 | Computation Reuse in Analytics Job Service at Microsoft | 2018 | SIGMOD | 6.3856219e-05
5,014 | Dynamically Optimizing Queries over Large Scale Data Platforms | 2014 | SIGMOD | 5.7586174e-05
5,297 | Continuous Cloud-Scale Query Optimization and Processing | 2013 | VLDB | 5.5801669e-05
5,668 | A Pay-As-You-Go Framework for Query Execution Feedback | 2008 | VLDB | 5.3806337e-05
7,465 | Non-Invasive Progressive Optimization for In-Memory Databases | 2016 | VLDB | 4.7228742e-05
8,725 | A Fast Randomized Algorithm for Multi-Objective Query Optimization | 2016 | SIGMOD | 4.4600243e-05

Semantically Similar Papers