Back to papers
Active Learning for ML Enhanced Database Systems
Summary: ADCP uses active learning to collect deployment data. HAL fuses signals to guide data gathering under varying budgets, delivering up to 2x prediction performance at the same cost and 75% error reduction with ~100 extra queries on production workloads.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 5983
- Venue
- SIGMOD
- Year
- 2020
- Pagerank
- 7.4815444e-05
- Overall Rank
- 3,142 | 78.15%
- DOI
-
10.1145/3318464.3389768
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 28 of 28 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 2,985 |
DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems |
2021 |
VLDB |
7.7795847e-05 |
| 3,473 |
AI Meets Database: AI4DB and DB4AI |
2021 |
SIGMOD |
7.062864e-05 |
| 3,750 |
Data Acquisition for Improving Machine Learning Models |
2021 |
VLDB |
6.7895763e-05 |
| 3,778 |
A Learned Sketch for Subgraph Counting |
2021 |
SIGMOD |
6.7747398e-05 |
| 3,990 |
FactorJoin: A New Cardinality Estimation Framework for Join Queries |
2023 |
SIGMOD |
6.5581983e-05 |
| 4,240 |
Make Your Database System Dream of Electric Sheep: Towards Self-Driving Operation |
2021 |
VLDB |
6.3318228e-05 |
| 4,399 |
HUNTER: An Online Cloud Database Hybrid Tuning System for Personalized Requirements |
2022 |
SIGMOD |
6.2225151e-05 |
| 4,590 |
MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems |
2021 |
SIGMOD |
6.0620053e-05 |
| 4,913 |
UDO: Universal Database Optimization using Reinforcement Learning |
2021 |
VLDB |
5.8316231e-05 |
| 5,028 |
Adaptive Data Augmentation for Supervised Learning over Missing Data |
2021 |
VLDB |
5.7506746e-05 |
| 5,334 |
LEON: A New Framework for ML-Aided Query Optimization |
2023 |
VLDB |
5.5649836e-05 |
| 5,645 |
Warper: Efficiently Adapting Learned Cardinality Estimators to Data and Workload Drifts |
2022 |
SIGMOD |
5.3923454e-05 |
| 5,861 |
Machine Learning for Databases |
2021 |
VLDB |
5.298883e-05 |
| 6,519 |
Expand your Training Limits! Generating Training Data for ML-based Data Management |
2021 |
SIGMOD |
5.0316686e-05 |
| 6,775 |
A Unified Transferable Model for ML-Enhanced DBMS |
2022 |
CIDR |
4.9299192e-05 |
| 6,879 |
Detect, Distill and Update: Learned DB Systems Facing Out of Distribution Data |
2023 |
SIGMOD |
4.8971368e-05 |
| 7,296 |
Multi-Tenant Cloud Data Services: State-of-the-Art, Challenges and Opportunities |
2022 |
SIGMOD |
4.7723197e-05 |
| 7,828 |
Modeling Shifting Workloads for Learned Database Systems |
2024 |
SIGMOD |
4.6407986e-05 |
| 8,009 |
CAMAL: Optimizing LSM-trees via Active Learning |
2024 |
SIGMOD |
4.6066863e-05 |
| 8,020 |
The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions |
2024 |
VLDB |
4.6040862e-05 |
| 8,615 |
The Case for NLP-Enhanced Database Tuning: Towards Tuning Tools that "Read the Manual" |
2021 |
VLDB |
4.484683e-05 |
| 8,774 |
Tiresias: Enabling Predictive Autonomous Storage and Indexing |
2022 |
VLDB |
4.4559995e-05 |
| 9,006 |
Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems |
2024 |
VLDB |
4.4101482e-05 |
| 9,108 |
BASE: Bridging the Gap between Cost and Latency for Query Optimization |
2023 |
VLDB |
4.3950066e-05 |
| 9,467 |
Database Gyms |
2023 |
CIDR |
4.3346412e-05 |
| 9,917 |
Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes |
2023 |
VLDB |
4.2561557e-05 |
| 10,217 |
This is Going to Sound Crazy, But What If We Used Large Language Models to Boost Automatic Database Tuning Algorithms By Leveraging Prior History? We Will Find Better Configurations More Quickly Than Retraining From Scratch! |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,955 |
Data Acquisition for Improving Model Confidence |
2024 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 18 of 18 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 8,220 |
PerfGuard: Deploying ML-for-Systems without Performance Regressions, Almost! |
2021 |
VLDB |
4.5557328e-05 |
| 8,847 |
Towards Foundation Database Models |
2025 |
CIDR |
4.4371897e-05 |
| 9,852 |
Machine Unlearning in Learned Databases: An Experimental Analysis |
2024 |
SIGMOD |
4.2714575e-05 |
| 9,776 |
Structure-Aware Machine Learning over Multi-Relational Databases |
2021 |
SIGMOD |
4.2856106e-05 |
| 5,258 |
One Model to Rule them All: Towards Zero-Shot Learning for Databases |
2022 |
CIDR |
5.5998705e-05 |
| 608 |
DeepDB: Learn from Data, not from Queries! |
2020 |
VLDB |
0.00019235898 |
| 6,775 |
A Unified Transferable Model for ML-Enhanced DBMS |
2022 |
CIDR |
4.9299192e-05 |
| 5,337 |
Learned Index Benefits: Machine Learning Based Index Performance Estimation |
2022 |
VLDB |
5.5635208e-05 |
| 5,861 |
Machine Learning for Databases |
2021 |
VLDB |
5.298883e-05 |
| 4,758 |
Optimization for Active Learning-based Interactive Database Exploration |
2019 |
VLDB |
5.9422515e-05 |