Back to papers
DISTILL: Low-Overhead Data-Driven Techniques for Filtering and Costing Indexes for Scalable Index Tuning
Summary: DISTILL enables index tuning via pattern-based pruning of spurious, rule-based indexes to cut optimizer calls. It learns cost models via workload similarity across configs to estimate costs for many candidates, enabling up to 12x faster tuning with high quality.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 12699
- Venue
- VLDB
- Year
- 2022
- Pagerank
- 4.5998045e-05
- Overall Rank
- 8,041 | 44.07%
- DOI
-
10.14778/3547305.3547309
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 6,750 |
Breaking It Down: An In-depth Study of Index Advisors |
2024 |
VLDB |
4.9392771e-05 |
| 7,336 |
Refactoring Index Tuning Process with Benefit Estimation |
2024 |
VLDB |
4.7599411e-05 |
| 8,020 |
The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions |
2024 |
VLDB |
4.6040862e-05 |
| 9,006 |
Hit the Gym: Accelerating Query Execution to Efficiently Bootstrap Behavior Models for Self-Driving Database Management Systems |
2024 |
VLDB |
4.4101482e-05 |
| 9,929 |
Wred: Workload Reduction for Scalable Index Tuning |
2024 |
SIGMOD |
4.2510122e-05 |
| 9,930 |
Wii: Dynamic Budget Reallocation In Index Tuning |
2024 |
SIGMOD |
4.2510122e-05 |
| 10,032 |
Rainbow: Risk-aware Index Benefit Estimation Facing Out Of Distribution Workloads |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,125 |
Understanding and Detecting Query Performance Regression in Practical Index Tuning: [Experiments & Analysis] |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,205 |
RIB: Robust Learning-based Index Benefit Estimation |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,217 |
This is Going to Sound Crazy, But What If We Used Large Language Models to Boost Automatic Database Tuning Algorithms By Leveraging Prior History? We Will Find Better Configurations More Quickly Than Retraining From Scratch! |
2026 |
SIGMOD |
4.1945683e-05 |
| 10,543 |
Esc: An Early-Stopping Checker for Budget-aware Index Tuning |
2025 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 237 |
An Efficient, Cost-Driven Index Selection Tool for Microsoft SQL Server |
1997 |
VLDB |
0.00031726304 |
| 516 |
AutoAdmin "What-if" Index Analysis Utility |
1998 |
SIGMOD |
0.00021196031 |
| 661 |
Database Tuning Advisor for Microsoft SQL Server 2005 |
2004 |
VLDB |
0.00018481174 |
| 1,017 |
Automatic Physical Database Tuning: A Relaxation-based Approach |
2005 |
SIGMOD |
0.00014634307 |
| 1,070 |
Analyzing Plan Diagrams of Database Query Optimizers |
2005 |
VLDB |
0.00014316791 |
| 1,254 |
Selectivity Estimation for Range Predicates using Lightweight Models |
2019 |
VLDB |
0.00013027411 |
| 1,443 |
Compressing SQL Workloads |
2002 |
SIGMOD |
0.00011947004 |
| 1,647 |
Parametric Query Optimization for Linear and Piecewise Linear Cost Functions |
2002 |
VLDB |
0.00011033757 |
| 1,855 |
AI Meets AI: Leveraging Query Executions to Improve Index Recommendations |
2019 |
SIGMOD |
0.00010315245 |
| 1,962 |
Plan Selection based on Query Clustering |
2002 |
VLDB |
9.950467e-05 |
| 2,020 |
Magic mirror in my hand, which is the best in the land? An Experimental Evaluation of Index Selection Algorithms |
2020 |
VLDB |
9.762624e-05 |
| 2,047 |
Automatically Indexing Millions of Databases in Microsoft Azure SQL Database |
2019 |
SIGMOD |
9.6920209e-05 |
| 2,484 |
Efficient Use of the Query Optimizer for Automated Physical Design |
2007 |
VLDB |
8.6864279e-05 |
| 2,787 |
To Tune or not to Tune? A Lightweight Physical Design Alerter |
2006 |
VLDB |
8.1263608e-05 |
| 3,625 |
Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings |
2020 |
SIGMOD |
6.9055212e-05 |
| 3,954 |
Efficiently Approximating Selectivity Functions using Low Overhead Regression Models |
2020 |
VLDB |
6.5926838e-05 |
| 4,468 |
Comprehensive and Efficient Workload Compression |
2021 |
VLDB |
6.1584035e-05 |
| 5,686 |
Budget-aware Index Tuning with Reinforcement Learning |
2022 |
SIGMOD |
5.3712312e-05 |
| 6,366 |
ISUM: Efficiently Compressing Large and Complex Workloads for Scalable Index Tuning |
2022 |
SIGMOD |
5.0943443e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 10,125 |
Understanding and Detecting Query Performance Regression in Practical Index Tuning: [Experiments & Analysis] |
2026 |
SIGMOD |
4.1945683e-05 |
| 12,099 |
Efficient Indexing for Diverse Query Results |
2013 |
VLDB |
4.1945683e-05 |
| 11,100 |
Looking Deeply into the Magic Mirror: An Interactive Analysis of Database Index Selection Approaches |
2024 |
VLDB |
4.1945683e-05 |
| 7,336 |
Refactoring Index Tuning Process with Benefit Estimation |
2024 |
VLDB |
4.7599411e-05 |
| 10,748 |
Benchmarking Adaptive Multidimensional Indices |
2025 |
VLDB |
4.1945683e-05 |
| 5,319 |
DILI: A Distribution-Driven Learned Index |
2023 |
VLDB |
5.5713974e-05 |
| 237 |
An Efficient, Cost-Driven Index Selection Tool for Microsoft SQL Server |
1997 |
VLDB |
0.00031726304 |
| 5,686 |
Budget-aware Index Tuning with Reinforcement Learning |
2022 |
SIGMOD |
5.3712312e-05 |
| 1,855 |
AI Meets AI: Leveraging Query Executions to Improve Index Recommendations |
2019 |
SIGMOD |
0.00010315245 |
| 5,337 |
Learned Index Benefits: Machine Learning Based Index Performance Estimation |
2022 |
VLDB |
5.5635208e-05 |