VLDB 2023

List of 7 papers (repeated below) presented at VLDB '23

I’m excited to be part of seven different papers, demos, and workshop papers at VLDB in Vancouver this year!

AdaChain

Every permissioned blockchain architecture has different performance characteristics. AdaChain automatically switches between architectures to optimize performance online, compensating for changes in workload and network conditions.

AdaChain: A Learned Adaptive Blockchain
1. Chenyuan Wu
2. Bhavana Mehta
3. Mohammad Javad Amiri
4. Ryan Marcus
5. Boon Thau Loo
VLDB '23 (pdf) (doi)

AutoSteer

Bringing a learned steering optimizer to a new database can be difficult, since optimizers can have thousands of knobs. AutoSteer automatically finds a good set of knobs and optimizes your queries as well! We tested a large deployment of AutoSteer at Meta.

AutoSteer: Learned Query Optimization for Any SQL Database
1. Christoph Anneser
2. Nesime Tatbul
3. David Cohen
4. Zhenggang Xu
5. Prithviraj Pandian
6. Nikolay Laptev
7. Ryan Marcus
VLDB '23 (pdf) (doi)

QO-Insight

Alongside AutoSteer, we developed a tool called QO-Insight to help DBAs understand the decisions of learned query optimizers. We will present a demo of our tool, which enables side-by-side query plan analysis!

QO-Insight: Inspecting Steered Query Optimizers Demo.
1. Christoph Anneser
2. Mario Petruccelli
3. Nesime Tatbul
4. David Cohen
5. Zhenggang Xu
6. Prithviraj Pandian
7. Nikolay Laptev
8. Ryan Marcus
9. Alfons Kemper
VLDB '23 (pdf) (doi)

Robust cardinality estimation

Query-driven cardinality estimators learn powerful, workload-tailored strategies, but have a hard time dealing with data drift. We show robust techniques that can tune a learned cardinality estimator online, as data changes.

Robust Query Driven Cardinality Estimation under Changing Workloads
1. Parimarjan Negi
2. Ziniu Wu
3. Andreas Kipf
4. Nesime Tatbul
5. Ryan Marcus
6. Sam Madden
7. Tim Kraska
8. Mohammad Alizadeh
VLDB '23 (pdf) (doi)

SageDB

The culmination of several years of work on instance-optimized systems, SageDB is a prototype analytic database that, for the first time, combines several learned techniques in a single system.

(Technically, SageDB was published in the VLDB '22 proceedings, but the presentation is happening this year!)

SageDB: An Instance-Optimized Data Analytics System
1. Jialin Ding
2. Ryan Marcus
3. Andreas Kipf
4. Vikram Nathan
5. Aniruddha Nrusimha
6. Kapil Vaidya
7. Alexander van Renen
8. Tim Kraska
VLDB '22 (pdf) (doi)

RLShard

Almost every distributed transactional database today can tolerate crashes, but not Byzantine failures. Here, we take a first look at building a distributed, sharded database that can tolerate – and adapt to – Byzantine adversaries.

Towards Adaptive Fault-Tolerant Sharded Databases
1. Bhavana Mehta
2. Neelesh Chinnakonda Ashok Kumar
3. Prashanth S Iyer
4. Mohammad Javad Amiri
5. Boon Thau Loo
6. Ryan Marcus
AIDB@VLDB '23 (pdf)

Learned query superoptimization

Modern analytics databases frequently run the same query multiple times. Could it be worth spending a long time – hours – optimizing such queries? I argue that doing so might allow DBMSes to capture some of the performance of bespoke systems.

Learned Query Superoptimization
1. Ryan Marcus
AIDB@VLDB '23 (pdf)