Database Paper Browser

Back to papers

Adaptive and Robust Query Execution for Lakehouses at Scale

Summary: AQE for lakehouses: use pipeline breakers to collect runtime statistics and reoptimize plans, mitigating missing/incorrect table/column stats and bad cardinality/UDF estimates. Up to 25× TPC‑DS speedup; deployed at Databricks for exabyte‑scale workloads to reduce data movement, spills, and memory pressure. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13596
Venue
VLDB
Year
2024
Pagerank
4.8477825e-05
Overall Rank
7,059 | 50.90%
DOI
10.14778/3685800.3685818

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 24 of 24 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1 Access Path Selection in a Relational Database Management System 1979 SIGMOD 0.0040449103
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
115 Eddies: Continuously Adaptive Query Processing 2000 SIGMOD 0.00046221215
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
220 Efficient Mid-Query Re-Optimization of Sub-Optimal Query Execution Plans 1998 SIGMOD 0.00033194808
310 The Vertica Analytic Database: C-Store 7 Years Later 2012 VLDB 0.00028132402
426 Amazon Redshift and the Case for Simpler Data Warehouses 2015 SIGMOD 0.00023594359
476 Impala: A Modern, Open-Source SQL Engine for Hadoop 2015 CIDR 0.00022226941
508 Dynamic Query Evaluation Plans 1989 SIGMOD 0.00021463742
520 An Overview of The System Software of A Parallel Relational Database Machine GRACE 1986 VLDB 0.00021152636
542 Shark: SQL and Rich Analytics at Scale 2013 SIGMOD 0.00020595648
650 Robust Query Processing through Progressive Optimization 2004 SIGMOD 0.00018659177
746 Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores 2020 VLDB 0.00017326979
1,377 Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics 2021 CIDR 0.00012296941
1,574 Approximate Query Processing: No Silver Bullet 2017 SIGMOD 0.00011287495
1,797 Effective Use of Block-Level Sampling in Statistics Estimation 2004 SIGMOD 0.00010523169
1,915 Handling Data Skew in Parallel Joins in Shared-Nothing Systems 2008 SIGMOD 0.00010104123
2,473 Photon: A Fast Query Engine for Lakehouse Systems 2022 SIGMOD 8.7237281e-05
2,504 Enhanced Subquery Optimizations in Oracle 2009 VLDB 8.6351917e-05
2,940 Operating System Extensions for the Teradata Parallel VLDB 2001 VLDB 7.8520709e-05
3,355 F1 Query: Declarative Querying at Scale 2018 VLDB 7.1829142e-05
5,531 Presto: A Decade of SQL Analytics at Meta 2023 SIGMOD 5.4549499e-05
6,390 Proactive Re-optimization with Rio 2005 SIGMOD 5.0842083e-05
Previous Page 1 / 1 Next

Semantically Similar Papers