Automated Multidimensional Data Layouts in Amazon Redshift
Summary: Multidimensional data layouts (MDDL) sort a table by a set of predicates rather than by column order, so that zone maps can prune more blocks. The predicate set is learned automatically from workload telemetry. MDDL is implemented in Amazon Redshift, achieves up to 85% end-to-end speedup, and is described as the first commercial data layout that sorts by predicates.
(summarized by gpt-5-nano on Feb 09 2026)
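The idea in the summary can be illustrated with a toy model. The sketch below is not Redshift's implementation: the block size, the synthetic table, the predicate `y < 4`, and all function names are hypothetical, and it assumes a per-block zone map that stores the min/max of a single predicate bit. It compares how many blocks a scan must read when the rows are ordered by column `x` versus ordered by whether they satisfy the workload predicate.

```python
from typing import Callable, Dict, List, Tuple

Row = Dict[str, int]
Predicate = Callable[[Row], bool]

BLOCK_SIZE = 4  # hypothetical block size, chosen only for illustration


def chunk(rows: List[Row], size: int = BLOCK_SIZE) -> List[List[Row]]:
    """Split rows into fixed-size blocks, in their current order."""
    return [rows[i:i + size] for i in range(0, len(rows), size)]


def scan_with_column_zone_map(
    blocks: List[List[Row]], column: str, upper: int
) -> Tuple[int, List[Row]]:
    """Answer `column < upper`, skipping blocks whose column minimum is too large.

    A real system would precompute the min/max; it is computed inline here
    to keep the sketch short.
    """
    scanned, hits = 0, []
    for block in blocks:
        block_min = min(r[column] for r in block)
        if block_min < upper:  # block may contain matching rows
            scanned += 1
            hits.extend(r for r in block if r[column] < upper)
    return scanned, hits


def scan_with_predicate_zone_map(
    blocks: List[List[Row]], predicate: Predicate
) -> Tuple[int, List[Row]]:
    """Answer `predicate`, skipping blocks where no row satisfies it.

    Assumes the zone map keeps the max of the predicate bit (0 or 1) per block.
    """
    scanned, hits = 0, []
    for block in blocks:
        bit_max = max(int(predicate(r)) for r in block)
        if bit_max == 1:
            scanned += 1
            hits.extend(r for r in block if predicate(r))
    return scanned, hits


# Synthetic table: y is scattered with respect to x.
table = [{"x": i, "y": (i * 7) % 16} for i in range(16)]


def workload_predicate(r: Row) -> bool:
    return r["y"] < 4


# Layout 1: sorted by column x, zone maps on y.
column_blocks = chunk(sorted(table, key=lambda r: r["x"]))
column_scanned, column_hits = scan_with_column_zone_map(column_blocks, "y", 4)

# Layout 2: sorted by the predicate result (satisfying rows first).
predicate_blocks = chunk(sorted(table, key=lambda r: not workload_predicate(r)))
predicate_scanned, predicate_hits = scan_with_predicate_zone_map(
    predicate_blocks, workload_predicate
)
```

In this toy setup both layouts return the same rows, but the predicate-sorted layout packs the matching rows into fewer blocks, so fewer blocks pass the zone-map check. How the real system chooses and combines multiple predicates is not described in the summary and is not modeled here.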
- Paper ID: 6787
- Venue: SIGMOD
- Year: 2024
- Pagerank: 4.555289e-05
- Overall Rank: 8,225 (42.79%)
- DOI: 10.1145/3626246.3653379
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 15 of 15 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 158 | Automated Selection of Materialized Views and Indexes for SQL Databases | 2000 | VLDB | 0.00040071492 |
| 368 | Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing | 1998 | VLDB | 0.000254931 |
| 1,017 | Automatic Physical Database Tuning: A Relaxation-based Approach | 2005 | SIGMOD | 0.00014634307 |
| 1,284 | Amazon Redshift Re-invented | 2022 | SIGMOD | 0.00012837822 |
| 1,478 | Learning Multi-dimensional Indexes | 2020 | SIGMOD | 0.00011762542 |
| 1,611 | Qd-tree: Learning Data Layouts for Big Data Analytics | 2020 | SIGMOD | 0.00011147324 |
| 1,889 | Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads | 2021 | VLDB | 0.00010200865 |
| 3,779 | Instance-Optimized Data Layouts for Cloud Analytics Workloads | 2021 | SIGMOD | 6.7747205e-05 |
| 4,593 | Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift | 2023 | SIGMOD | 6.0606891e-05 |
| 6,466 | Pando: Enhanced Data Skipping with Logical Data Partitioning | 2023 | VLDB | 5.0528281e-05 |
| 6,659 | Fast and Effective Distribution-Key Recommendation for Amazon Redshift | 2020 | VLDB | 4.9710856e-05 |
| 6,984 | Replicated Layout for In-Memory Database Systems | 2022 | VLDB | 4.873081e-05 |
| 7,337 | Unified Spatial Analytics from Heterogeneous Sources with Amazon Redshift | 2020 | SIGMOD | 4.7584825e-05 |
| 8,181 | Foreign Keys Open the Door for Faster Incremental View Maintenance | 2023 | SIGMOD | 4.5660166e-05 |
| 8,442 | SageDB: An Instance-Optimized Data Analytics System | 2022 | VLDB | 4.5120602e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,935 | Automated Clustering Recommendation With Database Zone Maps | 2024 | SIGMOD | 4.1945683e-05 |
| 6,659 | Fast and Effective Distribution-Key Recommendation for Amazon Redshift | 2020 | VLDB | 4.9710856e-05 |
| 2,045 | Multi-Dimensional Clustering: A New Data Layout Scheme in DB2 | 2003 | SIGMOD | 9.6939983e-05 |
| 3,779 | Instance-Optimized Data Layouts for Cloud Analytics Workloads | 2021 | SIGMOD | 6.7747205e-05 |
| 6,972 | Predicate Caching: Query-Driven Secondary Indexing for Cloud Data Warehouses | 2024 | SIGMOD | 4.8785237e-05 |
| 5,832 | Stage: Query Execution Time Prediction in Amazon Redshift | 2024 | SIGMOD | 5.3111109e-05 |
| 3,028 | Efficient Query Processing for Multi-Dimensionally Clustered Tables in DB2 | 2003 | VLDB | 7.6816205e-05 |
| 7,603 | Automated design of multidimensional clustering tables for relational databases | 2004 | VLDB | 4.6985903e-05 |
| 1,478 | Learning Multi-dimensional Indexes | 2020 | SIGMOD | 0.00011762542 |
| 4,593 | Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift | 2023 | SIGMOD | 6.0606891e-05 |