Database Paper Browser

Pruning in Snowflake: Working Smarter, Not Harder

Summary: The paper extends pruning beyond filter predicates to LIMIT, top-k, and JOIN operations, broadening its reach across workloads. Using min/max metadata and Iceberg-style formats, it prunes up to 99.4% of micro-partitions in Snowflake workloads and reveals that real-world workloads are more selective than commonly assumed. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 7110
Venue: SIGMOD
Year: 2025
Pagerank: 4.5197687e-05
Overall Rank: 8,415 | 41.46%
DOI: 10.1145/3722212.3724447

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
10,196 | PTO: A Workload-driven Predictive Table Optimizer for Lakehouse Systems | 2026 | SIGMOD | 4.1945683e-05
10,241 | Robust Predicate Transfer with Dynamic Execution | 2026 | VLDB | 4.1945683e-05
10,749 | Scaling GPU-Accelerated Databases beyond GPU Memory Size | 2025 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 25 of 25 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
7 | Optimal Aggregation Algorithms for Middleware [Extended Abstract] | 2001 | PODS | 0.0015496097
44 | The Design Of Postgres | 1986 | SIGMOD | 0.00071838587
80 | Weaving Relations for Cache Performance | 2001 | VLDB | 0.00055721729
167 | The Snowflake Elastic Data Warehouse | 2016 | SIGMOD | 0.00039180521
185 | DuckDB: an Embeddable Analytical Database | 2019 | SIGMOD | 0.00036538405
268 | R* Optimizer Validation and Performance Evaluation for Local Queries | 1986 | SIGMOD | 0.00029662304
286 | Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design | 2004 | SIGMOD | 0.00028990057
368 | Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing | 1998 | VLDB | 0.000254931
1,397 | Analytic Database Technologies for a New Kind of User - The Data Enthusiast | 2012 | SIGMOD | 0.00012199672
1,470 | Processing a Trillion Cells per Mouse Click | 2012 | VLDB | 0.00011833779
1,477 | Fine-grained Partitioning for Aggressive Data Skipping | 2014 | SIGMOD | 0.00011770865
1,611 | Qd-tree: Learning Data Layouts for Big Data Analytics | 2020 | SIGMOD | 0.00011147324
2,916 | Quantifying TPC-H Choke Points and Their Optimizations | 2020 | VLDB | 7.9068048e-05
2,985 | DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems | 2021 | VLDB | 7.7795847e-05
3,153 | Horizontal Data Partitioning In Database Design | 1982 | SIGMOD | 7.4707022e-05
3,488 | Optimal Column Layout for Hybrid Workloads | 2019 | VLDB | 7.0479329e-05
3,737 | Skipping-oriented Partitioning for Columnar Layouts | 2017 | VLDB | 6.8033227e-05
3,779 | Instance-Optimized Data Layouts for Cloud Analytics Workloads | 2021 | SIGMOD | 6.7747205e-05
3,922 | Pushing Data-Induced Predicates Through Joins in Big-Data Clusters | 2020 | VLDB | 6.6291079e-05
4,158 | Performance-Optimal Filtering: Bloom Overtakes Cuckoo at High Throughput | 2019 | VLDB | 6.3994318e-05
4,717 | Cloud Analytics Benchmark | 2023 | VLDB | 5.9751539e-05
5,315 | Cuckoo Index: A Lightweight Secondary Index Structure | 2020 | VLDB | 5.5723424e-05
6,972 | Predicate Caching: Query-Driven Secondary Indexing for Cloud Data Warehouses | 2024 | SIGMOD | 4.8785237e-05
8,442 | SageDB: An Instance-Optimized Data Analytics System | 2022 | VLDB | 4.5120602e-05
8,886 | Provenance-based Data Skipping | 2022 | VLDB | 4.4279829e-05

Semantically Similar Papers