Workload Insights From The Snowflake Data Cloud: What Do Production Analytic Queries Really Look Like?
Summary: Analysis of 667M BI queries on Snowflake over two weeks, characterizing production analytic workloads at cloud scale across industries. Reveals detailed filter/join/aggregation behaviors and uncovers workload patterns absent from standard benchmarks, guiding DB research. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Jan Vincent Szlang
- 2. Sebastian Bress
- 3. Sebastian Cattes
- 4. Jonathan Dees
- 5. Florian Funke
- 6. Max Heimel
- 7. Michel Oleynik
- 8. Ismail Oukid
- 9. Tobias Maltenberger
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,207 | SQLStorm: Taking Database Benchmarking into the LLM Era | 2025 | VLDB | 4.5583637e-05 |
| 10,295 | Global Hash Tables Strike Back! An Analysis of Parallel GROUP BY Aggregation | 2026 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,973 | End-to-End Declarative Data Analytics: Co-designing Engines, Interfaces, and Cloud Infrastructure | 2026 | CIDR | 4.1945683e-05 |
| 3,429 | Real-time Workload Pattern Analysis for Large-scale Cloud Databases | 2023 | VLDB | 7.1010535e-05 |
| 11,650 | Query-Driven Learning for Next Generation Predictive Modeling & Analytics | 2019 | SIGMOD | 4.1945683e-05 |
| 3,122 | Dynamic Workload Management for Very Large Data Warehouses – Juggling Feathers and Bowling Balls | 2007 | VLDB | 7.5264988e-05 |
| 167 | The Snowflake Elastic Data Warehouse | 2016 | SIGMOD | 0.00039180521 |
| 2,568 | Towards Cost-Optimal Query Processing in the Cloud | 2021 | VLDB | 8.5239227e-05 |
| 13,228 | White-Box OLAP Performance Modeling for the Cloud | 2021 | CIDR | - |
| 3,779 | Instance-Optimized Data Layouts for Cloud Analytics Workloads | 2021 | SIGMOD | 6.7747205e-05 |
| 4,717 | Cloud Analytics Benchmark | 2023 | VLDB | 5.9751539e-05 |
| 8,415 | Pruning in Snowflake: Working Smarter, Not Harder | 2025 | SIGMOD | 4.5197687e-05 |