Comprehensive and Efficient Workload Compression
Summary: Construct a representative workload from an input analytical workload with guarantees on representativity and coverage. A greedy, approximation-guaranteed algorithm solves the NP-hard problem, preserving distributions and outliers, and compares with sampling and clustering. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Shaleen Deep
- 2. Anja Gruenheid
- 3. Paraschos Koutris
- 4. Jeffrey Naughton
- 5. Stratis Viglas
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,429 | Real-time Workload Pattern Analysis for Large-scale Cloud Databases | 2023 | VLDB | 7.1010535e-05 |
| 3,789 | DIAMetrics: Benchmarking Query Engines at Scale | 2020 | VLDB | 6.7644737e-05 |
| 5,686 | Budget-aware Index Tuning with Reinforcement Learning | 2022 | SIGMOD | 5.3712312e-05 |
| 6,366 | ISUM: Efficiently Compressing Large and Complex Workloads for Scalable Index Tuning | 2022 | SIGMOD | 5.0943443e-05 |
| 7,336 | Refactoring Index Tuning Process with Benefit Estimation | 2024 | VLDB | 4.7599411e-05 |
| 8,041 | DISTILL: Low-Overhead Data-Driven Techniques for Filtering and Costing Indexes for Scalable Index Tuning | 2022 | VLDB | 4.5998045e-05 |
| 9,467 | Database Gyms | 2023 | CIDR | 4.3346412e-05 |
| 9,638 | TracEx: Understanding and Analyzing Database Traces | 2024 | CIDR | 4.3109052e-05 |
| 9,929 | Wred: Workload Reduction for Scalable Index Tuning | 2024 | SIGMOD | 4.2510122e-05 |
| 9,956 | SCompression: Enhancing Database Knob Tuning Efficiency Through Slice-Based OLTP Workload Compression | 2025 | VLDB | 4.2373024e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 333 | Neo: A Learned Query Optimizer | 2019 | VLDB | 0.00027206884 |
| 661 | Database Tuning Advisor for Microsoft SQL Server 2005 | 2004 | VLDB | 0.00018481174 |
| 1,443 | Compressing SQL Workloads | 2002 | SIGMOD | 0.00011947004 |
| 3,355 | F1 Query: Declarative Querying at Scale | 2018 | VLDB | 7.1829142e-05 |
| 3,789 | DIAMetrics: Benchmarking Query Engines at Scale | 2020 | VLDB | 6.7644737e-05 |
| 4,287 | Primitives for Workload Summarization and Implications for SQL | 2003 | VLDB | 6.2891702e-05 |
| 4,681 | Adaptive Sampling for Rapidly Matching Histograms | 2018 | VLDB | 6.0034918e-05 |
| 6,080 | Answering Top-k Representative Queries on Graph Databases | 2014 | SIGMOD | 5.2214553e-05 |
| 8,364 | Query Log Compression for Workload Analytics | 2019 | VLDB | 4.5357797e-05 |
| 8,365 | Efficient Summarization Framework for Multi-Attribute Uncertain Data | 2014 | SIGMOD | 4.5357797e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,808 | A Robust, Optimization-Based Approach for Approximate Answering of Aggregate Queries | 2001 | SIGMOD | 8.0870741e-05 |
| 7,828 | Modeling Shifting Workloads for Learned Database Systems | 2024 | SIGMOD | 4.6407986e-05 |
| 10,927 | Computing A Well-Representative Summary of Conjunctive Query Results | 2024 | PODS | 4.1945683e-05 |
| 7,429 | CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases | 2022 | SIGMOD | 4.7320139e-05 |
| 6,157 | Compression Aware Physical Database Design | 2011 | VLDB | 5.1801143e-05 |
| 5,637 | Database Workload Characterization with Query Plan Encoders | 2022 | VLDB | 5.3979505e-05 |
| 8,578 | Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems | 2022 | VLDB | 4.4923477e-05 |
| 1,100 | Query Optimization In Compressed Database Systems | 2001 | SIGMOD | 0.00014072277 |
| 8,364 | Query Log Compression for Workload Analytics | 2019 | VLDB | 4.5357797e-05 |
| 1,443 | Compressing SQL Workloads | 2002 | SIGMOD | 0.00011947004 |