PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking
Summary: PBench synthesizes cloud analytics workloads that replicate real execution statistics (performance metrics, operator distributions, and temporal dynamics) by selecting and combining benchmark components and filling in missing pieces through augmentation. Key innovations: multi-objective optimization for component selection, progressive timestamp assignment, and LLM-based component augmentation to preserve statistical fidelity. Together these achieve up to 6x lower approximation error than prior work. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Yan Zhou
2. Chunwei Liu
3. Bhuvan Urgaonkar
4. Zhengle Wang
5. Magnus Mueller
6. Chao Zhang
7. Songyue Zhang
8. Pascal Pfeil
9. Dominik Horn
10. Zhengchun Liu
11. Davide Pagano
12. Tim Kraska
13. Samuel Madden
14. Ju Fan
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Outgoing Citations (Sorted by Pagerank)
Showing 20 of 20 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,637 | Database Workload Characterization with Query Plan Encoders | 2022 | VLDB | 5.3979505e-05 |
| 8,985 | TSM-Bench: Benchmarking Time Series Database Systems for Monitoring Applications | 2023 | VLDB | 4.4156106e-05 |
| 7,892 | M2Bench: A Database Benchmark for Multi-Model Analytic Workloads | 2023 | VLDB | 4.6245179e-05 |
| 340 | OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases | 2014 | VLDB | 0.00026841628 |
| 1,727 | BigBench: Towards an Industry Standard Benchmark for Big Data Analytics | 2013 | SIGMOD | 0.00010740936 |
| 9,639 | CDSBen: Benchmarking the Performance of Storage Services in Cloud-native Database System at ByteDance | 2023 | VLDB | 4.3109052e-05 |
| 4,517 | Generating Databases for Query Workloads | 2010 | VLDB | 6.1178732e-05 |
| 3,178 | Why TPC Is Not Enough: An Analysis of the Amazon Redshift Fleet | 2024 | VLDB | 7.4325992e-05 |
| 10,724 | Privacy-Enhanced Database Synthesis for Benchmark Publishing | 2025 | VLDB | 4.1945683e-05 |
| 4,717 | Cloud Analytics Benchmark | 2023 | VLDB | 5.9751539e-05 |