Generating Databases for Query Workloads
Summary: MyBenchmark is an offline data-generation tool that ingests a query set and emits database instances whose data distributions reproduce the workload. The paper focuses on application-driven benchmarking and covers the tool's architecture, algorithms, and an evaluation on TPC workloads. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Eric Lo
2. Nick Cheng
3. Wing-Kai Hon
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,291 | Data Generation using Declarative Constraints | 2011 | SIGMOD | 9.0926719e-05 |
| 3,924 | A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation | 2021 | SIGMOD | 6.6271553e-05 |
| 5,942 | SAM: Database Generation from Query Workloads with Supervised Autoregressive Models | 2022 | SIGMOD | 5.2634242e-05 |
| 6,234 | Just can't get enough - Synthesizing Big Data | 2015 | SIGMOD | 5.1451686e-05 |
| 6,887 | Synthesizing Linked Data Under Cardinality and Integrity Constraints | 2021 | SIGMOD | 4.8937852e-05 |
| 7,759 | Dscaler: Synthetically Scaling A Given Relational Database | 2016 | VLDB | 4.6593145e-05 |
| 8,870 | DataSynth: Generating Synthetic Data using Declarative Constraints | 2011 | VLDB | 4.431665e-05 |
| 8,954 | Understanding Queries by Conditional Instances | 2022 | SIGMOD | 4.4221863e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 145 | Quickly Generating Billion-Record Synthetic Databases | 1994 | SIGMOD | 0.0004138408 |
| 888 | QAGen: Generating Query-Aware Test Databases | 2007 | SIGMOD | 0.00015578618 |
| 2,035 | Generating Example Data for Dataflow Programs | 2009 | SIGMOD | 9.7149269e-05 |
| 2,277 | Generating Targeted Queries for Database Testing | 2008 | SIGMOD | 9.1241198e-05 |
| 4,638 | Test Data for Relational Queries (Extended abstract) | 1986 | PODS | 6.0291138e-05 |
| 7,190 | Database Support for Matching: Limitations and Opportunities | 2006 | SIGMOD | 4.8051876e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,353 | An In-Depth Benchmarking of Text-to-SQL Systems | 2021 | SIGMOD | 5.5521332e-05 |
| 2,129 | IDEBench: A Benchmark for Interactive Data Exploration | 2020 | SIGMOD | 9.480002e-05 |
| 10,421 | A Query-Aware Enormous Database Generator For System Performance Evaluation | 2025 | SIGMOD | 4.1945683e-05 |
| 10 | Benchmarking Database Systems: A Systematic Approach | 1983 | VLDB | 0.0012103754 |
| 7,892 | M2Bench: A Database Benchmark for Multi-Model Analytic Workloads | 2023 | VLDB | 4.6245179e-05 |
| 8,624 | A Study of Database Performance Sensitivity to Experiment Settings | 2022 | VLDB | 4.483049e-05 |
| 9,780 | BenchPress: Dynamic Workload Control in the OLTP-Bench Testbed | 2015 | SIGMOD | 4.2856106e-05 |
| 2,994 | Data Generation for Application-Specific Benchmarking | 2011 | VLDB | 7.761114e-05 |
| 954 | Benchmarking Simple Database Operations | 1987 | SIGMOD | 0.0001507746 |
| 340 | OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases | 2014 | VLDB | 0.00026841628 |