Database Paper Browser

Back to papers

Quickly Generating Billion-Record Synthetic Databases

Summary: Scalable billion-record synthetic SQL databases on shared-nothing clusters for benchmarking. Key ideas: congruential generators for dense, unique uniform data; concurrent index generation via discrete logs; and support for exponential, normal, and self-similar distributions. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
2730
Venue
SIGMOD
Year
1994
Pagerank
0.0004138408
Overall Rank
145 | 99.00%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 28 of 78 citing papers.

Rank Citing Paper Year Venue Pagerank
7,024 Plush: A Write-Optimized Persistent Log-Structured Hash-Table 2022 VLDB 4.8575128e-05
7,399 SmartBench: A Benchmark For Data Management In Smart Spaces 2020 VLDB 4.7410149e-05
7,464 Testing Database Applications 2006 SIGMOD 4.722995e-05
7,493 Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks 2023 VLDB 4.7180617e-05
7,630 Evaluating Persistent Memory Range Indexes: Part Two 2022 VLDB 4.6923637e-05
7,689 ROBUS: Fair Cache Allocation for Data-parallel Workloads 2017 SIGMOD 4.6765769e-05
7,759 Dscaler: Synthetically Scaling A Given Relational Database 2016 VLDB 4.6593145e-05
7,895 HYDRA: A Dynamic Big Data Regenerator 2018 VLDB 4.623701e-05
7,896 An Analysis of Concurrency Control Protocols for In-Memory Databases with CCBench 2020 VLDB 4.623192e-05
7,995 BP-tree: Overcoming the Point-Range Operation Tradeoff for In-Memory B-trees 2023 VLDB 4.6109825e-05
8,119 DecLog: Decentralized Logging in Non-Volatile Memory for Time Series Database Systems 2024 VLDB 4.5809563e-05
8,278 Constant Optimization Driven Database System Testing 2025 SIGMOD 4.5435639e-05
8,680 A Practical Approach to Groupjoin and Nested Aggregates 2021 VLDB 4.4694927e-05
8,684 Unbiased Estimation of Size and Other Aggregates Over Hidden Web Databases 2010 SIGMOD 4.4677591e-05
8,846 Scaling your Hybrid CPU-GPU DBMS to Multiple GPUs 2024 VLDB 4.4372012e-05
8,858 Automatic Contention Detection and Amelioration for Data-Intensive Operations 2010 SIGMOD 4.4344518e-05
8,870 DataSynth: Generating Synthetic Data using Declarative Constraints 2011 VLDB 4.431665e-05
9,185 Practical DB-OS Co-Design with Privileged Kernel Bypass 2025 SIGMOD 4.3792034e-05
9,454 OptiQL: Robust Optimistic Locking for Memory-Optimized Indexes 2023 SIGMOD 4.3391522e-05
9,815 Robustness against Read Committed for Transaction Templates 2021 VLDB 4.2783272e-05
9,836 Projection-Compliant Database Generation 2022 VLDB 4.2747054e-05
9,838 Efficiently Joining Large Relations on Multi-GPU Systems 2025 VLDB 4.2740344e-05
10,114 SRS: Detecting Logic Bugs of Join Implementation in DBMSs via Set Relation Synthesis 2026 SIGMOD 4.1945683e-05
10,622 Simple Testing Can Expose Most Critical Transaction Bugs: Understanding and Detecting Write-Specific Serializability Violations in Database Systems 2025 VLDB 4.1945683e-05
10,660 Rebirth-Retire: A Concurrency Control Protocol Adaptable to Different Levels of Contention 2025 VLDB 4.1945683e-05
11,142 Cache-Efficient Top-k Aggregation over High Cardinality Large Datasets 2024 VLDB 4.1945683e-05
12,409 Dwarfs in the Rearview Mirror: How Big are they Really? 2008 VLDB 4.1945683e-05
12,565 Parallel Execution of Test Runs for Database Application Systems 2005 VLDB 4.1945683e-05
Previous Page 2 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 2 of 2 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
10 Benchmarking Database Systems: A Systematic Approach 1983 VLDB 0.0012103754
20 GAMMA - A High Performance Dataflow Database Machine 1986 VLDB 0.00086459551
Previous Page 1 / 1 Next

Semantically Similar Papers