Data Generation using Declarative Constraints
Summary: Data generation for synthetic databases via declarative cardinality constraints that specify query-result sizes. Efficient algorithms cover a large, practical constraint class, with empirical results showing scalable performance and outperforming prior techniques. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Arvind Arasu
- 2. Raghav Kaushik
- 3. Jian Li
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 145 | Quickly Generating Billion-Record Synthetic Databases | 1994 | SIGMOD | 0.0004138408 |
| 372 | Selectivity Estimation using Probabilistic Models | 2001 | SIGMOD | 0.00025354779 |
| 512 | STHoles: A Multidimensional Workload-Aware Histogram | 2001 | SIGMOD | 0.00021380733 |
| 888 | QAGen: Generating Query-Aware Test Databases | 2007 | SIGMOD | 0.00015578618 |
| 934 | Flexible Database Generators | 2005 | VLDB | 0.00015227409 |
| 1,011 | ToXgene: A template-based data generator for XML | 2002 | SIGMOD | 0.00014652718 |
| 1,483 | Simple and Realistic Data Generation | 2006 | VLDB | 0.00011720317 |
| 2,035 | Generating Example Data for Dataflow Programs | 2009 | SIGMOD | 9.7149269e-05 |
| 4,215 | Generating XML Structure Using Examples and Constraints | 2008 | VLDB | 6.3527334e-05 |
| 4,517 | Generating Databases for Query Workloads | 2010 | VLDB | 6.1178732e-05 |
| 5,977 | Understanding Cardinality Estimation using Entropy Maximization | 2010 | PODS | 5.2455909e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,118 | Test Data Generation for Complex SQL Queries | 2026 | SIGMOD | 4.1945683e-05 |
| 10,421 | A Query-Aware Enormous Database Generator For System Performance Evaluation | 2025 | SIGMOD | 4.1945683e-05 |
| 145 | Quickly Generating Billion-Record Synthetic Databases | 1994 | SIGMOD | 0.0004138408 |
| 934 | Flexible Database Generators | 2005 | VLDB | 0.00015227409 |
| 9,836 | Projection-Compliant Database Generation | 2022 | VLDB | 4.2747054e-05 |
| 5,371 | LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning | 2022 | SIGMOD | 5.5428776e-05 |
| 8,699 | Supporting Database Constraints in Synthetic Data Generation based on Generative Adversarial Networks | 2020 | SIGMOD | 4.465684e-05 |
| 2,277 | Generating Targeted Queries for Database Testing | 2008 | SIGMOD | 9.1241198e-05 |
| 6,887 | Synthesizing Linked Data Under Cardinality and Integrity Constraints | 2021 | SIGMOD | 4.8937852e-05 |
| 8,870 | DataSynth: Generating Synthetic Data using Declarative Constraints | 2011 | VLDB | 4.431665e-05 |