Database Paper Browser

Back to papers

Generating Example Data for Dataflow Programs

Summary: Generates small, semantically faithful intermediate data to illustrate dataflow program semantics rather than full outputs. Tackles highly selective and noninvertible operators with dedicated data generation techniques, validated on real Yahoo!-scale dataflow workloads. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4123
Venue
SIGMOD
Year
2009
Pagerank
9.7149269e-05
Overall Rank
2,035 | 85.85%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 11 of 11 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
3 Pig Latin: A Not-So-Foreign Language for Data Processing 2008 SIGMOD 0.0024183614
18 On Random Sampling over Joins 1999 SIGMOD 0.00092385438
888 QAGen: Generating Query-Aware Test Databases 2007 SIGMOD 0.00015578618
949 Tioga: Providing Data Management Support for Scientific Visualization Applications 1993 VLDB 0.00015111638
4,638 Test Data for Relational Queries (Extended abstract) 1986 PODS 6.0291138e-05
Previous Page 1 / 1 Next

Semantically Similar Papers