Database Paper Browser

Back to papers

A Bi-Level Bernoulli Scheme for Database Sampling

Summary: Bi-level Bernoulli sampling unites row- and page-level sampling for ISO-style queries, enabling a tunable speed–precision trade-off with SQL extensions and data-aware parameter optimization. A bang-bang policy governed by a page-heterogeneity index (PHI) guides parameter choice; pilot sampling or catalog statistics set PHI, with a heuristic achieving near-optimal accuracy on clustered or skewed data across 1,100 experiments. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
3525
Venue
SIGMOD
Year
2004
Pagerank
6.4531387e-05
Overall Rank
4,100 | 71.48%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
14 Online Aggregation 1997 SIGMOD 0.0010801504
18 On Random Sampling over Joins 1999 SIGMOD 0.00092385438
39 Statistical Estimators for Relational Algebra Expressions 1988 PODS 0.00074745564
46 Simple Random Sampling from Relational Databases 1986 VLDB 0.00070894702
211 Join Synopses for Approximate Query Answering 1999 SIGMOD 0.00033981214
553 Bifocal Sampling for Skew-Resistant Join Size Estimation 1996 SIGMOD 0.00020272061
Previous Page 1 / 1 Next

Semantically Similar Papers