Probabilistic Histograms for Probabilistic Data
Summary: Proposes probabilistic histograms for uncertain relations, preserving possible-worlds semantics for planning and AQP. Then a Dynamic Programming framework builds optimal buckets with per-bucket PDFs under metrics (variation distance, SSE, max error, EMD1)—polynomial-time, compact. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,712 | Accelerating Approximate Aggregation Queries with Expensive Predicates | 2021 | VLDB | 5.9787986e-05 |
| 11,502 | In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All | 2021 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 74 | Efficient Query Evaluation on Probabilistic Databases | 2004 | VLDB | 0.00057857292 |
| 141 | Selectivity Estimation Without the Attribute Value Independence Assumption | 1997 | VLDB | 0.00041786333 |
| 321 | MCDB: A Monte Carlo Approach to Managing Uncertain Data | 2008 | SIGMOD | 0.00027527389 |
| 325 | The History of Histograms (abridged) | 2003 | VLDB | 0.00027378328 |
| 326 | Optimal Histograms with Quality Guarantees | 1998 | VLDB | 0.00027358981 |
| 627 | Management of Probabilistic Data: Foundations and Challenges | 2007 | PODS | 0.00018959005 |
| 706 | MYSTIQ: A system for finding more answers by using probabilities | 2005 | SIGMOD | 0.00017845469 |
| 2,748 | REHIST: Relative Error Histogram Construction Algorithms | 2004 | VLDB | 8.1785955e-05 |
| 3,041 | Sketching Probabilistic Data Streams | 2007 | SIGMOD | 7.6697078e-05 |
| 3,385 | Estimating Statistical Aggregates on Probabilistic Data Streams | 2007 | PODS | 7.1580391e-05 |
| 8,218 | Mining Deviants in a Time Series Database | 1999 | VLDB | 4.5566051e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,442 | Approximating Predicates and Expressive Queries on Probabilistic Databases | 2008 | PODS | 6.186154e-05 |
| 12,272 | Conditioning and Aggregating Uncertain Data Streams: Going Beyond Expectations | 2010 | VLDB | 4.1945683e-05 |
| 361 | Histogram-Based Approximation of Set-Valued Query Answers | 1999 | VLDB | 0.00025775749 |
| 467 | Evaluating Probabilistic Queries over Imprecise Data | 2003 | SIGMOD | 0.00022443768 |
| 7,623 | Optimizing Probabilistic Query Processing on Continuous Uncertain Data | 2011 | VLDB | 4.6933659e-05 |
| 852 | Dynamic Multidimensional Histograms | 2002 | SIGMOD | 0.00015941524 |
| 3,041 | Sketching Probabilistic Data Streams | 2007 | SIGMOD | 7.6697078e-05 |
| 760 | Creating Probabilistic Databases from Information Extraction Models | 2006 | VLDB | 0.00017053935 |
| 74 | Efficient Query Evaluation on Probabilistic Databases | 2004 | VLDB | 0.00057857292 |
| 627 | Management of Probabilistic Data: Foundations and Challenges | 2007 | PODS | 0.00018959005 |