Finding Frequent Items in Probabilistic Data
Summary: Possible-world semantics define likely frequent items in probabilistic data, capturing structure beyond simple expected frequency. Exact offline algorithms (quadratic/cubic) and sublinear-memory streaming sampling with provable accuracy and confidence-based ranking, validated on real and synthetic data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,080 | Sliding-Window Top-k Queries on Uncertain Streams | 2008 | VLDB | 6.4652983e-05 |
| 7,633 | Mining Frequent Itemsets over Uncertain Databases | 2012 | VLDB | 4.6914549e-05 |
| 11,952 | Beyond Itemsets: Mining Frequent Featuresets over Structured Items | 2015 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 20 of 20 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 74 | Efficient Query Evaluation on Probabilistic Databases | 2004 | VLDB | 0.00057857292 |
| 8,090 | Probabilistic Histograms for Probabilistic Data | 2009 | VLDB | 4.5888589e-05 |
| 4,095 | Ranking Continuous Probabilistic Datasets | 2010 | VLDB | 6.4556768e-05 |
| 467 | Evaluating Probabilistic Queries over Imprecise Data | 2003 | SIGMOD | 0.00022443768 |
| 1,609 | A Unified Approach to Ranking in Probabilistic Databases | 2009 | VLDB | 0.00011150935 |
| 3,385 | Estimating Statistical Aggregates on Probabilistic Data Streams | 2007 | PODS | 7.1580391e-05 |
| 1,707 | Ranking Queries on Uncertain Data: A Probabilistic Threshold Approach | 2008 | SIGMOD | 0.00010816111 |
| 3,041 | Sketching Probabilistic Data Streams | 2007 | SIGMOD | 7.6697078e-05 |
| 835 | Finding Frequent Items in Data Streams | 2008 | VLDB | 0.00016109621 |
| 7,633 | Mining Frequent Itemsets over Uncertain Databases | 2012 | VLDB | 4.6914549e-05 |