An Efficient Rigorous Approach for Identifying Statistically Significant Frequent Itemsets
Summary: Apply Chen–Stein Poisson approximation to the count of itemsets with support ≥ s to locate s* where observed counts exceed random-data expectations. Presents an efficient parametric multi-hypothesis test controlling FDR using whole-dataset counts rather than per-itemset tests; empirically validated. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Adam Kirsch
- 2. Michael Mitzenmacher
- 3. Andrea Pietracaprina
- 4. Geppino Pucci
- 5. Eli Upfal
- 6. Fabio Vandin
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,940 | SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging | 2021 | SIGMOD | 0.00010020173 |
| 2,930 | Assessing and Ranking Structural Correlations in Graphs | 2011 | SIGMOD | 7.8723983e-05 |
| 12,132 | Controlling False Positives in Association Rule Mining | 2012 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13 | Mining Association Rules between Sets of Items in Large Databases | 1993 | SIGMOD | 0.0010864752 |
| 599 | Mining Quantitative Association Rules in Large Relational Tables | 1996 | SIGMOD | 0.00019394214 |
| 3,055 | Mining Compressed Frequent-Pattern Sets | 2005 | VLDB | 7.6448739e-05 |
| 5,565 | A New Framework For Itemset Generation | 1998 | PODS | 5.4318211e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,978 | Resource-oriented Approximation for Frequent Itemset Mining from Bursty Data Streams | 2014 | SIGMOD | 4.1945683e-05 |
| 3,454 | Traversing Itemset Lattices with Statistical Metric Pruning | 2000 | PODS | 7.0778482e-05 |
| 835 | Finding Frequent Items in Data Streams | 2008 | VLDB | 0.00016109621 |
| 2,685 | On Differentially Private Frequent Itemset Mining | 2013 | VLDB | 8.3070708e-05 |
| 4,449 | False Positive or False Negative: Mining Frequent Itemsets from High Speed Transactional Data Streams | 2004 | VLDB | 6.1780147e-05 |
| 7,633 | Mining Frequent Itemsets over Uncertain Databases | 2012 | VLDB | 4.6914549e-05 |
| 11,952 | Beyond Itemsets: Mining Frequent Featuresets over Structured Items | 2015 | VLDB | 4.1945683e-05 |
| 12,689 | Mining Frequent Itemsets Using Support Constraints | 2000 | VLDB | 4.1945683e-05 |
| 9,064 | Feasible Itemset Distributions in Data Mining: Theory and Application | 2003 | PODS | 4.4039656e-05 |
| 11,039 | Efficient Discovery of Significant Patterns with Few-Shot Resampling | 2024 | VLDB | 4.1945683e-05 |