An Efficient Rigorous Approach for Identifying Statistically Significant Frequent Itemsets

Summary: Applies the Chen–Stein Poisson approximation to the number of itemsets with support ≥ s in order to locate a threshold s* at which the observed count exceeds what would be expected in random data. Presents an efficient parametric multi-hypothesis test that controls the false discovery rate (FDR) using whole-dataset counts rather than per-itemset tests, and validates it empirically. (summarized by gpt-5-mini on Feb 09 2026)
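
The idea in the summary can be illustrated with a minimal sketch. It is not the paper's actual procedure. It assumes that, for each candidate support threshold s, the observed number of itemsets with support ≥ s and the expected number under a random-data null model are already available, and that the count under the null is approximately Poisson with that expectation. The function names, the input dictionaries, and the uncorrected significance level `alpha` are hypothetical. The multiple-testing correction needed for FDR control is deliberately left out.

```python
import math


def poisson_tail(q, lam):
    """Return P(X >= q) for X ~ Poisson(lam)."""
    if q <= 0:
        return 1.0
    # Accumulate P(X = 0), ..., P(X = q - 1), then take the complement.
    term = math.exp(-lam)
    cdf = term
    for i in range(1, q):
        term *= lam / i
        cdf += term
    return max(0.0, 1.0 - cdf)


def smallest_significant_support(observed, expected, alpha=0.05):
    """Scan thresholds in increasing order and return the first s whose
    observed count is unlikely under the Poisson null, or None.

    observed: hypothetical dict {s: number of itemsets with support >= s}
    expected: hypothetical dict {s: expected count under the null model}
    alpha:    per-threshold level; NOT corrected for multiple testing
    """
    for s in sorted(observed):
        if poisson_tail(observed[s], expected[s]) <= alpha:
            return s
    return None


# Illustrative numbers only; they are not taken from the paper.
observed_counts = {10: 50, 20: 12, 30: 4}
expected_counts = {10: 48.0, 20: 3.0, 30: 0.1}
s_star = smallest_significant_support(observed_counts, expected_counts)
```

Testing one aggregate count per threshold, rather than one hypothesis per itemset, is what keeps the number of hypotheses small in this sketch. A real implementation would also have to adjust `alpha` across the thresholds tested.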

Paper ID: 1483
Venue: PODS
Year: 2009
Pagerank: 5.8892254e-05
Overall Rank: 4,827 | 66.46%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
1,942	SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging	2021	SIGMOD	0.00010010569
2,930	Assessing and Ranking Structural Correlations in Graphs	2011	SIGMOD	7.8790753e-05
12,140	Controlling False Positives in Association Rule Mining	2012	VLDB	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
13	Mining Association Rules between Sets of Items in Large Databases	1993	SIGMOD	0.0010863639
602	Mining Quantitative Association Rules in Large Relational Tables	1996	SIGMOD	0.00019350521
3,061	Mining Compressed Frequent-Pattern Sets	2005	VLDB	7.6389596e-05
5,426	A New Framework For Itemset Generation	1998	PODS	5.5131484e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
11,986	Resource-oriented Approximation for Frequent Itemset Mining from Bursty Data Streams	2014	SIGMOD	4.1905499e-05
3,446	Traversing Itemset Lattices with Statistical Metric Pruning	2000	PODS	7.0836261e-05
831	Finding Frequent Items in Data Streams	2008	VLDB	0.00016094846
2,688	On Differentially Private Frequent Itemset Mining	2013	VLDB	8.3012815e-05
7,633	Mining Frequent Itemsets over Uncertain Databases	2012	VLDB	4.6869557e-05
4,451	False Positive or False Negative: Mining Frequent Itemsets from High Speed Transactional Data Streams	2004	VLDB	6.172063e-05
11,960	Beyond Itemsets: Mining Frequent Featuresets over Structured Items	2015	VLDB	4.1905499e-05
12,698	Mining Frequent Itemsets Using Support Constraints	2000	VLDB	4.1905499e-05
9,062	Feasible Itemset Distributions in Data Mining: Theory and Application	2003	PODS	4.3997447e-05
11,042	Efficient Discovery of Significant Patterns with Few-Shot Resampling	2024	VLDB	4.1905499e-05