Database Paper Browser

Back to papers

Sampling Large Databases for Association Rules

Summary: Sample-driven, probabilistic discovery of association rules: mine a random subset to hypothesize likely rules for the full DB. Missed rules can be recovered with a second pass, yielding exact results while retaining near single-pass I/O efficiency. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8338
Venue
VLDB
Year
1996
Pagerank
0.0002233798
Overall Rank
473 | 96.72%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank Citing Paper Year Venue Pagerank
166 Approximate Frequency Counts over Data Streams 2002 VLDB 0.00039361552
277 Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications 1998 SIGMOD 0.00029311426
657 Dynamic Itemset Counting and Implication Rules for Market Basket Data 1997 SIGMOD 0.00018553891
744 Beyond Market Baskets: Generalizing Association Rules to Correlations 1997 SIGMOD 0.00017333019
848 Approximate Counts and Quantiles over Sliding Windows 2004 PODS 0.0001597308
904 Integrating Association Rule Mining with Relational Database Systems: Alternatives and Implications 1998 SIGMOD 0.00015469655
1,336 Clustering Categorical Data: An Approach Based on Dynamical Systems 1998 VLDB 0.00012498064
1,626 Exploratory Mining and Pruning Optimizations of Constrained Association Rules 1998 SIGMOD 0.00011094469
2,266 Estimating the Confidence of Conditional Functional Dependencies 2009 SIGMOD 9.1540815e-05
2,368 Online Maintenance of Very Large Random Samples 2004 SIGMOD 8.9501526e-05
3,100 Crowd Mining 2013 SIGMOD 7.5634778e-05
3,822 Association Rules over Interval Data 1997 SIGMOD 6.7263391e-05
4,258 Online Association Rule Mining 1999 SIGMOD 6.3148619e-05
4,716 Mining Graph Patterns Efficiently via Randomized Summaries 2009 VLDB 5.9755569e-05
4,919 Optimization of Constrained Frequent Set Queries with 2-variable Constraints 1999 SIGMOD 5.8256934e-05
5,436 Output Space Sampling for Graph Patterns 2009 VLDB 5.5042223e-05
6,418 An Optimal Algorithm for l1-Heavy Hitters in Insertion Streams and Related Problems 2016 PODS 5.0696932e-05
6,782 On the Discovery of Interesting Patterns in Association Rules 1998 VLDB 4.9272477e-05
6,961 CATAPULT: Data-driven Selection of Canned Patterns for Efficient Visual Graph Query Formulation 2019 SIGMOD 4.8841486e-05
8,321 A Condensed Representation to Find Frequent Patterns 2001 PODS 4.5435639e-05
8,695 CrowdMiner: Mining association rules from the crowd 2013 VLDB 4.4661379e-05
9,106 TED: Towards Discovering Top-k Edge-Diversified Patterns in a Graph Database 2023 SIGMOD 4.3952103e-05
9,963 Parallel Rule Discovery from Large Datasets by Sampling 2022 SIGMOD 4.2294678e-05
10,489 Incremental Rule Discovery in Response to Parameter Updates 2025 SIGMOD 4.1945683e-05
11,217 Efficient Approximation Framework for Attribute Recommendation 2023 SIGMOD 4.1945683e-05
11,952 Beyond Itemsets: Mining Frequent Featuresets over Structured Items 2015 VLDB 4.1945683e-05
12,573 Cost-Based Labeling of Groups of Mass Spectra 2004 SIGMOD 4.1945683e-05
12,645 Mining Long Sequential Patterns in a Noisy Environment 2002 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers