Mind the Gap: Large-Scale Frequent Sequence Mining
Summary: MG-FSM enables scalable frequent sequence mining on MapReduce with gap constraints to prune outputs. W-equivalency partitioning enables independent mining with existing algorithms; part-size optimizations cut costs, outperforming prior text mining. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Iris Miliaraki
- 2. Klaus Berberich
- 3. Rainer Gemulla
- 4. Spyros Zoupanos
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,803 | A System for Management and Analysis of Preference Data | 2014 | VLDB | 5.9107061e-05 |
| 6,111 | Why Big Data Industrial Systems Need Rules and What We Can Do About It | 2015 | SIGMOD | 5.2049579e-05 |
| 7,597 | Oracle Workload Intelligence | 2015 | SIGMOD | 4.7007801e-05 |
| 11,903 | LASH: Large-Scale Sequence Mining with Hierarchies | 2015 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13 | Mining Association Rules between Sets of Items in Large Databases | 1993 | SIGMOD | 0.0010864752 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,565 | Flexible and Feasible Support Measures for Mining Frequent Patterns in Large Labeled Graphs | 2017 | SIGMOD | 4.3254416e-05 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 11,903 | LASH: Large-Scale Sequence Mining with Hierarchies | 2015 | SIGMOD | 4.1945683e-05 |
| 5,772 | Mining Frequent Patterns with Differential Privacy | 2013 | VLDB | 5.3322378e-05 |
| 13,889 | Towards Data Mining Benchmarking: A Test Bed for Performance Study of Frequent Pattern Mining | 2000 | SIGMOD | - |
| 3,055 | Mining Compressed Frequent-Pattern Sets | 2005 | VLDB | 7.6448739e-05 |
| 9,561 | T-FSM: A Task-Based System for Massively Parallel Frequent Subgraph Pattern Mining from a Big Graph | 2023 | SIGMOD | 4.3254416e-05 |
| 10,975 | Language-Model Based Informed Partition of Databases to Speed Up Pattern Mining | 2024 | SIGMOD | 4.1945683e-05 |
| 4,307 | Mining Periodic Patterns with Gap Requirement from Sequences | 2005 | SIGMOD | 6.2885419e-05 |
| 181 | Mining Frequent Patterns without Candidate Generation | 2000 | SIGMOD | 0.00036992674 |