Accurate and Fast Approximate Graph Pattern Mining at Scale
Summary: ScaleGPM: an A-GPM system with on-the-fly convergence detection that provides provable confidence and low overhead, plus eager-verify pruning and hybrid sampling to overcome low-hit “needle-in-the-hay” cases. Delivers geomean 565× (up to 610k×) speedups vs Arya and scales to billion-node graphs with stable, rapid termination. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Anna Arpaci-Dusseau
- 2. Zixiang Zhou
- 3. Xuhao Chen
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,078 | Estimating Biclique Counts with Accuracy Guarantees | 2026 | SIGMOD | 4.1945683e-05 |
| 10,270 | Characterizing Parallel Subgraph Matching Performance: A Systematic Study of Interactions, Scalability, and Enumeration | 2026 | VLDB | 4.1945683e-05 |
| 10,276 | AGIS: Fast Approximate Graph Pattern Mining with Structure-Informed Sampling | 2026 | VLDB | 4.1945683e-05 |
| 10,658 | LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 392 | Counting Triangles in Data Streams | 2006 | PODS | 0.00024556183 |
| 1,333 | Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins | 2019 | VLDB | 0.00012523806 |
| 1,344 | Counting and Sampling Triangles from a Graph Stream | 2013 | VLDB | 0.00012473724 |
| 1,740 | A General Framework for Estimating Graphlet Statistics via Random Walk | 2017 | VLDB | 0.0001071792 |
| 3,009 | Pangolin: An Efficient and Flexible Graph Mining System on CPU and GPU | 2020 | VLDB | 7.7214924e-05 |
| 3,215 | Fractal: A General-Purpose Graph Pattern Mining System | 2019 | SIGMOD | 7.3645742e-05 |
| 3,410 | Motivo: fast motif counting via succinct color coding and adaptive sampling | 2019 | VLDB | 7.1253867e-05 |
| 6,468 | The Complexity of Counting Cycles in the Adjacency List Streaming Model | 2019 | PODS | 5.0526408e-05 |
Previous
Page 1 / 1
Next