On Sampling from Massive Graph Streams
Summary: GPS is an order-based reservoir sampling framework for graphs, weighting edge samples to optimize subgraph estimation. Separates sampling from estimation (post- and in-stream) with a Martingale-based unbiased estimator; yields <1% error on subgraph counts while using <0.01% of edges. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Nesreen K. Ahmed
- 2. Nick Duffield
- 3. Theodore L. Willke
- 4. Ryan A. Rossi
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,626 | Efficient Biclique Counting in Large Bipartite Graphs | 2023 | SIGMOD | 6.0399035e-05 |
| 5,518 | Hypergraph Motifs: Concepts, Algorithms, and Discoveries | 2020 | VLDB | 5.4621935e-05 |
| 10,276 | AGIS: Fast Approximate Graph Pattern Mining with Structure-Informed Sampling | 2026 | VLDB | 4.1945683e-05 |
| 10,623 | Efficient and Adaptive Estimation of Local Triadic Coefficients | 2025 | VLDB | 4.1945683e-05 |
| 10,871 | Efficient Computation of Hyper-triangles on Hypergraphs | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 392 | Counting Triangles in Data Streams | 2006 | PODS | 0.00024556183 |
| 595 | Estimating PageRank on Graph Streams | 2008 | PODS | 0.00019507721 |
| 1,344 | Counting and Sampling Triangles from a Graph Stream | 2013 | VLDB | 0.00012473724 |
| 1,472 | Space Efficient Mining of Multigraph Streams | 2005 | PODS | 0.00011828662 |
Previous
Page 1 / 1
Next