Accelerating Triangle Counting on GPU
Summary: Lightweight graph preprocessing boosts GPU triangle counting without changing code. Proposes analytic models for workload imbalance and pattern diversity; uses approx edge directions and vertex reordering to balance load and boost GPU parallelism. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,025 | NeutronStar: Distributed GNN Training with Hybrid Dependency Management | 2022 | SIGMOD | 7.6906935e-05 |
| 5,474 | Efficient Load-Balanced Butterfly Counting on GPU | 2022 | VLDB | 5.4881807e-05 |
| 10,079 | Fast Optimal Group Steiner Tree Search using GPUs | 2026 | SIGMOD | 4.1945683e-05 |
| 10,085 | GraphTwin: Cache-Centric Bit-Level Graph Representation for Fast and Exact Graph Queries | 2026 | SIGMOD | 4.1945683e-05 |
| 10,485 | Finding Logic Bugs in Graph-processing Systems via Graph-cutting | 2025 | SIGMOD | 4.1945683e-05 |
| 10,596 | Truss Decomposition in Hypergraphs | 2025 | VLDB | 4.1945683e-05 |
| 10,863 | Towards Sufficient GPU-accelerated Dynamic Graph Management: Survey and Experiment | 2025 | VLDB | 4.1945683e-05 |
| 10,871 | Efficient Computation of Hyper-triangles on Hypergraphs | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 108 | Truss Decomposition in Massive Networks | 2012 | VLDB | 0.00048300163 |
| 1,150 | K-Core Decomposition of Large Networks on a Single PC | 2016 | VLDB | 0.00013657353 |
| 1,973 | Speeding Up Set Intersections in Graph Algorithms using SIMD Instructions | 2018 | SIGMOD | 9.8913631e-05 |
| 2,051 | Efficient Parallel Lists Intersection and Index Compression Algorithms using Graphics Processing Units | 2011 | VLDB | 9.686731e-05 |
| 3,975 | Accelerating Truss Decomposition on Heterogeneous Processors | 2020 | VLDB | 6.5736847e-05 |
| 4,254 | Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining | 2011 | VLDB | 6.3213177e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,254 | Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining | 2011 | VLDB | 6.3213177e-05 |
| 9,228 | Efficiently Counting Triangles in Large Temporal Graphs | 2025 | SIGMOD | 4.3690661e-05 |
| 5,046 | Better Algorithms for Counting Triangles in Data Streams | 2016 | PODS | 5.7405307e-05 |
| 589 | Massive Graph Triangulation | 2013 | SIGMOD | 0.00019576567 |
| 3,641 | GPU-Accelerated Subgraph Enumeration on Partitioned Graphs | 2020 | SIGMOD | 6.8884895e-05 |
| 1,344 | Counting and Sampling Triangles from a Graph Stream | 2013 | VLDB | 0.00012473724 |
| 3,063 | Sliding Window-based Approximate Triangle Counting over Streaming Graphs with Duplicate Edges | 2021 | SIGMOD | 7.6321424e-05 |
| 5,474 | Efficient Load-Balanced Butterfly Counting on GPU | 2022 | VLDB | 5.4881807e-05 |
| 4,522 | GPU-based Graph Traversal on Compressed Graphs | 2019 | SIGMOD | 6.1146374e-05 |
| 3,975 | Accelerating Truss Decomposition on Heterogeneous Processors | 2020 | VLDB | 6.5736847e-05 |