NeutronOrch: Rethinking Sample-based GNN Training under CPU-GPU Heterogeneous Environments
Summary: Diagnoses suboptimal CPU–GPU orchestration in sample-based GNN training and introduces layer-decoupled execution that pushes bottom-layer training to CPU to shrink GPU compute and memory footprint. NeutronOrch offloads only frequently accessed vertices with bounded-staleness embedding reuse and a fine-grained pipeline, achieving up to 11.51× speedup over prior systems. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xin Ai
- 2. Qiange Wang
- 3. Chunyu Cao
- 4. Yanfeng Zhang
- 5. Chaoyi Chen
- 6. Hao Yuan
- 7. Yu Gu
- 8. Ge Yu
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,942 | Efficient Training of Graph Neural Networks on Large Graphs | 2024 | VLDB | 4.8922884e-05 |
| 10,011 | A Comprehensive Benchmark on Spectral GNNs: The Impact on Efficiency, Memory, and Effectiveness | 2026 | SIGMOD | 4.1945683e-05 |
| 10,027 | NeutronHeter: Optimizing Distributed Graph Neural Network Training for Heterogeneous Clusters | 2026 | SIGMOD | 4.1945683e-05 |
| 10,233 | Efficient GNN Training on Giant Graphs with Collective Batching and Scheduling | 2026 | VLDB | 4.1945683e-05 |
| 10,539 | Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch. | 2025 | VLDB | 4.1945683e-05 |
| 10,570 | NeutronTask: Scalable and Efficient Multi-GPU GNN Training with Task Parallelism | 2025 | VLDB | 4.1945683e-05 |
| 10,735 | Faster Convergence in Mini-batch Graph Neural Networks Training with Pseudo Full Neighborhood Compensation | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,103 | Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture | 2021 | VLDB | 0.00014025101 |
| 1,160 | Sancus: Staleness-Aware Communication-Avoiding Full-Graph Decentralized Training in Large-Scale Graph Neural Networks | 2022 | VLDB | 0.00013586221 |
| 2,422 | DUCATI: A Dual-Cache Training System for Graph Neural Networks on Giant Graphs with the GPU | 2023 | SIGMOD | 8.8499665e-05 |
| 3,025 | NeutronStar: Distributed GNN Training with Hybrid Dependency Management | 2022 | SIGMOD | 7.6906935e-05 |
| 3,711 | Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale | 2022 | SIGMOD | 6.823609e-05 |
| 5,321 | FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training | 2024 | VLDB | 5.5710441e-05 |
Previous
Page 1 / 1
Next