HongTu: Scalable Full-Graph GNN Training on Multiple GPUs
Summary: HongTu scales full-graph GNN training on multi-GPU platforms with CPU-memory vertex storage, GPU offload, and recomputation caching. It minimizes host-GPU traffic with deduplicated communication and cost-guided reorganization, and delivers speedups over DistGNN. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Qiange Wang
- 2. Yao Chen
- 3. Weng-Fai Wong
- 4. Bingsheng He
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,607 | Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods | 2025 | VLDB | 4.6967024e-05 |
| 10,027 | NeutronHeter: Optimizing Distributed Graph Neural Network Training for Heterogeneous Clusters | 2026 | SIGMOD | 4.1945683e-05 |
| 10,066 | DepCache: A KV Cache Management Framework for GraphRAG with Dependency Attention | 2026 | SIGMOD | 4.1945683e-05 |
| 10,298 | NeutronCloud: Resource-Aware Distributed GNN Training in Fluctuating Cloud Environments | 2026 | VLDB | 4.1945683e-05 |
| 10,539 | Graph Neural Network Training Systems: A Performance Comparison of Full-Graph and Mini-Batch. | 2025 | VLDB | 4.1945683e-05 |
| 10,570 | NeutronTask: Scalable and Efficient Multi-GPU GNN Training with Task Parallelism | 2025 | VLDB | 4.1945683e-05 |
| 10,735 | Faster Convergence in Mini-batch Graph Neural Networks Training with Pseudo Full Neighborhood Compensation | 2025 | VLDB | 4.1945683e-05 |
| 11,026 | Improving Graph Compression for Efficient Resource-Constrained Graph Analytics | 2024 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 278 | AliGraph: A Comprehensive Graph Neural Network Platform | 2019 | VLDB | 0.00029230623 |
| 1,103 | Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture | 2021 | VLDB | 0.00014025101 |
| 1,160 | Sancus: Staleness-Aware Communication-Avoiding Full-Graph Decentralized Training in Large-Scale Graph Neural Networks | 2022 | VLDB | 0.00013586221 |
| 2,067 | HippogriffDB: Balancing I/O and GPU Bandwidth in Big Data Analytics | 2016 | VLDB | 9.6392739e-05 |
| 2,287 | Pipelined Query Processing in Coprocessor Environments | 2018 | SIGMOD | 9.0972606e-05 |
| 3,025 | NeutronStar: Distributed GNN Training with Hybrid Dependency Management | 2022 | SIGMOD | 7.6906935e-05 |
| 4,355 | LargeEA: Aligning Entities for Large-scale Knowledge Graphs | 2022 | VLDB | 6.259483e-05 |
| 5,699 | EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal in GPUs | 2021 | VLDB | 5.3654927e-05 |
Previous
Page 1 / 1
Next