Back to papers
HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training
Summary: HET-GMP uses a graph-based design to scale embedding models via a bigraph of data-sample to embedding-vector access. Graph locality, skewness-aware replication/partitioning, bounded-asynchronous sync reduces comms; 87.5% reduction, 27.5x CTR speedup.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 6354
- Venue
- SIGMOD
- Year
- 2022
- Pagerank
- 5.7337977e-05
- Overall Rank
- 5,052 | 64.86%
- DOI
-
10.1145/3514221.3517902
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 12 of 12 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 5,475 |
ETC: Efficient Training of Temporal Graph Neural Networks over Large-scale Dynamic Graphs |
2024 |
VLDB |
5.4869706e-05 |
| 5,561 |
Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses |
2024 |
VLDB |
5.4332062e-05 |
| 6,377 |
Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism |
2023 |
VLDB |
5.0911095e-05 |
| 6,998 |
PetPS: Supporting Huge Embedding Models with Persistent Memory |
2023 |
VLDB |
4.8676312e-05 |
| 8,737 |
Scheduling Data Processing Pipelines for Incremental Training on MLP-based Recommendation Models |
2025 |
SIGMOD |
4.456315e-05 |
| 8,808 |
FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement |
2023 |
SIGMOD |
4.4454035e-05 |
| 9,326 |
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach |
2023 |
SIGMOD |
4.3556432e-05 |
| 9,402 |
CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models |
2024 |
SIGMOD |
4.3441378e-05 |
| 9,408 |
Experimental Analysis of Large-scale Learnable Vector Storage Compression |
2024 |
VLDB |
4.3441378e-05 |
| 9,677 |
Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving |
2025 |
SIGMOD |
4.3047774e-05 |
| 9,966 |
Towards Communication-efficient Vertical Federated Learning Training via Cache-enabled Local Updates |
2022 |
VLDB |
4.2269436e-05 |
| 10,111 |
Scalable Graph Indexing using GPUs for Approximate Nearest Neighbor Search |
2026 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 11,423 |
Scalable Robust Graph Embedding with Spark |
2022 |
VLDB |
4.1945683e-05 |
| 1,877 |
Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation |
2015 |
VLDB |
0.00010236803 |
| 2,400 |
ByteGNN: Efficient Graph Neural Network Training at Large Scale |
2022 |
VLDB |
8.8955105e-05 |
| 8,740 |
Historical Embedding-Guided Efficient Large-Scale Federated Graph Learning |
2024 |
SIGMOD |
4.456315e-05 |
| 5,377 |
Parallel Training of Knowledge Graph Embedding Models: A Comparison of Techniques |
2022 |
VLDB |
5.5410858e-05 |
| 9,596 |
Scalable Graph Convolutional Network Training on Distributed-Memory Systems |
2023 |
VLDB |
4.319218e-05 |
| 3,087 |
Scalable and Efficient Full-Graph GNN Training for Large Graphs |
2023 |
SIGMOD |
7.5939896e-05 |
| 7,924 |
Distributed Graph Embedding with Information-Oriented Random Walks |
2023 |
VLDB |
4.6154072e-05 |
| 10,638 |
Heta: Distributed Training of Heterogeneous Graph Neural Networks |
2025 |
VLDB |
4.1945683e-05 |
| 2,677 |
HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework |
2022 |
VLDB |
8.3268401e-05 |