Back to papers
Accelerating Recommendation System Training by Leveraging Popular Choices
Summary: Hot-embedding skew in recommender training; FAE uses a hot-embedding aware layout to keep frequently accessed embeddings on GPU, exploiting skewed access to reduce CPU-GPU transfers. Delivers 2.3× speedup over CPU-only XDL and 1.52× over CPU+GPU with maintained accuracy on production models.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 12636
- Venue
- VLDB
- Year
- 2022
- Pagerank
- 8.2991144e-05
- Overall Rank
- 2,688 | 81.31%
- DOI
-
10.14778/3485450.3485462
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 5,993 |
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud |
2024 |
VLDB |
5.2415551e-05 |
| 5,998 |
Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding |
2023 |
VLDB |
5.2415551e-05 |
| 8,737 |
Scheduling Data Processing Pipelines for Incremental Training on MLP-based Recommendation Models |
2025 |
SIGMOD |
4.456315e-05 |
| 9,094 |
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication |
2023 |
SIGMOD |
4.3980444e-05 |
| 9,231 |
Modyn: Data-Centric Machine Learning Pipeline Orchestration |
2025 |
SIGMOD |
4.3690661e-05 |
| 9,326 |
BladeDISC: Optimizing Dynamic Shape Machine Learning Workloads via Compiler Approach |
2023 |
SIGMOD |
4.3556432e-05 |
| 9,806 |
The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format |
2024 |
SIGMOD |
4.2805224e-05 |
| 10,472 |
CARINA: An Efficient CXL-Oriented Embedding Serving System for Recommendation Models |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,580 |
GPEmu: A GPU Emulator for Faster and Cheaper Prototyping and Evaluation of Deep Learning System Research |
2025 |
VLDB |
4.1945683e-05 |
| 10,697 |
Lighter-X: An Efficient and Plug-and-play Strategy for Graph-based Recommendation through Decoupled Propagation |
2025 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 10,885 |
Efficient Graph Embedding Generation and Update for Large-Scale Temporal Graph |
2025 |
VLDB |
4.1945683e-05 |
| 13,116 |
The Limits of Graph Samplers for Training Inductive Recommender Systems |
2025 |
VLDB |
- |
| 9,408 |
Experimental Analysis of Large-scale Learnable Vector Storage Compression |
2024 |
VLDB |
4.3441378e-05 |
| 5,998 |
Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding |
2023 |
VLDB |
5.2415551e-05 |
| 5,993 |
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud |
2024 |
VLDB |
5.2415551e-05 |
| 8,737 |
Scheduling Data Processing Pipelines for Incremental Training on MLP-based Recommendation Models |
2025 |
SIGMOD |
4.456315e-05 |
| 1,103 |
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture |
2021 |
VLDB |
0.00014025101 |
| 4,887 |
Supercharging Recommender Systems using Taxonomies for Learning User Purchase Behavior |
2012 |
VLDB |
5.8533058e-05 |
| 9,402 |
CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models |
2024 |
SIGMOD |
4.3441378e-05 |
| 9,094 |
FEC: Efficient Deep Recommendation Model Training with Flexible Embedding Communication |
2023 |
SIGMOD |
4.3980444e-05 |