Database Paper Browser

Back to papers

CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models

Summary: CAFE enables compact, adaptive embedding for large-scale DLRMs; HotSketch identifies hot features and assigns them dedicated embeddings, while non-hot features share via multi-level hash. Theoretical accuracy/convergence analysis; 3.92% and 3.68% AUC gains on Criteo Kaggle and CriteoTB at 10k× compression. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6860
Venue
SIGMOD
Year
2024
Pagerank
4.3441378e-05
Overall Rank
9,402 | 34.60%
DOI
10.1145/3639306

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
9,805 MEMO: Fine-grained Tensor Management For Ultra-long Context LLM Training 2025 SIGMOD 4.2805224e-05
10,233 Efficient GNN Training on Giant Graphs with Collective Batching and Scheduling 2026 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
984 Natural language to SQL: Where are we today? 2020 VLDB 0.00014857465
1,584 Augmented Sketch: Faster and More Accurate Stream Processing 2016 SIGMOD 0.00011255801
2,677 HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework 2022 VLDB 8.3268401e-05
3,169 QueryFormer: A Tree Transformer Model for Query Plan Representation 2022 VLDB 7.4498425e-05
3,271 Data Sketches for Disaggregated Subset Sum and Frequent Item Estimation 2018 SIGMOD 7.2968732e-05
3,499 Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation 2021 VLDB 7.0376445e-05
3,803 Scaling Attributed Network Embedding to Massive Graphs 2021 VLDB 6.7550628e-05
4,462 LOGER: A Learned Optimizer towards Generating Efficient and Robust Query Execution Plans 2023 VLDB 6.1611784e-05
5,052 HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training 2022 SIGMOD 5.7337977e-05
5,377 Parallel Training of Knowledge Graph Embedding Models: A Comparison of Techniques 2022 VLDB 5.5410858e-05
5,909 At-the-time and Back-in-time Persistent Sketches 2021 SIGMOD 5.2769377e-05
6,738 Agile and Accurate CTR Prediction Model Training for Massive-Scale Online Advertising Systems 2021 SIGMOD 4.9452647e-05
7,164 SKT: A One-Pass Multi-Sketch Data Analytics Accelerator 2021 VLDB 4.8131514e-05
7,256 Effective and Efficient Retrieval of Structured Entities 2020 VLDB 4.7869419e-05
7,474 Cardinality Estimation of Approximate Substring Queries using Deep Learning 2022 VLDB 4.7194345e-05
9,041 TreeSensing: Linearly Compressing Sketches with Flexibility 2023 SIGMOD 4.4039656e-05
9,408 Experimental Analysis of Large-scale Learnable Vector Storage Compression 2024 VLDB 4.3441378e-05
Previous Page 1 / 1 Next

Semantically Similar Papers