PS2: Parameter Server on Spark
Summary: PS2 deploys a parameter server on Spark, preserving Spark for data processing while offloading models, without hacking Spark core. DCV enables locality-aware, element-wise multi-vector ops, delivering up to 55.6x Spark MLlib and 3.7x Petuum. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Zhipeng Zhang
- 2. Bin Cui
- 3. Yingxia Shao
- 4. Lele Yu
- 5. Jiawei Jiang
- 6. Xupeng Miao
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,677 | HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework | 2022 | VLDB | 8.3268401e-05 |
| 4,557 | Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches | 2021 | VLDB | 6.087611e-05 |
| 4,975 | An Experimental Evaluation of Large Scale GBDT Systems | 2019 | VLDB | 5.79026e-05 |
| 5,333 | Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce | 2021 | SIGMOD | 5.5656575e-05 |
| 5,720 | BAGUA: Scaling up Distributed Learning with System Relaxations | 2022 | VLDB | 5.3527734e-05 |
| 5,988 | NuPS: A Parameter Server for Machine Learning with Non-Uniform Parameter Access | 2022 | SIGMOD | 5.2430981e-05 |
| 6,471 | Dynamic Parameter Allocation in Parameter Servers | 2020 | VLDB | 5.0511668e-05 |
| 8,025 | Just Move It! Dynamic Parameter Allocation in Action | 2021 | VLDB | 4.6031105e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 497 | Column-Stores vs. Row-Stores: How Different Are They Really? | 2008 | SIGMOD | 0.00021716559 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691 |
| 9,469 | DimBoost: Boosting Gradient Boosting Decision Tree to Higher Dimensions | 2018 | SIGMOD | 4.3342363e-05 |
| 11,795 | LDA*: A Robust and Large-scale Topic Modeling System | 2017 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,019 | Bridging the Gap Between HPC and Big Data Frameworks | 2017 | VLDB | 4.860057e-05 |
| 8,300 | sPCA: Scalable Principal Component Analysis for Big Data on Distributed Platforms | 2015 | SIGMOD | 4.5435639e-05 |
| 11,188 | ST4ML: Machine Learning Oriented Spatio-Temporal Data Processing at Scale | 2023 | SIGMOD | 4.1945683e-05 |
| 6,998 | PetPS: Supporting Huge Embedding Models with Persistent Memory | 2023 | VLDB | 4.8676312e-05 |
| 1,402 | Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML | 2014 | VLDB | 0.00012180605 |
| 1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691 |
| 8,617 | A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning | 2024 | VLDB | 4.4846425e-05 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 6,871 | Towards General and Efficient Online Tuning for Spark | 2023 | VLDB | 4.8997004e-05 |
| 2,440 | FlexPS: Flexible Parallelism Control in Parameter Server Architecture | 2018 | VLDB | 8.8119143e-05 |