Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches
Summary: Assesses DL on DB-resident data via four canonical approaches, with MOP highlighted, and notes no single best method. Prototype on Greenplum; DL workloads reveal a Pareto frontier among speed, governance, and portability, guiding DL-support design with open-source artifacts. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yuhao Zhang
- 2. Frank McQuillan
- 3. Nandish Jayaram
- 4. Nikhil Kak
- 5. Ekta Khanna
- 6. Orhan Kislal
- 7. Domino Valdano
- 8. Arun Kumar
Incoming Citations (Sorted by Pagerank)
Showing 14 of 14 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 30 of 30 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,103 | Grep: A Graph Learning Based Database Partitioning System | 2023 | SIGMOD | 4.5852201e-05 |
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |
| 4,409 | Declarative Recursive Computation on an RDBMS | 2019 | VLDB | 6.2104034e-05 |
| 608 | DeepDB: Learn from Data, not from Queries! | 2020 | VLDB | 0.00019235898 |
| 9,222 | Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning | 2021 | VLDB | 4.3698672e-05 |
| 3,076 | Learning a Partitioning Advisor for Cloud Databases | 2020 | SIGMOD | 7.6107677e-05 |
| 8,864 | Cerebro: A Layered Data Platform for Scalable Deep Learning | 2021 | CIDR | 4.4326439e-05 |
| 683 | Cerebro: A Data System for Optimized Deep Learning Model Selection | 2020 | VLDB | 0.00018195476 |
| 7,372 | Model-Free Control for Distributed Stream Data Processing using Deep Reinforcement Learning | 2018 | VLDB | 4.7496881e-05 |
| 13,171 | Reimagining Deep Learning Systems Through the Lens of Data Systems | 2024 | VLDB | - |