DistME: A Fast and Elastic Distributed Matrix Computation Engine using GPUs
Summary: Introduces DistME, a fast elastic distributed matrix computation engine built atop Spark, combining CuboidMM with GPU acceleration. CuboidMM partitions matrices into cuboids to minimize network traffic under memory constraints, while subcuboid GPU partitioning reduces PCIe costs; experiments show superior performance and scalability over existing distributed matrix mult methods. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Donghyoung Han
- 2. Yoon-Min Nam
- 3. Jihye Lee
- 4. Kyongseok Park
- 5. Hyunwoo Kim
- 6. Min-Soo Kim
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,821 | Tensor Relational Algebra for Distributed Machine Learning System Design | 2021 | VLDB | 5.3134851e-05 |
| 8,262 | FuseME: Distributed Matrix Computation Engine based on Cuboid-based Fused Operator and Plan Generation | 2022 | SIGMOD | 4.5467867e-05 |
| 9,706 | Distributed Numerical and Machine Learning Computations via Two-Phase Execution of Aggregated Join Trees | 2021 | VLDB | 4.2992942e-05 |
| 11,339 | Redundancy Elimination in Distributed Matrix Computation | 2022 | SIGMOD | 4.1945683e-05 |
| 11,472 | Hybrid Evaluation for Distributed Iterative Matrix Computation | 2021 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 318 | Overview of SciDB: Large Scale Array Storage, Processing and Analysis | 2010 | SIGMOD | 0.00027795661 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 1,279 | Towards Linear Algebra over Normalized Data | 2017 | VLDB | 0.00012868394 |
| 2,848 | Exploiting Matrix Dependency for Efficient Distributed Matrix Computation | 2015 | SIGMOD | 8.0208832e-05 |
| 3,834 | GTS: A Fast and Scalable Graph Processing Method based on Streaming Topology to GPUs | 2016 | SIGMOD | 6.7173094e-05 |
| 3,948 | A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics | 2018 | VLDB | 6.5959084e-05 |
| 6,322 | The BUDS Language for Distributed Bayesian Machine Learning | 2017 | SIGMOD | 5.1124615e-05 |
Previous
Page 1 / 1
Next