Exploiting Matrix Dependency for Efficient Distributed Matrix Computation

Summary: DMac exploits matrix dependencies to reduce communication in distributed matrix computation. It decomposes a matrix program into operations, derives a dependency-oriented cost model, and generates un-interleaved, stage-wise execution plans on Spark for efficient local processing. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4933
Venue: SIGMOD
Year: 2015
Pagerank: 8.013421e-05
Overall Rank: 2,855 | 80.16%
DOI: 10.1145/2723372.2723712

Incoming Non-self Citations Over Time

Incoming Citations (Sorted by Pagerank)

Showing 12 of 12 citing papers.

Overall Rank	Citing Paper	Year	Venue	Pagerank
557	SystemML: Declarative Machine Learning on Spark	2016	VLDB	0.00020186115
1,970	Compressed Linear Algebra for Large-Scale Machine Learning	2016	VLDB	9.9024431e-05
3,920	On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML	2018	VLDB	6.6246708e-05
4,807	Resource Elasticity for Large-Scale Machine Learning	2015	SIGMOD	5.9045148e-05
6,194	Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra	2021	SIGMOD	5.1592984e-05
6,747	DistME: A Fast and Elastic Distributed Matrix Computation Engine using GPUs	2019	SIGMOD	4.9369478e-05
8,258	FuseME: Distributed Matrix Computation Engine based on Cuboid-based Fused Operator and Plan Generation	2022	SIGMOD	4.5424271e-05
8,515	UPLIFT: Parallelization Strategies for Feature Transformations in Machine Learning Workloads	2022	VLDB	4.4901466e-05
11,341	Redundancy Elimination in Distributed Matrix Computation	2022	SIGMOD	4.1905499e-05
11,475	Hybrid Evaluation for Distributed Iterative Matrix Computation	2021	SIGMOD	4.1905499e-05
11,515	HyMAC: A Hybrid Matrix Computation System	2021	VLDB	4.1905499e-05
11,854	Real-time Video Recommendation Exploration	2016	SIGMOD	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Overall Rank	Cited Paper	Year	Venue	Pagerank
319	Overview of SciDB: Large Scale Array Storage, Processing and Analysis	2010	SIGMOD	0.00027771951
408	HaLoop: Efficient Iterative Data Processing on Large Clusters	2010	VLDB	0.00023939456
947	MRShare: Sharing Across Multiple Queries in MapReduce	2010	VLDB	0.00015112344
1,407	Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML	2014	VLDB	0.00012163413
2,754	Stubby: A Transformation-based Optimizer for MapReduce Workflows	2012	VLDB	8.1720428e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
9,442	BlockJoin: Efficient Matrix Partitioning Through Joins	2017	VLDB	4.3384032e-05
1,407	Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML	2014	VLDB	0.00012163413
8,533	Translation of Array-Based Loops to Distributed Data-Parallel Programs	2020	VLDB	4.4893996e-05
7,019	Bridging the Gap Between HPC and Big Data Frameworks	2017	VLDB	4.8553946e-05
5,622	Distributed Implementations of Dependency Discovery Algorithms	2019	VLDB	5.4050344e-05
7,952	Efficient Matrix Sketching over Distributed Data	2017	PODS	4.6089395e-05
11,341	Redundancy Elimination in Distributed Matrix Computation	2022	SIGMOD	4.1905499e-05
6,194	Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra	2021	SIGMOD	5.1592984e-05
6,747	DistME: A Fast and Elastic Distributed Matrix Computation Engine using GPUs	2019	SIGMOD	4.9369478e-05
11,475	Hybrid Evaluation for Distributed Iterative Matrix Computation	2021	SIGMOD	4.1905499e-05