BlockJoin: Efficient Matrix Partitioning Through Joins
Summary: BlockJoin fuses relational and linear-algebra operators to emit block-partitioned results, cutting shuffles. Adapts columnar techniques (index-joins, late materialization) to dataflow engines, delivering 6x speedups and skew resilience vs Spark. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Andreas Kunft
- 2. Asterios Katsifodimos
- 3. Sebastian Schelter
- 4. Tilmann Rabl
- 5. Volker Markl
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,122 | SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle | 2020 | CIDR | 9.4989076e-05 |
| 2,350 | An Intermediate Representation for Optimizing Machine Learning Pipelines | 2019 | VLDB | 8.9788641e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 18 of 18 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171 |
| 3,141 | ClusterJoin: A Similarity Joins Framework using Map-Reduce | 2014 | VLDB | 7.4829448e-05 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 10,930 | Similarity Joins of Sparse Features | 2024 | SIGMOD | 4.1945683e-05 |
| 6,191 | Automatic Optimization of Matrix Implementations for Distributed Machine Learning and Linear Algebra | 2021 | SIGMOD | 5.1642282e-05 |
| 5,118 | AdaptDB: Adaptive Partitioning for Distributed Joins | 2017 | VLDB | 5.6820984e-05 |
| 11,797 | Runtime Optimization of Join Location in Parallel Data Management Systems | 2017 | VLDB | 4.1945683e-05 |
| 11,890 | Let's Rethink Join Optimization in Distributed Systems | 2015 | CIDR | 4.1945683e-05 |
| 3,382 | Scalable and Adaptive Online Joins | 2014 | VLDB | 7.1597145e-05 |