Database Paper Browser

BlockJoin: Efficient Matrix Partitioning Through Joins

Summary: BlockJoin fuses relational and linear-algebra operators so that a join emits block-partitioned results directly, cutting shuffles. It adapts columnar techniques (index joins, late materialization) to dataflow engines, delivering speedups of up to 6x and resilience to skew compared with Spark. (summarized by gpt-5-nano on Feb 09 2026)
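
The ideas named in the summary (an index join on keys only, late materialization of values, and output that is already block-partitioned) can be illustrated with a small single-process sketch. This is not the paper's distributed implementation: the function name `block_join`, the input layout, and the block dictionary format are all hypothetical choices made for illustration, and it assumes join keys in the right table are unique.

```python
from collections import defaultdict


def block_join(left, right, block_rows, block_cols):
    """Join two keyed tables and emit the joined matrix as blocks.

    Hypothetical sketch. `left` and `right` are lists of (key, [values...])
    tuples; keys in `right` are assumed unique. Returns a dict mapping
    (block_row, block_col) to {(row_in_block, col_in_block): value}.
    """
    # Step 1 (index join): match keys and record row positions only,
    # without touching the value payloads.
    right_pos = {key: j for j, (key, _) in enumerate(right)}
    matches = [
        (i, right_pos[key])
        for i, (key, _) in enumerate(left)
        if key in right_pos
    ]

    # Step 2 (late materialization): fetch values only for matched rows and
    # write each value straight into the block it belongs to, so no separate
    # repartitioning pass over the joined rows is needed.
    blocks = defaultdict(dict)
    for out_row, (i, j) in enumerate(matches):
        values = left[i][1] + right[j][1]
        for col, value in enumerate(values):
            block_id = (out_row // block_rows, col // block_cols)
            cell = (out_row % block_rows, col % block_cols)
            blocks[block_id][cell] = value
    return dict(blocks)


# Usage: rows "a" and "c" match; row "b" is dropped before any values are read.
left = [("a", [1.0, 2.0]), ("b", [3.0, 4.0]), ("c", [5.0, 6.0])]
right = [("a", [10.0]), ("c", [30.0])]
blocks = block_join(left, right, block_rows=1, block_cols=2)
```

In a dataflow engine the second step would route each value to the worker that owns its block; here a dictionary keyed by block id stands in for that routing.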

Paper ID: 11517
Venue: VLDB
Year: 2017
Pagerank: 4.3425552e-05
Overall Rank: 9,437 | 34.35%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
9 | Implementation Techniques For Main Memory Database Systems | 1984 | SIGMOD | 0.0014279444
51 | Including Group-By in Query Optimization | 1994 | VLDB | 0.00067123727
52 | Database Architecture Optimized for the new Bottleneck: Memory Access | 1999 | VLDB | 0.00066474881
168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305
318 | Overview of SciDB: Large Scale Array Storage, Processing and Analysis | 2010 | SIGMOD | 0.00027795661
497 | Column-Stores vs. Row-Stores: How Different Are They Really? | 2008 | SIGMOD | 0.00021716559
543 | MLbase: A Distributed Machine-learning System | 2013 | CIDR | 0.00020526854
557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988
585 | Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems | 2012 | VLDB | 0.00019706145
761 | Materialization Optimizations for Feature Selection Workloads | 2014 | SIGMOD | 0.00017053783
860 | The Multidimensional Database System RasDaMan | 1998 | SIGMOD | 0.00015860465
1,074 | Processing Theta-Joins using MapReduce | 2011 | SIGMOD | 0.00014260096
1,167 | Learning Generalized Linear Models Over Normalized Data | 2015 | SIGMOD | 0.00013547713
1,653 | Query Processing Techniques for Solid State Drives | 2009 | SIGMOD | 0.00011003558
1,967 | Compressed Linear Algebra for Large-Scale Machine Learning | 2016 | VLDB | 9.9131712e-05
2,526 | Track Join: Distributed Joins with Minimal Network Traffic | 2014 | SIGMOD | 8.5968612e-05
2,667 | Cumulon: Optimizing Statistical Data Analysis in the Cloud | 2013 | SIGMOD | 8.3413995e-05
2,818 | Implicit Parallelism through Deep Language Embedding | 2015 | SIGMOD | 8.0665558e-05

Semantically Similar Papers