Back to papers
Accelerate Distributed Joins with Predicate Transfer
Summary: Extends predicate transfer to distributed joins with cost-based adaptive execution and Bloom-filter pre-filtering. Introduces pruning to drop non-contributory transfers; reports 3x speedup and 2.7x data-exchange reduction on TPC-H/DSB SF400 in a distributed analytics engine.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 7197
- Venue
- SIGMOD
- Year
- 2025
- Pagerank
- 4.4534753e-05
- Overall Rank
- 8,781 | 38.92%
- DOI
-
10.1145/3725259
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 30 of 30 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 16 |
MAGIC SETS AND OTHER STRANGE WAYS TO IMPLEMENT LOGIC PROGRAMS (Extended Abstract) |
1986 |
PODS |
0.0010066783 |
| 30 |
Hashing Methods and Relational Algebra Operations |
1984 |
VLDB |
0.00078672446 |
| 66 |
Spark SQL: Relational Data Processing in Spark |
2015 |
SIGMOD |
0.00061639801 |
| 139 |
Predicate Migration: Optimizing Queries with Expensive Predicates |
1993 |
SIGMOD |
0.00042299329 |
| 351 |
Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs |
2009 |
VLDB |
0.0002636504 |
| 365 |
On the Power of Magic |
1987 |
PODS |
0.00025585898 |
| 540 |
Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs |
2011 |
SIGMOD |
0.0002063443 |
| 700 |
A Methodology For Interpreting Tree Queries Into Optimal Semi-Join Expressions |
1980 |
SIGMOD |
0.00017948517 |
| 1,016 |
Memory-Efficient Hash Joins |
2015 |
VLDB |
0.00014638492 |
| 1,302 |
Query Optimization by Predicate Move-Around |
1994 |
VLDB |
0.00012705525 |
| 1,342 |
On the Design of a Query Processing Strategy in a Distributed Database Environment |
1983 |
SIGMOD |
0.00012483694 |
| 1,423 |
Magic is Relevant |
1990 |
SIGMOD |
0.00012054867 |
| 2,526 |
Track Join: Distributed Joins with Minimal Network Traffic |
2014 |
SIGMOD |
8.5968612e-05 |
| 2,772 |
Quickstep: A Data Platform Based on the Scaling-Up Approach |
2018 |
VLDB |
8.1401661e-05 |
| 2,985 |
DSB: A Decision Support Benchmark for Workload-Driven and Traditional Database Systems |
2021 |
VLDB |
7.7795847e-05 |
| 3,283 |
Magic Conditions |
1990 |
PODS |
7.280826e-05 |
| 3,443 |
Distributed Join Algorithms on Thousands of Cores |
2017 |
VLDB |
7.0887214e-05 |
| 3,779 |
Instance-Optimized Data Layouts for Cloud Analytics Workloads |
2021 |
SIGMOD |
6.7747205e-05 |
| 3,922 |
Pushing Data-Induced Predicates Through Joins in Big-Data Clusters |
2020 |
VLDB |
6.6291079e-05 |
| 4,132 |
Advanced Join Strategies for Large-Scale Distributed Computation |
2014 |
VLDB |
6.4241067e-05 |
| 4,276 |
Looking Ahead Makes Query Plans Robust: Making the Initial Case with In-Memory Star Schema Data Warehouse Workloads |
2017 |
VLDB |
6.2976602e-05 |
| 4,465 |
Robust Join Processing with Diamond Hardened Joins |
2024 |
VLDB |
6.1604282e-05 |
| 4,667 |
FlexPushdownDB: Hybrid Pushdown and Caching in a Cloud DBMS |
2021 |
VLDB |
6.0116919e-05 |
| 5,036 |
Query Processing For Distributed Databases Using Generalized Semi-Joins |
1982 |
SIGMOD |
5.745922e-05 |
| 5,194 |
Bitvector-aware Query Optimization for Decision Support Queries |
2020 |
SIGMOD |
5.6368209e-05 |
| 5,531 |
Presto: A Decade of SQL Analytics at Meta |
2023 |
SIGMOD |
5.4549499e-05 |
| 5,765 |
Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries |
2024 |
CIDR |
5.336442e-05 |
| 7,060 |
SquirrelJoin: Network-Aware Distributed Join Processing with Lazy Partitioning |
2017 |
VLDB |
4.8465382e-05 |
| 7,677 |
Semi-Join Algorithms For Multiprocessor Systems |
1982 |
SIGMOD |
4.6813684e-05 |
| 9,839 |
Optimal Semijoin Schedules For Query Processing In Local Distributed Database Systems |
1981 |
SIGMOD |
4.2739573e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 3,154 |
The MemSQL Query Optimizer: A modern optimizer for real-time analytics in a distributed database |
2016 |
VLDB |
7.4686089e-05 |
| 3,443 |
Distributed Join Algorithms on Thousands of Cores |
2017 |
VLDB |
7.0887214e-05 |
| 1,939 |
From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System |
2015 |
SIGMOD |
0.00010025655 |
| 3,821 |
Locality-aware Partitioning in Parallel Database Systems |
2015 |
SIGMOD |
6.7281515e-05 |
| 10,241 |
Robust Predicate Transfer with Dynamic Execution |
2026 |
VLDB |
4.1945683e-05 |
| 1,429 |
A Scalable, Predictable Join Operator for Highly Concurrent Data Warehouses |
2009 |
VLDB |
0.00012033518 |
| 4,132 |
Advanced Join Strategies for Large-Scale Distributed Computation |
2014 |
VLDB |
6.4241067e-05 |
| 11,890 |
Let's Rethink Join Optimization in Distributed Systems |
2015 |
CIDR |
4.1945683e-05 |
| 3,922 |
Pushing Data-Induced Predicates Through Joins in Big-Data Clusters |
2020 |
VLDB |
6.6291079e-05 |
| 5,765 |
Predicate Transfer: Efficient Pre-Filtering on Multi-Join Queries |
2024 |
CIDR |
5.336442e-05 |