Runtime Optimization of Join Location in Parallel Data Management Systems
Summary: The paper chooses at runtime, per key, between map-side and reduce-side joins in parallel data management systems, accounting for UDF and data-transfer costs. It extends the ski-rental approach with load balancing across multiple resources (CPU, network, I/O) and provides worst-case guarantees. The technique is implemented on Hadoop, Spark, and Muppet and yields throughput gains. (summarized by gpt-5-nano on Feb 09 2026)
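The summary mentions ski-rental, so a minimal sketch of the classic deterministic break-even rule, applied independently per key, may help orient readers. This is not the paper's algorithm: it ignores the multi-resource load balancing the summary describes, and the names `PerKeyPlanner`, `rent_cost`, and `buy_cost` are hypothetical. Here "rent" stands for paying a per-request cost at the remote side and "buy" for paying a one-time cost so that later requests for the same key are served locally.

```python
from collections import defaultdict


class PerKeyPlanner:
    """Classic ski-rental break-even rule, tracked separately for each key.

    Illustrative only: rent_cost and buy_cost are assumed, fixed costs,
    not quantities taken from the paper.
    """

    def __init__(self, rent_cost: float, buy_cost: float) -> None:
        self.rent_cost = rent_cost        # assumed cost of one remote request
        self.buy_cost = buy_cost          # assumed one-time cost to go local
        self.rent_paid = defaultdict(float)
        self.bought = set()
        self.spent = defaultdict(float)

    def on_request(self, key: str) -> str:
        """Return 'rent', 'buy', or 'local' for this request on `key`."""
        if key in self.bought:
            return "local"                # already bought: no further cost
        # Break-even rule: buy once one more rental would bring the
        # accumulated rent up to (or past) the purchase price.
        if self.rent_paid[key] + self.rent_cost >= self.buy_cost:
            self.bought.add(key)
            self.spent[key] += self.buy_cost
            return "buy"
        self.rent_paid[key] += self.rent_cost
        self.spent[key] += self.rent_cost
        return "rent"

    def total_cost(self, key: str) -> float:
        """Total cost paid so far for `key` (rent plus any purchase)."""
        return self.spent[key]


if __name__ == "__main__":
    planner = PerKeyPlanner(rent_cost=1.0, buy_cost=3.0)
    for i in range(5):
        print(i, planner.on_request("user42"))
    print("total cost:", planner.total_cost("user42"))
```

With this rule the total paid for a key stays below twice the cost of the best decision made in hindsight, which is the standard ski-rental guarantee. How the paper combines such a rule with CPU, network, and I/O load is not shown here.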
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Bikash Chandra
2. S. Sudarshan
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 166 | Approximate Frequency Counts over Data Streams | 2002 | VLDB | 0.00039361552 |
| 288 | Storm @Twitter | 2014 | SIGMOD | 0.00028939871 |
| 328 | An Architecture for Parallel Topic Models | 2010 | VLDB | 0.0002728514 |
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 835 | Finding Frequent Items in Data Streams | 2008 | VLDB | 0.00016109621 |
| 2,605 | Muppet: MapReduce-Style Processing of Fast Data | 2012 | VLDB | 0.000084646171 |
| 3,308 | Automatic Partitioning of Database Applications | 2012 | VLDB | 0.000072422925 |
| 4,943 | Lifting the Burden of History from Adaptive Query Processing | 2004 | VLDB | 0.000058170713 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 11,358 | Scaling Equi-Joins | 2022 | SIGMOD | 0.000041945683 |
| 7,153 | Submodularity of Distributed Join Computation | 2018 | SIGMOD | 0.000048153963 |
| 3,443 | Distributed Join Algorithms on Thousands of Cores | 2017 | VLDB | 0.000070887214 |
| 2,640 | Design and Evaluation of Parallel Pipelined Join Algorithms | 1987 | SIGMOD | 0.000083924401 |
| 6,619 | Near-Optimal Distributed Band-Joins through Recursive Partitioning | 2020 | SIGMOD | 0.000049910152 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 3,382 | Scalable and Adaptive Online Joins | 2014 | VLDB | 0.000071597145 |
| 447 | Efficient Parallel Set-Similarity Joins Using MapReduce | 2010 | SIGMOD | 0.00022900171 |
| 11,890 | Let's Rethink Join Optimization in Distributed Systems | 2015 | CIDR | 0.000041945683 |