Database Paper Browser

Handling Data Skew in Parallel Joins in Shared-Nothing Systems

Summary: Introduces PRPD (Partial Redistribution & Partial Duplication) to mitigate data skew in parallel joins in shared-nothing DBMSs. Demonstrates significant speedups and higher throughput under skew by reducing hot spots and balancing the workload in high-concurrency data warehouses. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4052
Venue: SIGMOD
Year: 2008
Pagerank: 0.00010104123
Overall Rank: 1,915 | 86.68%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 18 of 18 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
502 | Worst-case Optimal Join Algorithms | 2012 | PODS | 0.00021526612
1,110 | Parallel Evaluation of Conjunctive Queries | 2011 | PODS | 0.00013968198
1,409 | High-Speed Query Processing over High-Speed Networks | 2016 | VLDB | 0.00012132768
2,212 | Skew in Parallel Query Processing | 2014 | PODS | 9.2771827e-05
3,021 | Adaptive and Big Data Scale Parallel Execution in Oracle | 2013 | VLDB | 7.6991391e-05
4,132 | Advanced Join Strategies for Large-Scale Distributed Computation | 2014 | VLDB | 6.4241067e-05
5,532 | A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew | 2015 | SIGMOD | 5.4548897e-05
5,568 | Efficient outer join data skew handling in parallel DBMS | 2009 | VLDB | 5.4301489e-05
6,619 | Near-Optimal Distributed Band-Joins through Recursive Partitioning | 2020 | SIGMOD | 4.9910152e-05
7,059 | Adaptive and Robust Query Execution for Lakehouses at Scale | 2024 | VLDB | 4.8477825e-05
7,060 | SquirrelJoin: Network-Aware Distributed Join Processing with Lazy Partitioning | 2017 | VLDB | 4.8465382e-05
7,153 | Submodularity of Distributed Join Computation | 2018 | SIGMOD | 4.8153963e-05
7,836 | NOCAP: Near-Optimal Correlation-Aware Partitioning Joins | 2023 | SIGMOD | 4.6380835e-05
7,913 | Resource Bricolage for Parallel Database Systems | 2015 | VLDB | 4.6180739e-05
8,997 | Chasing Similarity: Distribution-aware Aggregation Scheduling | 2019 | VLDB | 4.4120041e-05
10,993 | SPID-Join: A Skew-resistant Processing-in-DIMM Join Algorithm Exploiting the Bank- and Rank-level Parallelisms of DIMMs | 2024 | SIGMOD | 4.1945683e-05
11,358 | Scaling Equi-Joins | 2022 | SIGMOD | 4.1945683e-05
11,531 | Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters | 2021 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

