Handling Data Skew in Parallel Joins in Shared-Nothing Systems
Summary: Introduces PRPD (Partial Redistribution & Partial Duplication) to mitigate data skew in parallel joins on shared-nothing DBMS. Demonstrates significant speedups and higher throughput under skew by reducing hot-spotting and balancing workload in high-concurrency data warehouses. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yu Xu
- 2. Pekka Kostamaa
- 3. Xin Zhou
- 4. Liang Chen
Incoming Citations (Sorted by Pagerank)
Showing 18 of 18 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 861 | A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins | 1991 | VLDB | 0.00015848554 |
| 1,232 | Bucket Spreading Parallel Hash: A New, Robust, Parallel Hash Join Method for Data Skew in the Super Database Computer (SDC) | 1990 | VLDB | 0.00013147188 |
| 1,365 | Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning | 1991 | VLDB | 0.00012368421 |
| 3,899 | Using Shared Virtual Memory for Parallel Join Processing | 1993 | SIGMOD | 6.6538884e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,674 | Adaptive Parallel Aggregation Algorithms | 1995 | SIGMOD | 0.0001094787 |
| 8,165 | Progressive Optimization in a Shared-Nothing Parallel Database | 2007 | SIGMOD | 4.5717277e-05 |
| 1,232 | Bucket Spreading Parallel Hash: A New, Robust, Parallel Hash Join Method for Data Skew in the Super Database Computer (SDC) | 1990 | VLDB | 0.00013147188 |
| 2,212 | Skew in Parallel Query Processing | 2014 | PODS | 9.2771827e-05 |
| 5,568 | Efficient outer join data skew handling in parallel DBMS | 2009 | VLDB | 5.4301489e-05 |
| 679 | Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems | 2012 | SIGMOD | 0.00018215154 |
| 3,899 | Using Shared Virtual Memory for Parallel Join Processing | 1993 | SIGMOD | 6.6538884e-05 |
| 861 | A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins | 1991 | VLDB | 0.00015848554 |
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 1,365 | Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning | 1991 | VLDB | 0.00012368421 |