Efficient outer join data skew handling in parallel DBMS
Summary: Proposes OJSO (Outer Join Skew Optimization), a simple, efficient algorithm to handle data skew in parallel outer joins and improve load balance in PDBMS. First study for parallel outer-join skew; demonstrates substantial query-time speedups under skew in experiments. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yu Xu
- 2. Pekka Kostamaa
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,334 | SkewTune: Mitigating Skew in MapReduce Applications | 2012 | SIGMOD | 0.0001250413 |
| 4,132 | Advanced Join Strategies for Large-Scale Distributed Computation | 2014 | VLDB | 6.4241067e-05 |
| 5,532 | A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew | 2015 | SIGMOD | 5.4548897e-05 |
| 7,913 | Resource Bricolage for Parallel Database Systems | 2015 | VLDB | 4.6180739e-05 |
| 11,531 | Fangorn: Adaptive Execution Framework for Heterogeneous Workloads on Shared Clusters | 2021 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 861 | A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins | 1991 | VLDB | 0.00015848554 |
| 1,232 | Bucket Spreading Parallel Hash: A New, Robust, Parallel Hash Join Method for Data Skew in the Super Database Computer (SDC) | 1990 | VLDB | 0.00013147188 |
| 1,365 | Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning | 1991 | VLDB | 0.00012368421 |
| 1,915 | Handling Data Skew in Parallel Joins in Shared-Nothing Systems | 2008 | SIGMOD | 0.00010104123 |
| 3,899 | Using Shared Virtual Memory for Parallel Join Processing | 1993 | SIGMOD | 6.6538884e-05 |
| 6,166 | View Matching for Outer-Join Views | 2005 | VLDB | 5.1724475e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,044 | Optimization of Multi-Way Join Queries for Parallel Execution | 1991 | VLDB | 9.6953608e-05 |
| 13,032 | Using Semiouterjoins to Process Queries in Multidatabase Systems | 1984 | PODS | 4.1945683e-05 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 5,960 | Skew-Aware Join Optimization for Array Databases | 2015 | SIGMOD | 5.2559595e-05 |
| 679 | Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems | 2012 | SIGMOD | 0.00018215154 |
| 1,365 | Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning | 1991 | VLDB | 0.00012368421 |
| 1,915 | Handling Data Skew in Parallel Joins in Shared-Nothing Systems | 2008 | SIGMOD | 0.00010104123 |
| 6,214 | Skew Handling Techniques in Sort-Merge Join | 2002 | SIGMOD | 5.1546943e-05 |
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 861 | A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins | 1991 | VLDB | 0.00015848554 |