Database Paper Browser

Back to papers

Practical Skew Handling in Parallel Joins

Summary: Presents a skew-handling framework for parallel joins using a portfolio of four algorithms chosen from a sample. Virtual processor range partitioning excels under high skew; hybrid hash wins under low/no skew. Gamma-based results deliver first implementation skew metrics. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8053
Venue
VLDB
Year
1992
Pagerank
0.00019604754
Overall Rank
588 | 95.92%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 37 of 37 citing papers.

Rank Citing Paper Year Venue Pagerank
540 Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs 2011 SIGMOD 0.0002063443
585 Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems 2012 VLDB 0.00019706145
780 Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience 2009 VLDB 0.00016775082
871 Building a Scalable Geo-Spatial DBMS: Technology, Implementation, and Evaluation 1997 SIGMOD 0.00015767786
925 Partition Based Spatial-Merge Join 1996 SIGMOD 0.00015264328
1,074 Processing Theta-Joins using MapReduce* 2011 SIGMOD 0.00014260096
1,334 SkewTune: Mitigating Skew in MapReduce Applications 2012 SIGMOD 0.0001250413
1,789 Reducing the Braking Distance of an SQL Query Engine 1998 VLDB 0.00010555087
1,915 Handling Data Skew in Parallel Joins in Shared-Nothing Systems 2008 SIGMOD 0.00010104123
2,417 Dynamic Load Balancing in Hierarchical Parallel Database Systems 1996 VLDB 8.8604775e-05
2,439 CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop 2011 VLDB 8.8190594e-05
3,528 Distributed Data Deduplication 2016 VLDB 7.0066139e-05
3,893 Estimation of Query-Result Distribution and its Application in Parallel-Join Load Balancing 1996 VLDB 6.6584217e-05
3,899 Using Shared Virtual Memory for Parallel Join Processing 1993 SIGMOD 6.6538884e-05
4,132 Advanced Join Strategies for Large-Scale Distributed Computation 2014 VLDB 6.4241067e-05
4,135 Analysis of Dynamic Load Balancing Strategies for Parallel Shared Nothing Database Systems 1993 VLDB 6.4189164e-05
4,403 A Framework for Adversarially Robust Streaming Algorithms 2020 PODS 6.2194225e-05
5,049 Run-Time Operator State Spilling for Memory Intensive Long-Running Queries 2006 SIGMOD 5.7372423e-05
5,532 A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew 2015 SIGMOD 5.4548897e-05
5,568 Efficient outer join data skew handling in parallel DBMS 2009 VLDB 5.4301489e-05
5,960 Skew-Aware Join Optimization for Array Databases 2015 SIGMOD 5.2559595e-05
6,214 Skew Handling Techniques in Sort-Merge Join 2002 SIGMOD 5.1546943e-05
6,324 Revisiting Pipelined Parallelism in Multi-Join Query Processing 2005 VLDB 5.1109987e-05
6,524 The 3D Hash Join: Building On Non-Unique Join Attributes 2022 CIDR 5.0274964e-05
6,619 Near-Optimal Distributed Band-Joins through Recursive Partitioning 2020 SIGMOD 4.9910152e-05
7,153 Submodularity of Distributed Join Computation 2018 SIGMOD 4.8153963e-05
7,836 NOCAP: Near-Optimal Correlation-Aware Partitioning Joins 2023 SIGMOD 4.6380835e-05
7,913 Resource Bricolage for Parallel Database Systems 2015 VLDB 4.6180739e-05
8,462 Topology-aware Parallel Data Processing: Models, Algorithms and Systems at Scale 2020 CIDR 4.5056381e-05
8,978 SpongeFiles: Mitigating Data Skew in MapReduce Using Distributed Memory 2014 SIGMOD 4.417225e-05
8,997 Chasing Similarity: Distribution-aware Aggregation Scheduling 2019 VLDB 4.4120041e-05
10,488 HoneyComb: A Parallel Worst-Case Optimal Join on Multicores 2025 SIGMOD 4.1945683e-05
10,993 SPID-Join: A Skew-resistant Processing-in-DIMM Join Algorithm Exploiting the Bank- and Rank-level Parallelisms of DIMMs 2024 SIGMOD 4.1945683e-05
11,332 The White-Box Adversarial Data Stream Model 2022 PODS 4.1945683e-05
11,358 Scaling Equi-Joins 2022 SIGMOD 4.1945683e-05
11,797 Runtime Optimization of Join Location in Parallel Data Management Systems 2017 VLDB 4.1945683e-05
12,101 Optimization Strategies for A/B Testing on HADOOP 2013 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 11 of 11 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers