Database Paper Browser

Back to papers

A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins

Summary: Taxonomy of data-skew effects in parallel joins and a new performance model for skew-aware evaluation. Distinguishes skew causes and characteristics to enable fair cross-algorithm comparisons, and uses the model to contrast two parallel join approaches. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8043
Venue
VLDB
Year
1991
Pagerank
0.00015848554
Overall Rank
861 | 94.02%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 23 of 23 citing papers.

Rank Citing Paper Year Venue Pagerank
588 Practical Skew Handling in Parallel Joins 1992 VLDB 0.00019604754
679 Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems 2012 SIGMOD 0.00018215154
871 Building a Scalable Geo-Spatial DBMS: Technology, Implementation, and Evaluation 1997 SIGMOD 0.00015767786
1,334 SkewTune: Mitigating Skew in MapReduce Applications 2012 SIGMOD 0.0001250413
1,674 Adaptive Parallel Aggregation Algorithms 1995 SIGMOD 0.0001094787
1,915 Handling Data Skew in Parallel Joins in Shared-Nothing Systems 2008 SIGMOD 0.00010104123
2,212 Skew in Parallel Query Processing 2014 PODS 9.2771827e-05
2,417 Dynamic Load Balancing in Hierarchical Parallel Database Systems 1996 VLDB 8.8604775e-05
2,963 Distributed File Organization with Scalable Cost/Performance 1994 SIGMOD 7.8097631e-05
3,893 Estimation of Query-Result Distribution and its Application in Parallel-Join Load Balancing 1996 VLDB 6.6584217e-05
3,899 Using Shared Virtual Memory for Parallel Join Processing 1993 SIGMOD 6.6538884e-05
4,135 Analysis of Dynamic Load Balancing Strategies for Parallel Shared Nothing Database Systems 1993 VLDB 6.4189164e-05
4,214 Dynamic Multi-Resource Load Balancing in Parallel Database Systems 1995 VLDB 6.3541e-05
4,483 DFI: The Data Flow Interface for High-Speed Networks 2021 SIGMOD 6.148188e-05
5,532 A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew 2015 SIGMOD 5.4548897e-05
5,568 Efficient outer join data skew handling in parallel DBMS 2009 VLDB 5.4301489e-05
5,960 Skew-Aware Join Optimization for Array Databases 2015 SIGMOD 5.2559595e-05
6,214 Skew Handling Techniques in Sort-Merge Join 2002 SIGMOD 5.1546943e-05
6,619 Near-Optimal Distributed Band-Joins through Recursive Partitioning 2020 SIGMOD 4.9910152e-05
6,741 DEX: Scalable Range Indexing on Disaggregated Memory 2024 VLDB 4.9432931e-05
7,153 Submodularity of Distributed Join Computation 2018 SIGMOD 4.8153963e-05
7,836 NOCAP: Near-Optimal Correlation-Aware Partitioning Joins 2023 SIGMOD 4.6380835e-05
8,978 SpongeFiles: Mitigating Data Skew in MapReduce Using Distributed Memory 2014 SIGMOD 4.417225e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers