Database Paper Browser

Back to papers

To Partition, or Not to Partition, That is the Join Question in a Real System

Summary: Assesses whether radix join should be integrated into a real code-generating DBMS (Umbra) with a Bloom-filter semi-join reducer. TPC-H/microbenchmarks show radix join helps only 1 of 59 joins; partitioning gains vanish outside tight settings, and late materialization rarely helps. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6111
Venue
SIGMOD
Year
2021
Pagerank
6.8179379e-05
Overall Rank
3,721 | 74.12%
DOI
10.1145/3448016.3452831

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 28 of 28 citing papers.

Rank Citing Paper Year Venue Pagerank
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
6,302 Diva: Making MVCC Systems HTAP-Friendly 2022 SIGMOD 5.1215989e-05
6,524 The 3D Hash Join: Building On Non-Unique Join Attributes 2022 CIDR 5.0274964e-05
6,525 Database Technology for the Masses: Sub-Operators as First-Class Entities 2021 VLDB 5.027205e-05
7,546 Is Perfect Hashing Practical for OLAP Systems? 2024 CIDR 4.7148429e-05
7,667 Fast Detection of Denial Constraint Violations 2022 VLDB 4.683767e-05
7,751 Efficiently Processing Joins and Grouped Aggregations on GPUs 2025 SIGMOD 4.6603427e-05
7,836 NOCAP: Near-Optimal Correlation-Aware Partitioning Joins 2023 SIGMOD 4.6380835e-05
8,023 Design Trade-offs for a Robust Dynamic Hybrid Hash Join 2022 VLDB 4.6035454e-05
8,051 Building Advanced SQL Analytics From Low-Level Plan Operators 2021 SIGMOD 4.5969549e-05
8,417 The Case for Learned In-Memory Joins 2023 VLDB 4.5194164e-05
8,478 Analyzing Vectorized Hash Tables Across CPU Architectures 2023 VLDB 4.5015937e-05
8,514 UPLIFT: Parallelization Strategies for Feature Transformations in Machine Learning Workloads 2022 VLDB 4.4944285e-05
8,680 A Practical Approach to Groupjoin and Nested Aggregates 2021 VLDB 4.4694927e-05
8,855 A Design Space Exploration and Evaluation for Main-Memory Hash Joins in Storage Class Memory 2023 VLDB 4.4348906e-05
9,142 Design and Analysis of a Processing-in-DIMM Join Algorithm: A Case Study with UPMEM DIMMs 2023 SIGMOD 4.3853149e-05
9,743 Databases in the Era of Memory-Centric Computing 2025 CIDR 4.2897489e-05
9,838 Efficiently Joining Large Relations on Multi-GPU Systems 2025 VLDB 4.2740344e-05
9,967 Hash Joins Meet CXL: A Fresh Look 2026 CIDR 4.1945683e-05
10,063 Counting Is All You Need for Instant Tuple Discovery: Enabling Real-Time HTAP in Standalone DBMSs 2026 SIGMOD 4.1945683e-05
10,295 Global Hash Tables Strike Back! An Analysis of Parallel GROUP BY Aggregation 2026 VLDB 4.1945683e-05
10,372 Data Chunk Compaction in Vectorized Execution 2025 SIGMOD 4.1945683e-05
10,494 Nested Parquet Is Flat, Why Not Use It? How To Scan Nested Data With On-the-Fly Key Generation and Joins 2025 SIGMOD 4.1945683e-05
10,635 Saving Private Hash Join 2025 VLDB 4.1945683e-05
10,756 Selective Late Materialization in Modern Analytical Databases 2025 VLDB 4.1945683e-05
10,989 High-Performance Query Processing with NVMe Arrays: Spilling without Killing Performance 2024 SIGMOD 4.1945683e-05
10,993 SPID-Join: A Skew-resistant Processing-in-DIMM Join Algorithm Exploiting the Bank- and Rank-level Parallelisms of DIMMs 2024 SIGMOD 4.1945683e-05
11,358 Scaling Equi-Joins 2022 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 27 of 27 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
81 Cache Conscious Algorithms for Relational Query Processing 1994 VLDB 0.00055548574
351 Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs 2009 VLDB 0.0002636504
404 Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited 2014 VLDB 0.00024143076
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
540 Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs 2011 SIGMOD 0.0002063443
585 Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems 2012 VLDB 0.00019706145
735 Umbra: A Disk-Based System with In-Memory Performance 2020 CIDR 0.00017452467
930 Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort 2010 SIGMOD 0.00015238545
958 Rethinking SIMD Vectorization for In-Memory Databases 2015 SIGMOD 0.00015045316
1,016 Memory-Efficient Hash Joins 2015 VLDB 0.00014638492
1,079 What happens during a Join? Dissecting CPU and Memory Optimization Effects 2000 VLDB 0.00014233415
1,263 Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation 2016 SIGMOD 0.00012982857
1,607 A Comprehensive Study of Main-Memory Partitioning and its Application to Large-Scale Comparison- and Radix-Sort 2014 SIGMOD 0.00011162682
1,696 A Seven-Dimensional Analysis of Hashing Methods and its Implications on Query Processing 2016 VLDB 0.00010881034
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
1,864 Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last 2018 VLDB 0.00010280966
2,014 Voodoo - A Vector Algebra for Portable Database Performance on Modern Hardware 2016 VLDB 9.7904029e-05
2,916 Quantifying TPC-H Choke Points and Their Optimizations 2020 VLDB 7.9068048e-05
3,722 Cache-Conscious Radix-Decluster Projections 2004 VLDB 6.8176075e-05
4,158 Performance-Optimal Filtering: Bloom Overtakes Cuckoo at High Throughput 2019 VLDB 6.3994318e-05
5,087 Accelerating Queries with Group-By and Join by Groupjoin 2011 VLDB 5.7075009e-05
5,178 FPGA-based Data Partitioning 2017 SIGMOD 5.6438393e-05
5,653 On the Surprising Difficulty of Simple Things: the Case of Radix Partitioning 2015 VLDB 5.3889513e-05
6,540 Data Partitioning for In-Memory Systems: Myths, Challenges, and Opportunities 2019 CIDR 5.0219214e-05
9,299 Engineering High-Performance Database Engines 2014 VLDB 4.3587894e-05
Previous Page 1 / 1 Next

Semantically Similar Papers