Database Paper Browser

Back to papers

Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited

Summary: Empirical re-evaluation of in-memory, multi-core joins comparing sort-merge (NUMA-aware) and radix-hash. Radix-hash remains superior; sort-merge only matches it at very large data volumes; fastest implementations and hardware-conscious guidance are provided. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10962
Venue
VLDB
Year
2014
Pagerank
0.00024143076
Overall Rank
404 | 97.20%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 81 citing papers.

Rank Citing Paper Year Venue Pagerank
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
958 Rethinking SIMD Vectorization for In-Memory Databases 2015 SIGMOD 0.00015045316
1,016 Memory-Efficient Hash Joins 2015 VLDB 0.00014638492
1,044 DimmWitted: A Study of Main-Memory Statistical Analytics 2014 VLDB 0.00014475229
1,206 Rack-Scale In-Memory Join Processing using RDMA 2015 SIGMOD 0.00013281657
1,269 Cache locality is not enough: High-Performance Nearest Neighbor Search with Product Quantization Fast Scan 2016 VLDB 0.00012930432
1,361 The End of Slow Networks: It's Time for a Redesign 2016 VLDB 0.00012379741
1,607 A Comprehensive Study of Main-Memory Partitioning and its Application to Large-Scale Comparison- and Radix-Sort 2014 SIGMOD 0.00011162682
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
2,040 A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics 2020 SIGMOD 9.7057698e-05
2,390 ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout 2015 SIGMOD 8.9084657e-05
2,526 Track Join: Distributed Joins with Minimal Network Traffic 2014 SIGMOD 8.5968612e-05
2,742 Cache-Efficient Aggregation: Hashing Is Sorting 2015 SIGMOD 8.1906104e-05
2,757 Parallel Data Analysis Directly on Scientific File Formats 2014 SIGMOD 8.1679384e-05
3,151 A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs 2017 SIGMOD 7.4720668e-05
3,443 Distributed Join Algorithms on Thousands of Cores 2017 VLDB 7.0887214e-05
3,546 Extracting Top-K Insights from Multi-dimensional Data 2017 SIGMOD 6.9870745e-05
3,721 To Partition, or Not to Partition, That is the Join Question in a Real System 2021 SIGMOD 6.8179379e-05
3,885 Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations 2022 VLDB 6.6674822e-05
3,898 Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment 2021 VLDB 6.6551268e-05
3,933 RUMA has it: Rewired User-space Memory Access is Possible! 2016 VLDB 6.617914e-05
3,993 Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach 2015 VLDB 6.5534805e-05
4,042 PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort 2015 VLDB 6.5026989e-05
4,097 The Case for a Learned Sorting Algorithm 2020 SIGMOD 6.4551616e-05
4,281 Maximizing Persistent Memory Bandwidth Utilization for OLAP Workloads 2021 SIGMOD 6.2940039e-05
4,282 Scaling Up Concurrent Main-Memory Column-Store Scans: Towards Adaptive NUMA-aware Data and Task Placement 2015 VLDB 6.293052e-05
4,326 Fast Queries Over Heterogeneous Data Through Engine Customization 2016 VLDB 6.288323e-05
4,363 Hardware-conscious Query Processing in GPU-accelerated Analytical Engines 2019 CIDR 6.2552614e-05
4,655 SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures 2015 VLDB 6.0221672e-05
4,701 Tensors: An abstraction for general data processing 2021 VLDB 5.9866564e-05
5,109 Adaptive NUMA-aware data placement and task scheduling for analytical workloads in main-memory column-stores 2017 VLDB 5.6908086e-05
5,125 The Art of Balance: A RateupDBTM Experience of Building a CPU/GPU Hybrid Database Product 2021 VLDB 5.679423e-05
5,178 FPGA-based Data Partitioning 2017 SIGMOD 5.6438393e-05
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
5,293 MQJoin: Efficient Shared Execution of Main-Memory Joins 2016 VLDB 5.5815698e-05
5,376 Holistic Indexing in Main-memory Column-stores 2015 SIGMOD 5.5417421e-05
5,653 On the Surprising Difficulty of Simple Things: the Case of Radix Partitioning 2015 VLDB 5.3889513e-05
5,721 FPGA-based Multithreading for In-Memory Hash Joins 2015 CIDR 5.3525009e-05
5,784 What Is the Price for Joining Securely? Benchmarking Equi-Joins in Trusted Execution Environments 2022 VLDB 5.328804e-05
6,221 Charting the Design Space of Query Execution using VOILA 2021 VLDB 5.1512158e-05
6,223 Distributed GPU Joins on Fast RDMA-capable Networks 2023 SIGMOD 5.1496398e-05
6,304 Elastic Pipelining in an In-Memory Database Cluster 2016 SIGMOD 5.1210182e-05
6,434 Patience is a Virtue: Revisiting Merge and Sort on Modern Processors 2014 SIGMOD 5.0640194e-05
6,524 The 3D Hash Join: Building On Non-Unique Join Attributes 2022 CIDR 5.0274964e-05
6,540 Data Partitioning for In-Memory Systems: Myths, Challenges, and Opportunities 2019 CIDR 5.0219214e-05
6,964 A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores 2019 VLDB 4.8815971e-05
7,097 Fast Multi-Column Sorting in Main-Memory Column-Stores 2016 SIGMOD 4.8336115e-05
7,155 Evaluating Multi-GPU Sorting with Modern Interconnects 2022 SIGMOD 4.8149812e-05
7,335 MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model 2020 VLDB 4.7603723e-05
7,819 Main Memory Adaptive Denormalization 2016 SIGMOD 4.6432769e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers