Database Paper Browser

Efficiently Joining Large Relations on Multi-GPU Systems

Summary: A heterogeneous multi-GPU sort-merge join that leverages NVLink/NVSwitch peer-to-peer (P2P) transfers, handles out-of-core data, and combines GPU P2P partitioning, a CPU multiway merge, and a hybrid CPU/GPU join strategy. It outperforms CPU and GPU baselines (up to 15.2x vs. a CPU sort-merge join, 5.5x vs. radix, and up to 8.7x vs. non-P2P multi-GPU), scales with GPU count, and keeps the overhead of overlapped copy and compute low. (summarized by gpt-5-mini on Feb 09 2026)
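
The pipeline named in the summary (sort chunks, multiway-merge the sorted runs, then merge-join) can be illustrated with a minimal single-threaded sketch. This is not the paper's implementation: it runs on the CPU only, the chunking merely stands in for per-GPU sorting, and every name here (`Row`, `sorted_runs`, `multiway_merge`, `merge_join`, `sort_merge_join`, `num_runs`) is a hypothetical chosen for illustration.

```python
import heapq
from typing import List, Tuple

Row = Tuple[int, str]  # (join key, payload); hypothetical row layout


def sorted_runs(rows: List[Row], num_runs: int) -> List[List[Row]]:
    """Split rows into chunks and sort each one (stand-in for per-GPU sorting)."""
    size = max(1, -(-len(rows) // num_runs))  # ceiling division
    return [sorted(rows[i:i + size]) for i in range(0, len(rows), size)]


def multiway_merge(runs: List[List[Row]]) -> List[Row]:
    """Merge any number of sorted runs into one sorted list."""
    return list(heapq.merge(*runs))


def merge_join(left: List[Row], right: List[Row]) -> List[Tuple[int, str, str]]:
    """Equi-join two key-sorted inputs, emitting all pairs for duplicate keys."""
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][0], right[j][0]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the extent of the equal-key group on each side.
            i_end = i
            while i_end < len(left) and left[i_end][0] == lk:
                i_end += 1
            j_end = j
            while j_end < len(right) and right[j_end][0] == lk:
                j_end += 1
            for a in range(i, i_end):
                for b in range(j, j_end):
                    out.append((lk, left[a][1], right[b][1]))
            i, j = i_end, j_end
    return out


def sort_merge_join(left: List[Row], right: List[Row], num_runs: int = 2):
    """Sort both inputs as merged runs, then merge-join them."""
    return merge_join(
        multiway_merge(sorted_runs(left, num_runs)),
        multiway_merge(sorted_runs(right, num_runs)),
    )
```

For example, `sort_merge_join([(2, "b"), (1, "a")], [(2, "x"), (3, "y")])` pairs only the rows that share key 2. The sketch omits everything specific to the paper, such as P2P data exchange between GPUs, out-of-core handling, and the hybrid CPU/GPU join.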

Paper ID: 14074
Venue: VLDB
Year: 2025
Pagerank: 4.2740344e-05
Overall Rank: 9,838 | 31.56%
DOI: 10.14778/3749646.3749720

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
7,916 | Terabyte-Scale Analytics in the Blink of an Eye | 2026 | VLDB | 4.6173899e-05

Outgoing Citations (Sorted by Pagerank)

Showing 32 of 32 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
145 | Quickly Generating Billion-Record Synthetic Databases | 1994 | SIGMOD | 0.0004138408
351 | Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs | 2009 | VLDB | 0.0002636504
404 | Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited | 2014 | VLDB | 0.00024143076
426 | Amazon Redshift and the Case for Simpler Data Warehouses | 2015 | SIGMOD | 0.00023594359
540 | Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs | 2011 | SIGMOD | 0.0002063443
585 | Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems | 2012 | VLDB | 0.00019706145
775 | Relational Joins on Graphics Processors | 2008 | SIGMOD | 0.00016823862
811 | On the Relative Cost of Sampling for Join Selectivity Estimation | 1994 | PODS | 0.00016425612
907 | Fundamental Techniques for Order Optimization | 1996 | SIGMOD | 0.00015435983
930 | Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort | 2010 | SIGMOD | 0.00015238545
946 | Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture | 2008 | VLDB | 0.0001513324
958 | Rethinking SIMD Vectorization for In-Memory Databases | 2015 | SIGMOD | 0.00015045316
1,016 | Memory-Efficient Hash Joins | 2015 | VLDB | 0.00014638492
1,255 | Fixed-Precision Estimation of Join Selectivity | 1993 | PODS | 0.00013024064
1,273 | The Yin and Yang of Processing Data Warehousing Queries on GPU Devices | 2013 | VLDB | 0.00012912938
1,804 | An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory | 2016 | SIGMOD | 0.00010501185
2,040 | A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics | 2020 | SIGMOD | 9.7057698e-05
2,287 | Pipelined Query Processing in Coprocessor Environments | 2018 | SIGMOD | 9.0972606e-05
2,526 | Track Join: Distributed Joins with Minimal Network Traffic | 2014 | SIGMOD | 8.5968612e-05
3,151 | A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs | 2017 | SIGMOD | 7.4720668e-05
3,327 | Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects | 2020 | SIGMOD | 7.2205738e-05
3,443 | Distributed Join Algorithms on Thousands of Cores | 2017 | VLDB | 7.0887214e-05
3,721 | To Partition, or Not to Partition, That is the Join Question in a Real System | 2021 | SIGMOD | 6.8179379e-05
3,898 | Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment | 2021 | VLDB | 6.6551268e-05
4,002 | MG-Join: A Scalable Join for Massively Parallel Multi-GPU Architectures | 2021 | SIGMOD | 6.545665e-05
4,655 | SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures | 2015 | VLDB | 6.0221672e-05
5,178 | FPGA-based Data Partitioning | 2017 | SIGMOD | 5.6438393e-05
5,247 | Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects | 2022 | SIGMOD | 5.6057839e-05
5,721 | FPGA-based Multithreading for In-Memory Hash Joins | 2015 | CIDR | 5.3525009e-05
6,223 | Distributed GPU Joins on Fast RDMA-capable Networks | 2023 | SIGMOD | 5.1496398e-05
7,155 | Evaluating Multi-GPU Sorting with Modern Interconnects | 2022 | SIGMOD | 4.8149812e-05
7,751 | Efficiently Processing Joins and Grouped Aggregations on GPUs | 2025 | SIGMOD | 4.6603427e-05

Semantically Similar Papers