Database Paper Browser

Evaluating Multi-GPU Sorting with Modern Interconnects

Summary: Evaluates multi-GPU sorting across PCIe, NVLink, and NVSwitch interconnects. Proposes a GPU-only peer-to-peer (P2P) sort and a heterogeneous (HET) sort, both benchmarked on three modern platforms. Reports up to 35x higher P2P throughput with NVSwitch, and speedups over CPU radix sort of up to 14x (P2P) and 9x (HET). On fast interconnects, P2P beats HET by ~1.65x, and copy/compute overlap does not hide transfer bottlenecks. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6294
Venue: SIGMOD
Year: 2022
Pagerank: 4.8149812e-05
Overall Rank: 7,155 | 50.23%
DOI: 10.1145/3514221.3517842

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
305 | SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units | 2009 | VLDB | 0.00028248614
396 | One Trillion Edges: Graph Processing at Facebook-Scale | 2015 | VLDB | 0.00024424102
404 | Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited | 2014 | VLDB | 0.00024143076
426 | Amazon Redshift and the Case for Simpler Data Warehouses | 2015 | SIGMOD | 0.00023594359
585 | Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems | 2012 | VLDB | 0.00019706145
596 | HYRISE—A Main Memory Hybrid Storage Engine | 2011 | VLDB | 0.00019481482
858 | Efficient Transaction Processing in SAP HANA Database – The End of a Column Store Myth | 2012 | SIGMOD | 0.000158756
930 | Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort | 2010 | SIGMOD | 0.00015238545
1,607 | A Comprehensive Study of Main-Memory Partitioning and its Application to Large-Scale Comparison- and Radix-Sort | 2014 | SIGMOD | 0.00011162682
2,040 | A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics | 2020 | SIGMOD | 9.7057698e-05
2,165 | Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation | 2015 | SIGMOD | 9.389622e-05
3,151 | A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs | 2017 | SIGMOD | 7.4720668e-05
3,327 | Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects | 2020 | SIGMOD | 7.2205738e-05
3,898 | Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment | 2021 | VLDB | 6.6551268e-05
4,002 | MG-Join: A Scalable Join for Massively Parallel Multi-GPU Architectures | 2021 | SIGMOD | 6.545665e-05
4,042 | PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort | 2015 | VLDB | 6.5026989e-05
4,655 | SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures | 2015 | VLDB | 6.0221672e-05
7,209 | GPU-accelerated data management under the test of time | 2020 | CIDR | 4.7996023e-05

Semantically Similar Papers