Database Paper Browser

Back to papers

Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort

Summary: Bandwidth-oblivious SIMD sort for CPU/GPU; competitive analysis across SIMD, radix, and merge approaches. Proposes CPU radix sort and GPU merge sort ~2× faster than prior work; radix dominates on current HW, GPU advantage narrows; merge sort favors large-key cardinalities. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4257
Venue
SIGMOD
Year
2010
Pagerank
0.00015238545
Overall Rank
930 | 93.54%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 37 of 37 citing papers.

Rank Citing Paper Year Venue Pagerank
404 Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited 2014 VLDB 0.00024143076
958 Rethinking SIMD Vectorization for In-Memory Databases 2015 SIGMOD 0.00015045316
1,273 The Yin and Yang of Processing Data Warehousing Queries on GPU Devices 2013 VLDB 0.00012912938
1,287 Hardware-Oblivious Parallelism for In-Memory Column-Stores 2013 VLDB 0.00012820443
1,607 A Comprehensive Study of Main-Memory Partitioning and its Application to Large-Scale Comparison- and Radix-Sort 2014 SIGMOD 0.00011162682
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
2,006 PALM: Parallel Architecture-Friendly Latch-Free Modifications to B+ Trees on Many-Core Processors 2011 VLDB 9.8101551e-05
2,040 A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics 2020 SIGMOD 9.7057698e-05
2,330 Concurrent Analytical Query Processing with GPUs 2014 VLDB 9.0192228e-05
2,526 Track Join: Distributed Joins with Minimal Network Traffic 2014 SIGMOD 8.5968612e-05
2,870 Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing 2013 VLDB 7.9799783e-05
3,151 A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs 2017 SIGMOD 7.4720668e-05
3,655 CloudRAMSort: Fast and Efficient Large-Scale Distributed RAM Sort on Shared-Nothing Cluster 2012 SIGMOD 6.8718304e-05
3,721 To Partition, or Not to Partition, That is the Join Question in a Real System 2021 SIGMOD 6.8179379e-05
3,993 Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach 2015 VLDB 6.5534805e-05
4,042 PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort 2015 VLDB 6.5026989e-05
4,085 In-Cache Query Co-Processing on Coupled CPU-GPU Architectures 2015 VLDB 6.4620277e-05
4,097 The Case for a Learned Sorting Algorithm 2020 SIGMOD 6.4551616e-05
4,610 Deployment of Query Plans on Multicores 2015 VLDB 6.0516573e-05
4,655 SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures 2015 VLDB 6.0221672e-05
5,178 FPGA-based Data Partitioning 2017 SIGMOD 5.6438393e-05
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
5,653 On the Surprising Difficulty of Simple Things: the Case of Radix Partitioning 2015 VLDB 5.3889513e-05
5,814 Towards a Hybrid Design for Fast Query Processing in DB2 with BLU Acceleration Using Graphical Processing Units: A Technology Demonstration 2016 SIGMOD 5.3167137e-05
6,223 Distributed GPU Joins on Fast RDMA-capable Networks 2023 SIGMOD 5.1496398e-05
6,525 Database Technology for the Masses: Sub-Operators as First-Class Entities 2021 VLDB 5.027205e-05
6,540 Data Partitioning for In-Memory Systems: Myths, Challenges, and Opportunities 2019 CIDR 5.0219214e-05
7,097 Fast Multi-Column Sorting in Main-Memory Column-Stores 2016 SIGMOD 4.8336115e-05
7,155 Evaluating Multi-GPU Sorting with Modern Interconnects 2022 SIGMOD 4.8149812e-05
7,335 MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model 2020 VLDB 4.7603723e-05
8,468 Inferray: fast in-memory RDF inference 2016 VLDB 4.504284e-05
8,626 Adaptive Code Generation for Data-Intensive Analytics 2021 VLDB 4.4829152e-05
8,649 Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs 2024 SIGMOD 4.4762914e-05
9,823 Thriving in the No Man’s Land between Compilers and Databases 2019 CIDR 4.2754485e-05
9,838 Efficiently Joining Large Relations on Multi-GPU Systems 2025 VLDB 4.2740344e-05
11,381 Origami: A High-Performance Mergesort Framework 2022 VLDB 4.1945683e-05
12,098 Permuting Data on Random-Access Block Storage 2013 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers