Database Paper Browser

Back to papers

Relational Joins on Graphics Processors

Summary: GPU-based relational joins use data-parallel primitives (split, sort) on GPUs, leveraging random writes and inter-processor communication. Implements indexed or non-indexed nested-loop, sort-merge, and hash joins; yields 2–7× CPU speedups on NVIDIA G80. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4012
Venue
SIGMOD
Year
2008
Pagerank
0.00016823862
Overall Rank
775 | 94.62%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 47 of 47 citing papers.

Rank Citing Paper Year Venue Pagerank
351 Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs 2009 VLDB 0.0002636504
585 Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems 2012 VLDB 0.00019706145
1,273 The Yin and Yang of Processing Data Warehousing Queries on GPU Devices 2013 VLDB 0.00012912938
1,287 Hardware-Oblivious Parallelism for In-Memory Column-Stores 2013 VLDB 0.00012820443
1,881 Ibex - An Intelligent Storage Engine with Support for Advanced SQL Off-loading 2014 VLDB 0.00010217568
2,040 A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics 2020 SIGMOD 9.7057698e-05
2,067 HippogriffDB: Balancing I/O and GPU Bandwidth in Big Data Analytics 2016 VLDB 9.6392739e-05
2,330 Concurrent Analytical Query Processing with GPUs 2014 VLDB 9.0192228e-05
2,519 Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture 2013 VLDB 8.6078505e-05
2,882 Database Compression on Graphics Processors 2010 VLDB 7.9661218e-05
3,103 High-Throughput Transaction Executions on Graphics Processors 2011 VLDB 7.5586143e-05
3,327 Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects 2020 SIGMOD 7.2205738e-05
3,465 GPL: A GPU-based Pipelined Query Processing Engine 2016 SIGMOD 7.0695873e-05
3,696 Why it is time for a HyPE: A Hybrid Query Processing Engine for Efficient GPU Coprocessing in DBMS 2013 VLDB 6.834483e-05
3,777 A Hybrid B+-tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms 2016 SIGMOD 6.7750901e-05
3,898 Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment 2021 VLDB 6.6551268e-05
3,993 Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach 2015 VLDB 6.5534805e-05
4,002 MG-Join: A Scalable Join for Massively Parallel Multi-GPU Architectures 2021 SIGMOD 6.545665e-05
4,085 In-Cache Query Co-Processing on Coupled CPU-GPU Architectures 2015 VLDB 6.4620277e-05
4,999 Adaptive Work Placement for Query Processing on Heterogeneous Computing Resources 2017 VLDB 5.7752801e-05
5,019 Orchestrating Data Placement and Query Execution in Heterogeneous CPU-GPU DBMS 2022 VLDB 5.7559197e-05
5,040 Tile-based Lightweight Integer Compression in GPU 2022 SIGMOD 5.7425187e-05
5,088 TCUDB: Accelerating Database with Tensor Processors 2022 SIGMOD 5.7072189e-05
5,125 The Art of Balance: A RateupDBTM Experience of Building a CPU/GPU Hybrid Database Product 2021 VLDB 5.679423e-05
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
5,514 GPU Rasterization for Real-Time Spatial Aggregation over Arbitrary Polygons 2018 VLDB 5.4638186e-05
5,814 Towards a Hybrid Design for Fast Query Processing in DB2 with BLU Acceleration Using Graphical Processing Units: A Technology Demonstration 2016 SIGMOD 5.3167137e-05
6,223 Distributed GPU Joins on Fast RDMA-capable Networks 2023 SIGMOD 5.1496398e-05
7,377 GPUQP: Query Co-Processing Using Graphics Processors 2007 SIGMOD 4.7484565e-05
7,751 Efficiently Processing Joins and Grouped Aggregations on GPUs 2025 SIGMOD 4.6603427e-05
7,880 Thread Cooperation in Multicore Architectures for Frequency Counting over Multiple Data Streams 2009 VLDB 4.6291185e-05
8,094 Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms 2021 VLDB 4.5867812e-05
8,356 Accelerating Pathology Image Data Cross-Comparison on CPU-GPU Hybrid Systems 2012 VLDB 4.5377018e-05
8,417 The Case for Learned In-Memory Joins 2023 VLDB 4.5194164e-05
8,616 A Case for Graphics-driven Query Processing 2023 VLDB 4.4846474e-05
8,846 Scaling your Hybrid CPU-GPU DBMS to Multiple GPUs 2024 VLDB 4.4372012e-05
9,204 Themis: A GPU-accelerated Relational Query Execution Engine 2025 VLDB 4.3737475e-05
9,620 Deploying Computational Storage for HTAP DBMSs Takes More Than Just Computation Offloading 2023 VLDB 4.3169852e-05
9,837 GpJSON: High-performance JSON Data Processing on GPUs 2025 VLDB 4.2740344e-05
9,838 Efficiently Joining Large Relations on Multi-GPU Systems 2025 VLDB 4.2740344e-05
9,925 GPH: An Efficient and Effective Perfect Hashing Scheme for GPU Architectures 2025 SIGMOD 4.2543961e-05
10,253 Scalable GPU Acceleration of Scalar Functions in Analytical Databases: Compilation, Benchmarking, and Optimization 2026 VLDB 4.1945683e-05
10,749 Scaling GPU-Accelerated Databases beyond GPU Memory Size 2025 VLDB 4.1945683e-05
11,020 Accelerating Merkle Patricia Trie with GPU 2024 VLDB 4.1945683e-05
11,358 Scaling Equi-Joins 2022 SIGMOD 4.1945683e-05
11,964 Work-Efficient Parallel Skyline Computation for the GPU 2015 VLDB 4.1945683e-05
12,051 WoW: What the World of (Data) Warehousing Can Learn from the World of Warcraft 2013 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 15 of 15 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
81 Cache Conscious Algorithms for Relational Query Processing 1994 VLDB 0.00055548574
232 A Performance Evaluation of Four Parallel Join Algorithms in a Shared-Nothing Multiprocessor Environment 1989 SIGMOD 0.00032122485
238 Cache Conscious Indexing for Decision-Support in Main Memory 1999 VLDB 0.00031642425
239 GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management 2006 SIGMOD 0.00031617428
1,365 Handling Data Skew in Multiprocessor Database Computers Using Partition Tuning 1991 VLDB 0.00012368421
1,686 Fast Computation of Database Operations using Graphics Processors 2004 SIGMOD 0.00010917794
2,278 Hardware Acceleration for Spatial Selections and Joins 2003 SIGMOD 9.1218781e-05
2,619 Hash-Based Join Algorithms for Multiprocessor Computers with Shared Memory 1990 VLDB 8.4431973e-05
2,761 Hardware Acceleration in Commercial Databases: A Case Study of Spatial Operations 2004 VLDB 8.1622784e-05
2,778 Database Servers on Chip Multiprocessors: Limitations and Opportunities 2007 CIDR 8.1321802e-05
5,068 Query Co-Processing on Commodity Processors 2006 VLDB 5.72154e-05
5,457 Fast and Approximate Stream Mining of Quantiles and Frequencies Using Graphics Processors 2005 SIGMOD 5.4970777e-05
6,324 Revisiting Pipelined Parallelism in Multi-Join Query Processing 2005 VLDB 5.1109987e-05
Previous Page 1 / 1 Next

Semantically Similar Papers