Database Paper Browser

Back to papers

Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture

Summary: Revisits hash joins on coupled CPU-GPU architectures, exploiting on-chip cache reuse and fine-grained co-processing with and without partitioning. Extends cost models to auto-guide design choices; on AMD APUs, achieves 53% CPU-only, 35% GPU-only, and 28% conventional co-processing gains. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10750
Venue
VLDB
Year
2013
Pagerank
8.6078505e-05
Overall Rank
2,519 | 82.48%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 26 of 26 citing papers.

Rank Citing Paper Year Venue Pagerank
2,040 A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics 2020 SIGMOD 9.7057698e-05
2,067 HippogriffDB: Balancing I/O and GPU Bandwidth in Big Data Analytics 2016 VLDB 9.6392739e-05
3,305 Robust Query Processing in Co-Processor-accelerated Databases 2016 SIGMOD 7.2460965e-05
3,327 Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects 2020 SIGMOD 7.2205738e-05
3,465 GPL: A GPU-based Pipelined Query Processing Engine 2016 SIGMOD 7.0695873e-05
3,762 SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures 2016 SIGMOD 6.7804471e-05
3,898 Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment 2021 VLDB 6.6551268e-05
3,993 Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach 2015 VLDB 6.5534805e-05
4,002 MG-Join: A Scalable Join for Massively Parallel Multi-GPU Architectures 2021 SIGMOD 6.545665e-05
4,085 In-Cache Query Co-Processing on Coupled CPU-GPU Architectures 2015 VLDB 6.4620277e-05
4,363 Hardware-conscious Query Processing in GPU-accelerated Analytical Engines 2019 CIDR 6.2552614e-05
4,678 OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures 2013 VLDB 6.0046271e-05
5,019 Orchestrating Data Placement and Query Execution in Heterogeneous CPU-GPU DBMS 2022 VLDB 5.7559197e-05
5,040 Tile-based Lightweight Integer Compression in GPU 2022 SIGMOD 5.7425187e-05
5,088 TCUDB: Accelerating Database with Tensor Processors 2022 SIGMOD 5.7072189e-05
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
5,814 Towards a Hybrid Design for Fast Query Processing in DB2 with BLU Acceleration Using Graphical Processing Units: A Technology Demonstration 2016 SIGMOD 5.3167137e-05
6,223 Distributed GPU Joins on Fast RDMA-capable Networks 2023 SIGMOD 5.1496398e-05
6,369 Improving Execution Efficiency of Just-in-time Compilation based Query Processing on GPUs 2021 VLDB 5.0936663e-05
6,964 A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores 2019 VLDB 4.8815971e-05
7,038 Demonstrating Efficient Query Processing in Heterogeneous Environments 2014 SIGMOD 4.8546906e-05
7,568 Powerful GPUs or Fast Interconnects: Analyzing Relational Workloads on Modern GPUs 2025 VLDB 4.7084322e-05
7,751 Efficiently Processing Joins and Grouped Aggregations on GPUs 2025 SIGMOD 4.6603427e-05
8,846 Scaling your Hybrid CPU-GPU DBMS to Multiple GPUs 2024 VLDB 4.4372012e-05
10,253 Scalable GPU Acceleration of Scalar Functions in Analytical Databases: Compilation, Benchmarking, and Optimization 2026 VLDB 4.1945683e-05
11,020 Accelerating Merkle Patricia Trie with GPU 2024 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
78 Multiprocessor Hash-Based Join Algorithms 1985 VLDB 0.00056413752
81 Cache Conscious Algorithms for Relational Query Processing 1994 VLDB 0.00055548574
124 DBMSs On A Modern Processor: Where Does Time Go? 1999 VLDB 0.00045103515
232 A Performance Evaluation of Four Parallel Join Algorithms in a Shared-Nothing Multiprocessor Environment 1989 SIGMOD 0.00032122485
351 Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs 2009 VLDB 0.0002636504
540 Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs 2011 SIGMOD 0.0002063443
775 Relational Joins on Graphics Processors 2008 SIGMOD 0.00016823862
1,079 What happens during a Join? Dissecting CPU and Memory Optimization Effects 2000 VLDB 0.00014233415
1,101 Generic Database Cost Models for Hierarchical Memory Systems 2002 VLDB 0.00014070632
1,543 NUMA-aware algorithms: the case of data shuffling 2013 CIDR 0.0001145318
1,694 How Soccer Players Would do Stream Joins 2011 SIGMOD 0.00010893764
1,847 Using Segmented Right-Deep Trees for the Execution of Pipelined Hash Joins 1992 VLDB 0.00010333796
2,882 Database Compression on Graphics Processors 2010 VLDB 7.9661218e-05
3,103 High-Throughput Transaction Executions on Graphics Processors 2011 VLDB 7.5586143e-05
3,260 On Optimal Processor Allocation To Support Pipelined Hash Joins 1993 SIGMOD 7.3122611e-05
4,678 OmniDB: Towards Portable and Efficient Query Processing on Parallel CPU/GPU Architectures 2013 VLDB 6.0046271e-05
Previous Page 1 / 1 Next

Semantically Similar Papers