Database Paper Browser

Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects

Summary: An NVLink 2.0-based interconnect eliminates the CPU-GPU transfer bottleneck, enabling large-scale in-memory processing on GPUs. The paper demonstrates a no-partitioning hash join that scales beyond GPU memory, with up to 18x speedup over PCIe 3.0 and 7.3x over an optimized CPU implementation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 5920
Venue: SIGMOD
Year: 2020
Pagerank: 7.2205738e-05
Overall Rank: 3,327 | 76.86%
DOI: 10.1145/3318464.3389705

[Chart: Incoming Non-self Citations Over Time]

Authors

Incoming Citations (Sorted by Pagerank)

Showing 29 of 29 citing papers.

Rank Citing Paper Year Venue Pagerank
3,254 Query Processing on Tensor Computation Runtimes 2022 VLDB 7.3161051e-05
4,498 GaccO - A GPU-accelerated OLTP DBMS 2022 SIGMOD 6.138538e-05
5,019 Orchestrating Data Placement and Query Execution in Heterogeneous CPU-GPU DBMS 2022 VLDB 5.7559197e-05
5,040 Tile-based Lightweight Integer Compression in GPU 2022 SIGMOD 5.7425187e-05
5,088 TCUDB: Accelerating Database with Tensor Processors 2022 SIGMOD 5.7072189e-05
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
5,621 CXL and the Return of Scale-Up Database Engines 2024 VLDB 5.4061827e-05
6,066 GPU Database Systems Characterization and Optimization 2024 VLDB 5.2290447e-05
6,223 Distributed GPU Joins on Fast RDMA-capable Networks 2023 SIGMOD 5.1496398e-05
6,369 Improving Execution Efficiency of Just-in-time Compilation based Query Processing on GPUs 2021 VLDB 5.0936663e-05
6,453 Vortex: Overcoming Memory Capacity Limitations in GPU-Accelerated Large-Scale Data Analytics 2025 VLDB 5.0571108e-05
7,155 Evaluating Multi-GPU Sorting with Modern Interconnects 2022 SIGMOD 4.8149812e-05
7,306 DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines 2022 CIDR 4.7678574e-05
7,328 BOSS - An Architecture for Database Kernel Composition 2024 VLDB 4.7610909e-05
7,408 An Examination of CXL Memory Use Cases for In-Memory Database Management Systems using SAP HANA 2024 VLDB 4.7371479e-05
7,568 Powerful GPUs or Fast Interconnects: Analyzing Relational Workloads on Modern GPUs 2025 VLDB 4.7084322e-05
7,751 Efficiently Processing Joins and Grouped Aggregations on GPUs 2025 SIGMOD 4.6603427e-05
7,836 NOCAP: Near-Optimal Correlation-Aware Partitioning Joins 2023 SIGMOD 4.6380835e-05
7,916 Terabyte-Scale Analytics in the Blink of an Eye 2026 VLDB 4.6173899e-05
8,018 Parallelizing Intra-Window Join on Multicores: An Experimental Study 2021 SIGMOD 4.6046381e-05
8,478 Analyzing Vectorized Hash Tables Across CPU Architectures 2023 VLDB 4.5015937e-05
8,649 Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs 2024 SIGMOD 4.4762914e-05
8,846 Scaling your Hybrid CPU-GPU DBMS to Multiple GPUs 2024 VLDB 4.4372012e-05
9,229 H-Rocks: CPU-GPU accelerated Heterogeneous RocksDB on Persistent Memory 2025 SIGMOD 4.3690661e-05
9,731 Workload Placement on Heterogeneous CPU-GPU Systems 2024 VLDB 4.2942813e-05
9,838 Efficiently Joining Large Relations on Multi-GPU Systems 2025 VLDB 4.2740344e-05
10,121 TQEx: Tensor-based Query Engine Enhanced by Bridging the Gap 2026 SIGMOD 4.1945683e-05
10,281 GPU Acceleration of SQL Analytics on Compressed Data 2026 VLDB 4.1945683e-05
11,020 Accelerating Merkle Patricia Trie with GPU 2024 VLDB 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 31 of 31 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
351 Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs 2009 VLDB 0.0002636504
381 FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs 2010 SIGMOD 0.00024873637
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
540 Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs 2011 SIGMOD 0.0002063443
775 Relational Joins on Graphics Processors 2008 SIGMOD 0.00016823862
1,206 Rack-Scale In-Memory Join Processing using RDMA 2015 SIGMOD 0.00013281657
1,273 The Yin and Yang of Processing Data Warehousing Queries on GPU Devices 2013 VLDB 0.00012912938
1,287 Hardware-Oblivious Parallelism for In-Memory Column-Stores 2013 VLDB 0.00012820443
1,543 NUMA-aware algorithms: the case of data shuffling 2013 CIDR 0.0001145318
1,686 Fast Computation of Database Operations using Graphics Processors 2004 SIGMOD 0.00010917794
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
1,967 Compressed Linear Algebra for Large-Scale Machine Learning 2016 VLDB 9.9131712e-05
2,067 HippogriffDB: Balancing I/O and GPU Bandwidth in Big Data Analytics 2016 VLDB 9.6392739e-05
2,287 Pipelined Query Processing in Coprocessor Environments 2018 SIGMOD 9.0972606e-05
2,519 Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture 2013 VLDB 8.6078505e-05
2,651 HetExchange: Encapsulating heterogeneous CPU-GPU parallelism in JIT compiled engines 2019 VLDB 8.3694317e-05
2,882 Database Compression on Graphics Processors 2010 VLDB 7.9661218e-05
3,151 A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs 2017 SIGMOD 7.4720668e-05
3,305 Robust Query Processing in Co-Processor-accelerated Databases 2016 SIGMOD 7.2460965e-05
3,363 CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers 2019 VLDB 7.1731921e-05
3,762 SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures 2016 SIGMOD 6.7804471e-05
3,777 A Hybrid B+-tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms 2016 SIGMOD 6.7750901e-05
4,033 In-RDBMS Hardware Acceleration of Advanced Analytics 2018 VLDB 6.5113267e-05
4,085 In-Cache Query Co-Processing on Coupled CPU-GPU Architectures 2015 VLDB 6.4620277e-05
4,363 Hardware-conscious Query Processing in GPU-accelerated Analytical Engines 2019 CIDR 6.2552614e-05
4,770 The Case For Heterogeneous HTAP 2017 CIDR 5.9338845e-05
4,999 Adaptive Work Placement for Query Processing on Heterogeneous Computing Resources 2017 VLDB 5.7752801e-05
6,404 ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation 2019 VLDB 5.0786954e-05
6,964 A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores 2019 VLDB 4.8815971e-05
7,209 GPU-accelerated data management under the test of time 2020 CIDR 4.7996023e-05
8,048 Lowering the Latency of Data Processing Pipelines Through FPGA based Hardware Acceleration 2020 VLDB 4.5977431e-05

Semantically Similar Papers