Efficiently Processing Joins and Grouped Aggregations on GPUs
Summary: Revisits GPU join and group-by; GFTR reduces random accesses, up to 2.3x. Optimizes hash- and sort-based group-by (19.4x, 1.7x); adds partition-based group-by for high cardinalities, with cost models and heuristics to guide optimizers. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Bowen Wu
- 2. Dimitrios Koutsoukos
- 3. Gustavo Alonso
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,568 | Powerful GPUs or Fast Interconnects: Analyzing Relational Workloads on Modern GPUs | 2025 | VLDB | 4.7084322e-05 |
| 7,916 | Terabyte-Scale Analytics in the Blink of an Eye | 2026 | VLDB | 4.6173899e-05 |
| 9,838 | Efficiently Joining Large Relations on Multi-GPU Systems | 2025 | VLDB | 4.2740344e-05 |
| 9,970 | Rethinking Analytical Processing in the GPU Era | 2026 | CIDR | 4.1945683e-05 |
| 10,084 | GraphMatch: Subgraph Query Processing on Steroids | 2026 | SIGMOD | 4.1945683e-05 |
| 10,121 | TQEx: Tensor-based Query Engine Enhanced by Bridging the Gap | 2026 | SIGMOD | 4.1945683e-05 |
| 10,281 | GPU Acceleration of SQL Analytics on Compressed Data | 2026 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 23 of 23 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,680 | A Practical Approach to Groupjoin and Nested Aggregates | 2021 | VLDB | 4.4694927e-05 |
| 5,247 | Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects | 2022 | SIGMOD | 5.6057839e-05 |
| 2,519 | Revisiting Co-Processing for Hash Joins on the Coupled CPU-GPU Architecture | 2013 | VLDB | 8.6078505e-05 |
| 5,322 | Generalized Hash Teams for Join and Group-by | 1999 | VLDB | 5.5701077e-05 |
| 6,223 | Distributed GPU Joins on Fast RDMA-capable Networks | 2023 | SIGMOD | 5.1496398e-05 |
| 5,087 | Accelerating Queries with Group-By and Join by Groupjoin | 2011 | VLDB | 5.7075009e-05 |
| 8,616 | A Case for Graphics-driven Query Processing | 2023 | VLDB | 4.4846474e-05 |
| 775 | Relational Joins on Graphics Processors | 2008 | SIGMOD | 0.00016823862 |
| 9,838 | Efficiently Joining Large Relations on Multi-GPU Systems | 2025 | VLDB | 4.2740344e-05 |
| 3,898 | Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment | 2021 | VLDB | 6.6551268e-05 |