Efficient Top-K Query Processing on Massively Parallel Hardware
Summary: GPU-based top-k algorithms for massively parallel data analytics, including a novel bitonic top-k with up to 15x speedups over sort for k ≤ 256. A cost model predicts relative performance across algorithms and matches measurements on modern GPUs. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Anil Shanbhag
- 2. Holger Pirk
- 3. Samuel Madden
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,328 | BOSS - An Architecture for Database Kernel Composition | 2024 | VLDB | 4.7610909e-05 |
| 9,123 | External Merge Sort for Top-K Queries: Eager input filtering guided by histograms | 2020 | SIGMOD | 4.3920263e-05 |
| 11,142 | Cache-Efficient Top-k Aggregation over High Cardinality Large Datasets | 2024 | VLDB | 4.1945683e-05 |
| 11,364 | MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area | 2022 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 239 | GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management | 2006 | SIGMOD | 0.00031617428 |
| 946 | Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture | 2008 | VLDB | 0.0001513324 |
| 1,273 | The Yin and Yang of Processing Data Warehousing Queries on GPU Devices | 2013 | VLDB | 0.00012912938 |
| 1,287 | Hardware-Oblivious Parallelism for In-Memory Column-Stores | 2013 | VLDB | 0.00012820443 |
| 2,014 | Voodoo - A Vector Algebra for Portable Database Performance on Modern Hardware | 2016 | VLDB | 9.7904029e-05 |
| 2,882 | Database Compression on Graphics Processors | 2010 | VLDB | 7.9661218e-05 |
| 3,151 | A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs | 2017 | SIGMOD | 7.4720668e-05 |
| 3,305 | Robust Query Processing in Co-Processor-accelerated Databases | 2016 | SIGMOD | 7.2460965e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,904 | Progressive Top-k Subarray Query Processing in Array Databases | 2019 | VLDB | 6.6424961e-05 |
| 239 | GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management | 2006 | SIGMOD | 0.00031617428 |
| 7,916 | Terabyte-Scale Analytics in the Blink of an Eye | 2026 | VLDB | 4.6173899e-05 |
| 930 | Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort | 2010 | SIGMOD | 0.00015238545 |
| 4,671 | Realtime Top-k Personalized PageRank over Large Graphs on GPUs | 2020 | VLDB | 6.0085645e-05 |
| 7,751 | Efficiently Processing Joins and Grouped Aggregations on GPUs | 2025 | SIGMOD | 4.6603427e-05 |
| 6,066 | GPU Database Systems Characterization and Optimization | 2024 | VLDB | 5.2290447e-05 |
| 9,123 | External Merge Sort for Top-K Queries: Eager input filtering guided by histograms | 2020 | SIGMOD | 4.3920263e-05 |
| 7,963 | Efficient Top-K Processing Over Query-Dependent Functions | 2008 | VLDB | 4.613363e-05 |
| 3,151 | A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs | 2017 | SIGMOD | 7.4720668e-05 |