Database Paper Browser

Rethinking SIMD Vectorization for In-Memory Databases

Summary: Introduces SIMD-vectorized operators for in-memory databases, built on gather and scatter instructions and covering selection scans, hash tables, partitioning, sorting, and joins. On Intel Xeon Phi (MIC) and mainstream CPUs, the vectorized implementations run up to 10x faster than prior work. The paper also discusses the algorithmic and power-efficiency implications of simple versus complex cores. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 5040
Venue: SIGMOD
Year: 2015
Pagerank: 0.00015045316
Overall Rank: 958 | 93.34%
DOI: 10.1145/2723372.2747645

Incoming Citations (Sorted by Pagerank)

Showing 50 of 51 citing papers.

Rank Citing Paper Year Venue Pagerank
1,263 Data Blocks: Hybrid OLTP and OLAP on Compressed Storage using both Vectorization and Compilation 2016 SIGMOD 0.00012982857
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
1,864 Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last 2018 VLDB 0.00010280966
1,943 Procella: Unifying serving and analytical data at YouTube 2019 VLDB 0.00010012569
2,014 Voodoo - A Vector Algebra for Portable Database Performance on Modern Hardware 2016 VLDB 9.7904029e-05
2,040 A Study of the Fundamental Performance Characteristics of GPUs and CPUs for Database Analytics 2020 SIGMOD 9.7057698e-05
2,471 Morton Filters: Faster, Space-Efficient Cuckoo Filters via Biasing, Compression, and Decoupled Logical Sparsity 2018 VLDB 8.7320072e-05
2,473 Photon: A Fast Query Engine for Lakehouse Systems 2022 SIGMOD 8.7237281e-05
2,613 Decomposed Bounded Floats for Fast Compression and Queries 2021 VLDB 8.4503824e-05
3,254 Query Processing on Tensor Computation Runtimes 2022 VLDB 7.3161051e-05
3,465 GPL: A GPU-based Pipelined Query Processing Engine 2016 SIGMOD 7.0695873e-05
3,608 Column Sketches: A Scan Accelerator for Rapid and Robust Predicate Evaluation 2018 SIGMOD 6.924272e-05
3,721 To Partition, or Not to Partition, That is the Join Question in a Real System 2021 SIGMOD 6.8179379e-05
4,161 Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe? 2017 SIGMOD 6.3938006e-05
4,184 CoroBase: Coroutine-Oriented Main-Memory Database Engine 2021 VLDB 6.3779731e-05
4,363 Hardware-conscious Query Processing in GPU-accelerated Analytical Engines 2019 CIDR 6.2552614e-05
4,495 ClickHouse - Lightning Fast Analytics for Everyone 2024 VLDB 6.1410277e-05
4,518 The FastLanes Compression Layout: Decoding >100 Billion Integers per Second with Scalar Code 2023 VLDB 6.117844e-05
4,602 Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture 2019 VLDB 6.0567387e-05
5,178 FPGA-based Data Partitioning 2017 SIGMOD 5.6438393e-05
5,247 Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects 2022 SIGMOD 5.6057839e-05
5,559 Rethinking MIMD-SIMD Interplay for Analytical Query Processing in In-Memory Database Engines 2025 CIDR 5.4340893e-05
5,749 BinDex: A Two-Layered Index for Fast and Robust Scans 2020 SIGMOD 5.3418923e-05
6,114 Database Processing-in-Memory: An Experimental Study 2020 VLDB 5.204248e-05
6,221 Charting the Design Space of Query Execution using VOILA 2021 VLDB 5.1512158e-05
6,540 Data Partitioning for In-Memory Systems: Myths, Challenges, and Opportunities 2019 CIDR 5.0219214e-05
7,097 Fast Multi-Column Sorting in Main-Memory Column-Stores 2016 SIGMOD 4.8336115e-05
7,335 MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model 2020 VLDB 4.7603723e-05
7,427 Selection Pushdown in Column Stores using Bit Manipulation Instructions 2023 SIGMOD 4.7327406e-05
7,497 Stackless Processing of Streamed Trees 2021 PODS 4.7180617e-05
7,546 Is Perfect Hashing Practical for OLAP Systems? 2024 CIDR 4.7148429e-05
7,811 Hardware-Oblivious SIMD Parallelism for In-Memory Column-Stores 2020 CIDR 4.6445165e-05
8,031 BSX: Subgraph Matching with Batch Backtracking Search 2025 SIGMOD 4.6018906e-05
8,094 Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms 2021 VLDB 4.5867812e-05
8,381 Interleaved Multi-Vectorizing 2020 VLDB 4.5310603e-05
8,447 Cabin: a Compressed Adaptive Binned Scan Index 2024 SIGMOD 4.5102052e-05
8,478 Analyzing Vectorized Hash Tables Across CPU Architectures 2023 VLDB 4.5015937e-05
8,626 Adaptive Code Generation for Data-Intensive Analytics 2021 VLDB 4.4829152e-05
8,856 Composable Data Management: An Execution Overview 2024 VLDB 4.4346165e-05
8,997 Chasing Similarity: Distribution-aware Aggregation Scheduling 2019 VLDB 4.4120041e-05
9,671 BIPie: Fast Selection and Aggregation on Encoded Data using Operator Specialization 2018 SIGMOD 4.306318e-05
9,838 Efficiently Joining Large Relations on Multi-GPU Systems 2025 VLDB 4.2740344e-05
10,179 LiveBin: A Localized and Version-Aware Binned Scan Index 2026 SIGMOD 4.1945683e-05
10,372 Data Chunk Compaction in Vectorized Execution 2025 SIGMOD 4.1945683e-05
10,749 Scaling GPU-Accelerated Databases beyond GPU Memory Size 2025 VLDB 4.1945683e-05
10,803 GraphAr: An Efficient Storage Scheme for Graph Data in Data Lakes 2025 VLDB 4.1945683e-05
10,893 Program your (custom) SIMD instruction set on FPGA in C++ 2024 CIDR 4.1945683e-05
10,997 Automating Vectorized Distributed Graph Computation 2024 SIGMOD 4.1945683e-05
11,090 Simple (yet Efficient) Function Authoring for Vectorized Engines 2024 VLDB 4.1945683e-05
11,843 Efficient Query Processing on Many-core Architectures: A Case Study with Intel Xeon Phi Processor 2016 SIGMOD 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 21 of 21 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
35 MonetDB/X100: Hyper-Pipelining Query Execution 2005 CIDR 0.00076197749
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
60 Efficiently Compiling Efficient Query Plans for Modern Hardware 2011 VLDB 0.00064439773
241 DB2 with BLU Acceleration: So Much More than Just a Column Store 2013 VLDB 0.00031420034
305 SIMD-Scan: Ultra Fast in-Memory Table Scan using on-Chip Vector Processing Units 2009 VLDB 0.00028248614
343 Implementing Database Operations Using SIMD Instructions 2002 SIGMOD 0.00026768139
351 Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs 2009 VLDB 0.0002636504
381 FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs 2010 SIGMOD 0.00024873637
404 Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited 2014 VLDB 0.00024143076
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
540 Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs 2011 SIGMOD 0.0002063443
585 Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems 2012 VLDB 0.00019706145
930 Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort 2010 SIGMOD 0.00015238545
946 Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture 2008 VLDB 0.0001513324
1,079 What happens during a Join? Dissecting CPU and Memory Optimization Effects 2000 VLDB 0.00014233415
1,607 A Comprehensive Study of Main-Memory Partitioning and its Application to Large-Scale Comparison- and Radix-Sort 2014 SIGMOD 0.00011162682
1,989 Column Imprints: A Secondary Index Structure 2013 SIGMOD 9.8478437e-05
3,722 Cache-Conscious Radix-Decluster Projections 2004 VLDB 6.8176075e-05
3,993 Improving Main Memory Hash Joins on Intel Xeon Phi Processors: An Experimental Approach 2015 VLDB 6.5534805e-05
8,858 Automatic Contention Detection and Amelioration for Data-Intensive Operations 2010 SIGMOD 4.4344518e-05
