Selection Pushdown in Column Stores using Bit Manipulation Instructions
Summary: Generic predicate pushdown over encoded columnar data enabling direct selection without decoding via Bit Manipulation Instructions (BMI). Evaluations on Parquet/TPC-H and Spark show up to 10x scan speedups and 5.5x end-to-end with complex joins. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yinan Li
- 2. Jianan Lu
- 3. Badrish Chandramouli
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 9,645 | The FastLanes File Format | 2025 | VLDB | 4.3109001e-05 |
| 9,846 | HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs | 2025 | VLDB | 4.2721228e-05 |
| 10,494 | Nested Parquet Is Flat, Why Not Use It? How To Scan Nested Data With On-the-Fly Key Generation and Joins | 2025 | SIGMOD | 4.1945683e-05 |
| 10,749 | Scaling GPU-Accelerated Databases beyond GPU Memory Size | 2025 | VLDB | 4.1945683e-05 |
| 10,803 | GraphAr: An Efficient Storage Scheme for Graph Data in Data Lakes | 2025 | VLDB | 4.1945683e-05 |
| 10,854 | LiquidCache: Efficient Pushdown Caching for Cloud-Native Data Analytics | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 21 of 21 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,532 | A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew | 2015 | SIGMOD | 5.4548897e-05 |
| 7,097 | Fast Multi-Column Sorting in Main-Memory Column-Stores | 2016 | SIGMOD | 4.8336115e-05 |
| 9,906 | Rethinking the Encoding of Integers for Scans on Skewed Data | 2023 | SIGMOD | 4.2578595e-05 |
| 6,367 | Good to the Last Bit: Data-Driven Encoding with CodecDB | 2021 | SIGMOD | 5.0941072e-05 |
| 9,671 | BIPie: Fast Selection and Aggregation on Encoded Data using Operator Specialization | 2018 | SIGMOD | 4.306318e-05 |
| 1,618 | Row-wise Parallel Predicate Evaluation | 2008 | VLDB | 0.00011114015 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 6,374 | Optimization of Conjunctive Predicates for Main Memory Column Stores | 2016 | VLDB | 5.0927058e-05 |
| 9,625 | Optimization of Disjunctive Predicates for Main Memory Column Stores | 2017 | SIGMOD | 4.3157275e-05 |
| 3,608 | Column Sketches: A Scan Accelerator for Rapid and Robust Predicate Evaluation | 2018 | SIGMOD | 6.924272e-05 |