Towards Functional Decomposition of Storage Formats
Summary: Shows that tying compression blocks to row‑skipping partitions in columnar formats forces a compressibility vs scan-performance tradeoff; proposes splitting into a storage layer + Search Acceleration Layer (SAL). Finds SALs benefit from fine-grained partitions (~10–100 rows), with optimal size varying by metadata, data, and query, enabling independent tuning and improved tradeoffs. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Martin Prammer
- 2. Xinyu Zeng
- 3. Ruijun Meng
- 4. Wes McKinney
- 5. Huanchen Zhang
- 6. Andrew Pavlo
- 7. Jignesh M. Patel
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,201 | F3: The Open-Source Data File Format for the Future | 2026 | SIGMOD | 4.3743539e-05 |
| 9,901 | AnyBlox: A Framework for Self-Decoding Datasets | 2025 | VLDB | 4.258022e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,108 | Cracking the Database Store | 2005 | CIDR | 6.4440088e-05 |
| 10,220 | FlatStor: An Efficient Embedded-Index Based Columnar Data Layout for Multimodal Data Workloads | 2026 | VLDB | 4.1945683e-05 |
| 7,907 | Petabyte-Scale Row-Level Operations in Data Lakehouses | 2024 | VLDB | 4.6205839e-05 |
| 5,604 | Design and Evaluation of Storage Organizations for Read-Optimized Main Memory Databases | 2013 | VLDB | 5.4147933e-05 |
| 11,067 | Partition, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines | 2024 | VLDB | 4.1945683e-05 |
| 6,809 | Adaptive Data Skipping in Main-Memory Systems | 2016 | SIGMOD | 4.9206606e-05 |
| 6,279 | Self-Organizing Data Containers | 2022 | CIDR | 5.1295282e-05 |
| 11,993 | A Partitioning Framework for Aggressive Data Skipping | 2014 | VLDB | 4.1945683e-05 |
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 3,737 | Skipping-oriented Partitioning for Columnar Layouts | 2017 | VLDB | 6.8033227e-05 |