Columnar Formats for Schemaless LSM-based Document Stores
Summary: The paper extends columnar storage to schemaless LSM-based document stores by piggybacking on LSM events and adapting the Dremel format to flexible schemas. Runtime-type-aware query compilation and a new columnar layout in Apache AsterixDB yield orders-of-magnitude speedups with modest ingestion overhead. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,514 | An Empirical Evaluation of Columnar Storage Formats | 2024 | VLDB | 6.1204636e-05 |
| 9,071 | Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space | 2024 | SIGMOD | 4.4025274e-05 |
| 10,494 | Nested Parquet Is Flat, Why Not Use It? How To Scan Nested Data With On-the-Fly Key Generation and Joins | 2025 | SIGMOD | 4.1945683e-05 |
| 10,765 | Towards Principled, Practical Document Database Design | 2025 | VLDB | 4.1945683e-05 |
| 10,775 | Cloudy With a Chance of JSON | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 18 of 18 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,336 | Column Stores For Wide and Sparse Data | 2007 | CIDR | 5.1056582e-05 |
| 1,366 | SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data | 2017 | VLDB | 0.00012357685 |
| 6,666 | Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats | 2021 | VLDB | 4.9691571e-05 |
| 5,791 | Dissecting, Designing, and Optimizing LSM-based Data Stores | 2022 | SIGMOD | 5.3268999e-05 |
| 5,535 | Lightweight Cardinality Estimation in LSM-based Systems | 2018 | SIGMOD | 5.4539235e-05 |
| 5,918 | Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems | 2021 | VLDB | 5.2737135e-05 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
| 4,914 | On Performance Stability in LSM-based Storage Systems | 2020 | VLDB | 5.8315684e-05 |
| 7,743 | Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems | 2019 | VLDB | 4.6626575e-05 |
| 6,231 | An LSM-based Tuple Compaction Framework for Apache AsterixDB | 2020 | VLDB | 5.1457863e-05 |