An LSM-based Tuple Compaction Framework for Apache AsterixDB
Summary: LSM-based tuple compactor for Apache AsterixDB infers and extracts schema from JSON during ingestion, reducing storage overhead. Leverages LSM lifecycle events to piggyback schema inference with minimal ingestion impact and improved query performance. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,793 | Constructing and Analyzing the LSM Compaction Design Space | 2021 | VLDB | 6.7617833e-05 |
| 4,704 | JSON Tiles: Fast Analytics on Semi-Structured Data | 2021 | SIGMOD | 5.9853687e-05 |
| 5,791 | Dissecting, Designing, and Optimizing LSM-based Data Stores | 2022 | SIGMOD | 5.3268999e-05 |
| 6,398 | Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty | 2022 | VLDB | 5.0819209e-05 |
| 6,596 | Heracles: An Efficient Storage Model and Data Flushing for Performance Monitoring Timeseries | 2021 | VLDB | 4.9988301e-05 |
| 8,731 | Columnar Formats for Schemaless LSM-based Document Stores | 2022 | VLDB | 4.4577278e-05 |
| 9,386 | Rethinking The Compaction Policies in LSM-trees | 2025 | SIGMOD | 4.3455975e-05 |
| 11,150 | Zed: Leveraging Data Types to Process Eclectic Data | 2023 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 61 | DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases | 1997 | VLDB | 0.00064329285 |
| 109 | Dremel: Interactive Analysis of Web-Scale Datasets | 2010 | VLDB | 0.00048186983 |
| 926 | XMill: an Efficient Compressor for XML Data | 2000 | SIGMOD | 0.00015251799 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
| 3,349 | Schema Management for Document Stores | 2015 | VLDB | 7.1903648e-05 |
| 3,792 | Schema-Agnostic Indexing with Azure DocumentDB | 2015 | VLDB | 6.7618051e-05 |
| 4,489 | Automatic Generation of Normalized Relational Schemas from Nested Key-Value Data | 2016 | SIGMOD | 6.1434237e-05 |
| 5,535 | Lightweight Cardinality Estimation in LSM-based Systems | 2018 | SIGMOD | 5.4539235e-05 |
| 7,743 | Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems | 2019 | VLDB | 4.6626575e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,049 | On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB | 2024 | VLDB | 4.1945683e-05 |
| 10,367 | Aster: Enhancing LSM-structures for Scalable Graph Database | 2025 | SIGMOD | 4.1945683e-05 |
| 6,113 | Compactionary: A Dictionary for LSM Compactions | 2022 | SIGMOD | 5.20426e-05 |
| 3,793 | Constructing and Analyzing the LSM Compaction Design Space | 2021 | VLDB | 6.7617833e-05 |
| 5,535 | Lightweight Cardinality Estimation in LSM-based Systems | 2018 | SIGMOD | 5.4539235e-05 |
| 5,918 | Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems | 2021 | VLDB | 5.2737135e-05 |
| 4,914 | On Performance Stability in LSM-based Storage Systems | 2020 | VLDB | 5.8315684e-05 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
| 8,731 | Columnar Formats for Schemaless LSM-based Document Stores | 2022 | VLDB | 4.4577278e-05 |
| 7,743 | Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems | 2019 | VLDB | 4.6626575e-05 |