Database Paper Browser


Dissecting, Designing, and Optimizing LSM-based Data Stores

Summary: A structured, tutorial-style treatment of LSM trees for researchers, detailing core operations, access paths, and ingestion/compaction tradeoffs. It surveys recent optimizations and the design space across ingestion, reads, and queries, offering a roadmap for tuning LSM engines and identifying open challenges. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6403
Venue: SIGMOD
Year: 2022
Pagerank: 5.3268999e-05
Overall Rank: 5,791 | 59.72%
DOI: 10.1145/3514221.3522563

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 40 of 40 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
102 | The Case for Learned Index Structures | 2018 | SIGMOD | 0.00049545203
310 | The Vertica Analytic Database: C-Store 7 Years Later | 2012 | VLDB | 0.00028132402
368 | Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing | 1998 | VLDB | 0.000254931
379 | bLSM: A General Purpose Log Structured Merge Tree | 2012 | SIGMOD | 0.0002493527
569 | Optimizing Space Amplification in RocksDB | 2017 | CIDR | 0.00019924098
609 | Monkey: Optimal Navigable Key-Value Store | 2017 | SIGMOD | 0.0001923446
823 | Design, Implementation, and Performance of the LHAM Log-Structured History Data Access Method | 1998 | VLDB | 0.000162378
857 | The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds | 2020 | VLDB | 0.00015882892
899 | Faster: A Concurrent Key-Value Store with In-Place Updates | 2018 | SIGMOD | 0.00015509287
1,169 | SuRF: Practical Range Query Filtering with Fast Succinct Tries | 2018 | SIGMOD | 0.00013536447
1,311 | Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging | 2018 | SIGMOD | 0.00012657439
1,366 | SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data | 2017 | VLDB | 0.00012357685
1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592
1,610 | MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph | 2020 | VLDB | 0.00011148094
1,792 | Hybrid Transactional/Analytical Processing: A Survey | 2017 | SIGMOD | 0.00010537893
2,004 | X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing | 2019 | SIGMOD | 9.811707e-05
2,109 | The Log-Structured Merge-Bush & the Wacky Continuum | 2019 | SIGMOD | 9.5318694e-05
2,157 | The Data Calculator*: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models | 2018 | SIGMOD | 9.416022e-05
2,471 | Morton Filters: Faster, Space-Efficient Cuckoo Filters via Biasing, Compression, and Decoupled Logical Sparsity | 2018 | VLDB | 8.7320072e-05
2,606 | Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn | 2019 | CIDR | 8.4645832e-05
2,798 | Chucky: A Succinct Cuckoo Filter for LSM-Tree | 2021 | SIGMOD | 8.1080111e-05
3,386 | Lethe: A Tunable Delete-Aware LSM Engine | 2020 | SIGMOD | 7.1577103e-05
3,544 | Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores | 2020 | SIGMOD | 6.9898874e-05
3,793 | Constructing and Analyzing the LSM Compaction Design Space | 2021 | VLDB | 6.7617833e-05
4,227 | Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine | 2022 | VLDB | 6.3434324e-05
4,588 | Leaper: A Learned Prefetcher for Cache Invalidation in LSM-tree based Storage Engines | 2020 | VLDB | 6.0655418e-05
4,662 | Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store | 2021 | SIGMOD | 6.013415e-05
4,914 | On Performance Stability in LSM-based Storage Systems | 2020 | VLDB | 5.8315684e-05
5,119 | Design Tradeoffs of Data Access Methods | 2016 | SIGMOD | 5.6807904e-05
5,308 | Key-Value Storage Engines | 2020 | SIGMOD | 5.576303e-05
5,631 | A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases | 2018 | SIGMOD | 5.4019839e-05
5,848 | MaSM: Efficient Online Updates in Data Warehouses | 2011 | SIGMOD | 5.3021155e-05
5,918 | Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems | 2021 | VLDB | 5.2737135e-05
6,113 | Compactionary: A Dictionary for LSM Compactions | 2022 | SIGMOD | 5.20426e-05
6,231 | An LSM-based Tuple Compaction Framework for Apache AsterixDB | 2020 | VLDB | 5.1457863e-05
6,398 | Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty | 2022 | VLDB | 5.0819209e-05
6,649 | Big Data Space Fungus | 2015 | CIDR | 4.9768878e-05
7,218 | Breaking Down Memory Walls in LSM-based Storage Systems | 2020 | SIGMOD | 4.7982543e-05
7,743 | Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems | 2019 | VLDB | 4.6626575e-05
7,909 | Immutability Changes Everything | 2015 | CIDR | 4.6203676e-05

Semantically Similar Papers