Database Paper Browser

Optimizing Space Amplification in RocksDB

Summary: Introduces practical techniques to minimize space amplification in RocksDB, accepting higher CPU cost and read/write amplification (through compaction and tuning choices) in exchange for storage efficiency while preserving OLTP latency targets. Empirical results (TPC-C, LinkBench, and production workloads) show that RocksDB uses less than 50% of the storage InnoDB requires and often matches or exceeds its performance, demonstrating the first large-scale competitive LSM deployment for OLTP. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 309
Venue: CIDR
Year: 2017
Pagerank: 0.00019924098
Overall Rank: 569 | 96.05%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 49 of 49 citing papers.

Rank Citing Paper Year Venue Pagerank
609 Monkey: Optimal Navigable Key-Value Store 2017 SIGMOD 0.0001923446
1,169 SuRF: Practical Range Query Filtering with Fast Succinct Tries 2018 SIGMOD 0.00013536447
1,311 Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging 2018 SIGMOD 0.00012657439
1,610 MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph 2020 VLDB 0.00011148094
2,004 X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing 2019 SIGMOD 9.811707e-05
2,109 The Log-Structured Merge-Bush & the Wacky Continuum 2019 SIGMOD 9.5318694e-05
2,471 Morton Filters: Faster, Space-Efficient Cuckoo Filters via Biasing, Compression, and Decoupled Logical Sparsity 2018 VLDB 8.7320072e-05
2,606 Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn 2019 CIDR 8.4645832e-05
2,798 Chucky: A Succinct Cuckoo Filter for LSM-Tree 2021 SIGMOD 8.1080111e-05
3,386 Lethe: A Tunable Delete-Aware LSM Engine 2020 SIGMOD 7.1577103e-05
3,544 Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores 2020 SIGMOD 6.9898874e-05
3,793 Constructing and Analyzing the LSM Compaction Design Space 2021 VLDB 6.7617833e-05
3,965 Spooky: Granulating LSM-Tree Compactions Correctly 2022 VLDB 6.5820028e-05
4,662 Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store 2021 SIGMOD 6.013415e-05
4,670 Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google 2021 VLDB 6.0104466e-05
4,914 On Performance Stability in LSM-based Storage Systems 2020 VLDB 5.8315684e-05
5,069 Disaggregating RocksDB: A Production Experience 2023 SIGMOD 5.7202501e-05
5,308 Key-Value Storage Engines 2020 SIGMOD 5.576303e-05
5,631 A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases 2018 SIGMOD 5.4019839e-05
5,671 LSched: A Workload-Aware Learned Query Scheduler for Analytical Database Systems 2022 SIGMOD 5.3803919e-05
5,678 Cloud-Native Transactions and Analytics in SingleStore 2022 SIGMOD 5.3746593e-05
5,739 InfiniFilter: Expanding Filters to Infinity and Beyond 2023 SIGMOD 5.3471718e-05
5,762 Oasis: An Optimal Disjoint Segmented Learned Range Filter 2024 VLDB 5.3377299e-05
5,791 Dissecting, Designing, and Optimizing LSM-based Data Stores 2022 SIGMOD 5.3268999e-05
5,918 Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems 2021 VLDB 5.2737135e-05
6,113 Compactionary: A Dictionary for LSM Compactions 2022 SIGMOD 5.20426e-05
6,184 Dotori: A Key-Value SSD Based KV Store 2023 VLDB 5.1666338e-05
6,398 Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty 2022 VLDB 5.0819209e-05
6,660 ArkDB: A Key-Value Engine for Scalable Cloud Storage Services 2021 SIGMOD 4.9708868e-05
6,772 FineLine: Log-structured Transactional Storage and Recovery 2018 VLDB 4.9313122e-05
6,831 Prefix Filter: Practically and Theoretically Better Than Bloom 2022 VLDB 4.9130458e-05
7,696 Towards Optimal Transaction Scheduling 2024 VLDB 4.6754222e-05
7,756 LETUS: A Log-Structured Efficient Trusted Universal BlockChain Storage 2024 SIGMOD 4.6598957e-05
8,491 SA-LSM: Optimize Data Layout for LSM-tree Based Storage using Survival Analysis 2022 VLDB 4.4993073e-05
9,071 Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space 2024 SIGMOD 4.4025274e-05
9,191 FlashAlloc: Dedicating Flash Blocks By Objects 2023 VLDB 4.3766529e-05
9,496 Scabbard: Single-Node Fault-Tolerant Stream Processing 2022 VLDB 4.3341665e-05
9,604 GeaFlow: A Graph Extended and Accelerated Dataflow System 2023 SIGMOD 4.3177432e-05
9,799 CloudJump: Optimizing Cloud Databases for Cloud Storages 2022 VLDB 4.2818172e-05
9,821 SHIELD: Encrypting Persistent Data of LSM-KVS from Monolithic to Disaggregated Storage 2025 SIGMOD 4.2757088e-05
9,824 NEXT: A New Secondary Index Framework for LSM-based Data Storage 2025 SIGMOD 4.2751057e-05
10,255 How to Write to SSDs 2026 VLDB 4.1945683e-05
10,609 LogCloud: Fast Search of Compressed Logs on Object Storage 2025 VLDB 4.1945683e-05
10,625 Fair Transaction Processing for Multi-Tenant Databases 2025 VLDB 4.1945683e-05
10,643 Keigo: Co-designing Log-Structured Merge Key-Value Stores with a Non-Volatile, Concurrency-aware Storage Hierarchy 2025 VLDB 4.1945683e-05
10,934 Native Cloud Object Storage in Db2 Warehouse: Implementing a Fast and Cost-Efficient Cloud Storage Architecture 2024 SIGMOD 4.1945683e-05
11,049 On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB 2024 VLDB 4.1945683e-05
11,076 KGFabric: A Scalable Knowledge Graph Warehouse for Enterprise Data Interconnection 2024 VLDB 4.1945683e-05
11,166 Optimal Uncoordinated Unique IDs 2023 PODS 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers