Optimizing Space Amplification in RocksDB
Summary: Introduces practical techniques to minimize space amplification in RocksDB—trading CPU and read/write amplification via compaction/tuning—to prioritize storage efficiency while preserving OLTP latency targets. Empirical (TPC-C, LinkBench, production) results show RocksDB uses <50% of InnoDB storage and often matches or exceeds its performance, demonstrating the first large-scale competitive LSM deployment for OLTP. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Siying Dong
- 2. Mark Callaghan
- 3. Leonidas Galanis
- 4. Dhruba Borthakur
- 5. Tony Savor
- 6. Michael Stumm
Incoming Citations (Sorted by Pagerank)
Showing 49 of 49 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 281 | LinkBench: a Database Benchmark Based on the Facebook Social Graph | 2013 | SIGMOD | 0.0002906793 |
| 379 | bLSM: A General Purpose Log Structured Merge Tree | 2012 | SIGMOD | 0.0002493527 |
| 1,613 | Realtime Data Processing at Facebook | 2016 | SIGMOD | 0.00011140777 |
| 2,689 | LogBase: A Scalable Log-structured Database System in the Cloud | 2012 | VLDB | 8.2942515e-05 |
| 2,833 | Optimizing Optimistic Concurrency Control for Tree-Structured, Log-Structured Databases | 2015 | SIGMOD | 8.0460396e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,317 | Are Joins over LSM-trees Ready? Take RocksDB as an Example | 2025 | VLDB | 4.3556432e-05 |
| 5,331 | Hybrid Storage Management for Database Systems | 2013 | VLDB | 5.5665225e-05 |
| 1,960 | Compaction management in distributed key-value datastores | 2015 | VLDB | 9.9521444e-05 |
| 10,176 | Improving Range Scan Performance in LSM-trees with Group Caching | 2026 | SIGMOD | 4.1945683e-05 |
| 4,945 | SplinterDB and Maplets: Improving the Tradeoffs in Key-Value Store Compaction Policy | 2023 | SIGMOD | 5.8157107e-05 |
| 5,225 | Optimizing Databases by Learning Hidden Parameters of Solid State Drives | 2020 | VLDB | 5.6194324e-05 |
| 1,366 | SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data | 2017 | VLDB | 0.00012357685 |
| 9,799 | CloudJump: Optimizing Cloud Databases for Cloud Storages | 2022 | VLDB | 4.2818172e-05 |
| 5,069 | Disaggregating RocksDB: A Production Experience | 2023 | SIGMOD | 5.7202501e-05 |
| 1,610 | MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph | 2020 | VLDB | 0.00011148094 |