Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space

Summary: LSM-tree design space generalized beyond fixed leveled/Tiered patterns: per-level runs, size ratios, and Bloom filters are optimized jointly. Key insight is a large last level for point lookups plus a runs/ratio correlation yielding Moose/Smoose, outperforming RocksDB baselines across mixed workloads. (summarized by gpt-5.4-mini on May 24 2026)

Paper ID: 6939
Venue: SIGMOD
Year: 2024
Pagerank: 4.3983078e-05
Overall Rank: 9,069 | 36.98%
DOI: 10.1145/3654978

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
5,769	Oasis: An Optimal Disjoint Segmented Learned Range Filter	2024	VLDB	5.3326049e-05
8,011	CAMAL: Optimizing LSM-trees via Active Learning	2024	SIGMOD	4.6022693e-05
8,333	How to Grow an LSM-tree? Towards Bridging the Gap Between Theory and Practice	2025	SIGMOD	4.5390511e-05
8,804	ArceKV: Towards Workload-driven LSM-compactions for Key-Value Store Under Dynamic Workloads	2026	VLDB	4.4424232e-05
9,322	Are Joins over LSM-trees Ready? Take RocksDB as an Example	2025	VLDB	4.351469e-05
9,390	Rethinking The Compaction Policies in LSM-trees	2025	SIGMOD	4.341433e-05
10,176	Improving Range Scan Performance in LSM-trees with Group Caching	2026	SIGMOD	4.1905499e-05
10,379	Aster: Enhancing LSM-structures for Scalable Graph Database	2025	SIGMOD	4.1905499e-05
10,853	AXE: A Task Decomposition Approach to Learned LSM Tuning	2025	VLDB	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 37 of 37 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
281	LinkBench: a Database Benchmark Based on the Facebook Social Graph	2013	SIGMOD	0.00029084275
363	CockroachDB: The Resilient Geo-Distributed SQL Database	2020	SIGMOD	0.00025750338
379	bLSM: A General Purpose Log Structured Merge Tree	2012	SIGMOD	0.00024954332
568	Optimizing Space Amplification in RocksDB	2017	CIDR	0.00019932335
608	Monkey: Optimal Navigable Key-Value Store	2017	SIGMOD	0.00019233548
1,008	Spanner: Becoming a SQL System	2017	SIGMOD	0.00014663067
1,169	SuRF: Practical Range Query Filtering with Fast Succinct Tries	2018	SIGMOD	0.00013530267
1,309	Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging	2018	SIGMOD	0.00012655712
1,932	X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing	2019	SIGMOD	0.00010050776
1,957	Compaction management in distributed key-value datastores	2015	VLDB	9.961151e-05
2,112	The Log-Structured Merge-Bush & the Wacky Continuum	2019	SIGMOD	9.5244583e-05
2,606	Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn	2019	CIDR	8.4621503e-05
2,797	Chucky: A Succinct Cuckoo Filter for LSM-Tree	2021	SIGMOD	8.1116755e-05
3,050	Viper: An Efficient Hybrid PMem-DRAM Key-Value Store	2021	VLDB	7.6491863e-05
3,363	Lethe: A Tunable Delete-Aware LSM Engine	2020	SIGMOD	7.1680649e-05
3,545	Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores	2020	SIGMOD	6.9831585e-05
3,614	SNARF: A Learning-Enhanced Range Filter	2022	VLDB	6.9124805e-05
3,673	LittleTable: A Time-Series Database and Its Uses	2017	SIGMOD	6.8509543e-05
3,970	Spooky: Granulating LSM-Tree Compactions Correctly	2022	VLDB	6.5756727e-05
4,227	Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine	2022	VLDB	6.3381409e-05
4,404	TreeLine: An Update-In-Place Key-Value Store for Modern Storage	2023	VLDB	6.2052645e-05
4,659	Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store	2021	SIGMOD	6.0076537e-05
4,836	Proteus: A Self-Designing Range Filter	2022	SIGMOD	5.8849277e-05
4,856	TOAIN: A Throughput Optimizing Adaptive Index for Answering Dynamic kNN Queries on Road Networks	2018	VLDB	5.868876e-05
4,948	SplinterDB and Maplets: Improving the Tradeoffs in Key-Value Store Compaction Policy	2023	SIGMOD	5.810122e-05
5,156	Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes	2018	VLDB	5.6534878e-05
5,159	DINOMO: An Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory	2022	VLDB	5.6509885e-05
5,455	Grafite: Taming Adversarial Queries with Optimal Range Filters	2024	SIGMOD	5.4965299e-05
5,769	Oasis: An Optimal Disjoint Segmented Learned Range Filter	2024	VLDB	5.3326049e-05
5,920	Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems	2021	VLDB	5.2686888e-05
6,186	Dotori: A Key-Value SSD Based KV Store	2023	VLDB	5.1616749e-05
6,394	Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty	2022	VLDB	5.0770427e-05
7,092	Revisiting the Design of LSM-tree Based OLTP Storage Engine with Persistent Memory	2021	VLDB	4.83025e-05
7,623	Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads	2023	SIGMOD	4.6890662e-05
8,490	SA-LSM: Optimize Data Layout for LSM-tree Based Storage using Survival Analysis	2022	VLDB	4.494994e-05
8,704	Incremental Partitioning for Efficient Spatial Data Analytics	2022	VLDB	4.4596039e-05
8,727	Columnar Formats for Schemaless LSM-based Document Stores	2022	VLDB	4.4534547e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
10,176	Improving Range Scan Performance in LSM-trees with Group Caching	2026	SIGMOD	4.1905499e-05
3,797	Constructing and Analyzing the LSM Compaction Design Space	2021	VLDB	6.7552936e-05
5,801	Dissecting, Designing, and Optimizing LSM-based Data Stores	2022	SIGMOD	5.3217858e-05
7,743	Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems	2019	VLDB	4.6581858e-05
9,390	Rethinking The Compaction Policies in LSM-trees	2025	SIGMOD	4.341433e-05
7,217	Breaking Down Memory Walls in LSM-based Storage Systems	2020	SIGMOD	4.7936491e-05
2,112	The Log-Structured Merge-Bush & the Wacky Continuum	2019	SIGMOD	9.5244583e-05
7,341	LSM-Trees and B-Trees: The Best of Both Worlds	2019	SIGMOD	4.7522998e-05
608	Monkey: Optimal Navigable Key-Value Store	2017	SIGMOD	0.00019233548
7,623	Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads	2023	SIGMOD	4.6890662e-05