Database Paper Browser

Back to papers

Spooky: Granulating LSM-Tree Compactions Correctly

Summary: Spooky introduces granulated LSM-tree compaction: top level partitioned into equal-sized files, with lower levels partitioned by those file boundaries. Merges a single group of perfectly overlapping files at a time, reducing space amplification vs Full Merge and write amplification vs Partial Merge. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12788
Venue
VLDB
Year
2022
Pagerank
6.5820028e-05
Overall Rank
3,965 | 72.42%
DOI
10.14778/3551793.3551853

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 22 of 22 citing papers.

Rank Citing Paper Year Venue Pagerank
5,739 InfiniFilter: Expanding Filters to Infinity and Beyond 2023 SIGMOD 5.3471718e-05
5,863 GRF: A Global Range Filter for LSM-Trees with Shape Encoding 2024 SIGMOD 5.2979639e-05
7,620 Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads 2023 SIGMOD 4.693568e-05
7,808 CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure 2024 SIGMOD 4.6455813e-05
8,009 CAMAL: Optimizing LSM-trees via Active Learning 2024 SIGMOD 4.6066863e-05
8,339 How to Grow an LSM-tree? Towards Bridging the Gap Between Theory and Practice 2025 SIGMOD 4.5434069e-05
8,805 ArceKV: Towards Workload-driven LSM-compactions for Key-Value Store Under Dynamic Workloads 2026 VLDB 4.4466855e-05
8,876 MirrorKV: An Efficient Key-Value Store on Hybrid Cloud Storage with Balanced Performance of Compaction and Querying 2023 SIGMOD 4.4304279e-05
9,071 Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space 2024 SIGMOD 4.4025274e-05
9,317 Are Joins over LSM-trees Ready? Take RocksDB as an Example 2025 VLDB 4.3556432e-05
9,386 Rethinking The Compaction Policies in LSM-trees 2025 SIGMOD 4.3455975e-05
9,529 Mnemosyne: Dynamic Workload-Aware BF Tuning via Accurate Statistics in LSM trees 2025 SIGMOD 4.32934e-05
9,758 Practical Dynamic Extension for Sampling Indexes 2023 SIGMOD 4.2879116e-05
9,987 A Multi-tenant Relational OLTP Database at Salesforce 2026 CIDR 4.1945683e-05
10,063 Counting Is All You Need for Instant Tuple Discovery: Enabling Real-Time HTAP in Standalone DBMSs 2026 SIGMOD 4.1945683e-05
10,182 Making LSM-Tree-based Key-Value Store Practical and Efficient for Multi-Tenant Serverless Cloud Databases 2026 SIGMOD 4.1945683e-05
10,191 PartitionKV: Redesigning LSM-tree KV Stores on NVMs with Adaptive Partitioning for Reducing Write Stalls and Amplification 2026 SIGMOD 4.1945683e-05
10,255 How to Write to SSDs 2026 VLDB 4.1945683e-05
10,367 Aster: Enhancing LSM-structures for Scalable Graph Database 2025 SIGMOD 4.1945683e-05
10,773 From FASTER to F2: Evolving Concurrent Key-Value Store Designs for Large Skewed Workloads 2025 VLDB 4.1945683e-05
11,049 On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB 2024 VLDB 4.1945683e-05
11,075 LavaStore: ByteDance's Purpose-built, High-performance, Cost-effective Local Storage Engine for Cloud Services 2024 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 27 of 27 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
360 BLOCKBENCH: A Framework for Analyzing Private Blockchains 2017 SIGMOD 0.00025790132
379 bLSM: A General Purpose Log Structured Merge Tree 2012 SIGMOD 0.0002493527
569 Optimizing Space Amplification in RocksDB 2017 CIDR 0.00019924098
609 Monkey: Optimal Navigable Key-Value Store 2017 SIGMOD 0.0001923446
1,117 Cache-Oblivious String B-trees 2006 PODS 0.00013882205
1,169 SuRF: Practical Range Query Filtering with Fast Succinct Tries 2018 SIGMOD 0.00013536447
1,311 Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging 2018 SIGMOD 0.00012657439
1,366 SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data 2017 VLDB 0.00012357685
1,610 MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph 2020 VLDB 0.00011148094
1,613 Realtime Data Processing at Facebook 2016 SIGMOD 0.00011140777
1,960 Compaction management in distributed key-value datastores 2015 VLDB 9.9521444e-05
2,004 X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing 2019 SIGMOD 9.811707e-05
2,109 The Log-Structured Merge-Bush & the Wacky Continuum 2019 SIGMOD 9.5318694e-05
2,157 The Data Calculator*: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models 2018 SIGMOD 9.416022e-05
2,606 Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn 2019 CIDR 8.4645832e-05
2,798 Chucky: A Succinct Cuckoo Filter for LSM-Tree 2021 SIGMOD 8.1080111e-05
3,362 Improving Flash Write Performance by Using Update Frequency 2013 VLDB 7.1734963e-05
3,386 Lethe: A Tunable Delete-Aware LSM Engine 2020 SIGMOD 7.1577103e-05
3,564 Accordion: Better Memory Organization for LSM Key-Value Stores 2018 VLDB 6.9669032e-05
3,793 Constructing and Analyzing the LSM Compaction Design Space 2021 VLDB 6.7617833e-05
5,158 Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes 2018 VLDB 5.6588553e-05
5,403 The Necessary Death of the Block Device Interface 2013 CIDR 5.5269076e-05
6,460 Toward a Better Understanding and Evaluation of Tree Structures on Flash SSDs 2021 VLDB 5.0554178e-05
7,174 Coconut Palm: Static and Streaming Data Series Exploration Now in your Palm 2019 SIGMOD 4.8114555e-05
7,472 GeckoFTL: Scalable Flash Translation Techniques For Very Large Flash Devices 2016 SIGMOD 4.7199619e-05
11,530 The End of Moore’s Law and the Rise of The Data Processor 2021 VLDB 4.1945683e-05
13,445 EagleTree: Exploring the Design Space of SSD-Based Algorithms 2013 VLDB -
Previous Page 1 / 1 Next

Semantically Similar Papers