Chucky: A Succinct Cuckoo Filter for LSM-Tree

Summary: Chucky replaces the Bloom filters of an LSM-tree with a single Cuckoo filter that maps each entry to its address within the LSM-tree, reducing memory accesses. Because the address bits take up space that would otherwise go to fingerprints and so raise the false positive rate (FPR), Chucky encodes the addresses using information-theoretic techniques, keeping fingerprints large and the FPR low. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6165
Venue: SIGMOD
Year: 2021
Pagerank: 8.1116755e-05
Overall Rank: 2,797 | 80.57%
DOI: 10.1145/3448016.3457273

Incoming Non-self Citations Over Time

Authors

1. Niv Dayan
2. Moshe Twitto

Incoming Citations (Sorted by Pagerank)

Showing 35 of 35 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
3,970	Spooky: Granulating LSM-Tree Compactions Correctly	2022	VLDB	6.5756727e-05
4,404	TreeLine: An Update-In-Place Key-Value Store for Modern Storage	2023	VLDB	6.2052645e-05
4,948	SplinterDB and Maplets: Improving the Tradeoffs in Key-Value Store Compaction Policy	2023	SIGMOD	5.810122e-05
5,749	InfiniFilter: Expanding Filters to Infinity and Beyond	2023	SIGMOD	5.3420354e-05
5,801	Dissecting, Designing, and Optimizing LSM-based Data Stores	2022	SIGMOD	5.3217858e-05
5,868	GRF: A Global Range Filter for LSM-Trees with Shape Encoding	2024	SIGMOD	5.2928769e-05
6,829	Prefix Filter: Practically and Theoretically Better Than Bloom	2022	VLDB	4.9083308e-05
7,043	SepHash: A Write-Optimized Hash Index On Disaggregated Memory via Separate Segment Structure	2024	VLDB	4.8480192e-05
7,152	Bf-Tree: A Modern Read-Write-Optimized Concurrent Larger-Than-Memory Range Index	2024	VLDB	4.8126591e-05
7,623	Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads	2023	SIGMOD	4.6890662e-05
7,662	Optimizing Collections of Bloom Filters within a Space Budget	2024	VLDB	4.6812878e-05
7,694	LSMGraph: A High-Performance Dynamic Graph Storage System with Multi-Level CSR	2024	SIGMOD	4.6712753e-05
7,812	CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure	2024	SIGMOD	4.6411266e-05
8,011	CAMAL: Optimizing LSM-trees via Active Learning	2024	SIGMOD	4.6022693e-05
8,333	How to Grow an LSM-tree? Towards Bridging the Gap Between Theory and Practice	2025	SIGMOD	4.5390511e-05
8,524	Aleph Filter: To Infinity in Constant Time	2024	VLDB	4.4893996e-05
8,717	Entropy-Learned Hashing: Constant Time Hashing with Controllable Uniformity	2022	SIGMOD	4.4566937e-05
8,722	Memento Filter: A Fast, Dynamic, and Robust Range Filter	2024	SIGMOD	4.4558244e-05
8,804	ArceKV: Towards Workload-driven LSM-compactions for Key-Value Store Under Dynamic Workloads	2026	VLDB	4.4424232e-05
8,876	MirrorKV: An Efficient Key-Value Store on Hybrid Cloud Storage with Balanced Performance of Compaction and Querying	2023	SIGMOD	4.4261814e-05
9,069	Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space	2024	SIGMOD	4.3983078e-05
9,221	Diva: Dynamic Range Filter for Var-Length Keys and Queries	2025	VLDB	4.366098e-05
9,390	Rethinking The Compaction Policies in LSM-trees	2025	SIGMOD	4.341433e-05
9,467	Disco: A Compact Index for LSM-trees	2025	SIGMOD	4.3309383e-05
9,529	Mnemosyne: Dynamic Workload-Aware BF Tuning via Accurate Statistics in LSM trees	2025	SIGMOD	4.3251912e-05
9,795	Optimizing Time Series Queries with Versions	2024	SIGMOD	4.2777144e-05
9,823	NEXT: A New Secondary Index Framework for LSM-based Data Storage	2025	SIGMOD	4.2710095e-05
9,986	A Multi-tenant Relational OLTP Database at Salesforce	2026	CIDR	4.1905499e-05
10,145	Breadcrumb Filters: Fast Fully Featured Filters	2026	SIGMOD	4.1905499e-05
10,176	Improving Range Scan Performance in LSM-trees with Group Caching	2026	SIGMOD	4.1905499e-05
10,182	Making LSM-Tree-based Key-Value Store Practical and Efficient for Multi-Tenant Serverless Cloud Databases	2026	SIGMOD	4.1905499e-05
10,379	Aster: Enhancing LSM-structures for Scalable Graph Database	2025	SIGMOD	4.1905499e-05
10,779	From FASTER to F2: Evolving Concurrent Key-Value Store Designs for Large Skewed Workloads	2025	VLDB	4.1905499e-05
11,078	LavaStore: ByteDance's Purpose-built, High-performance, Cost-effective Local Storage Engine for Cloud Services	2024	VLDB	4.1905499e-05
11,534	The End of Moore’s Law and the Rise of The Data Processor	2021	VLDB	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 35 of 35 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
101	The Case for Learned Index Structures	2018	SIGMOD	0.00049778866
361	BLOCKBENCH: A Framework for Analyzing Private Blockchains	2017	SIGMOD	0.00025765318
379	bLSM: A General Purpose Log Structured Merge Tree	2012	SIGMOD	0.00024954332
568	Optimizing Space Amplification in RocksDB	2017	CIDR	0.00019932335
608	Monkey: Optimal Navigable Key-Value Store	2017	SIGMOD	0.00019233548
1,071	Incremental Organization for Data Recording and Warehousing	1997	VLDB	0.00014265647
1,169	SuRF: Practical Range Query Filtering with Fast Succinct Tries	2018	SIGMOD	0.00013530267
1,249	Don't Thrash: How to Cache Your Hash on Flash	2012	VLDB	0.00013040265
1,309	Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging	2018	SIGMOD	0.00012655712
1,368	SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data	2017	VLDB	0.0001235708
1,437	AsterixDB: A Scalable, Open Source BDMS	2014	VLDB	0.00011973401
1,614	MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph	2020	VLDB	0.00011137963
1,616	Realtime Data Processing at Facebook	2016	SIGMOD	0.00011133818
1,815	SSD Bufferpool Extensions for Database Systems	2010	VLDB	0.00010439702
1,932	X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing	2019	SIGMOD	0.00010050776
1,957	Compaction management in distributed key-value datastores	2015	VLDB	9.961151e-05
2,112	The Log-Structured Merge-Bush & the Wacky Continuum	2019	SIGMOD	9.5244583e-05
2,153	The Data Calculator: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models	2018	SIGMOD	9.418541e-05
2,469	Morton Filters: Faster, Space-Efficient Cuckoo Filters via Biasing, Compression, and Decoupled Logical Sparsity	2018	VLDB	8.7255912e-05
2,606	Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn	2019	CIDR	8.4621503e-05
2,848	A General-Purpose Counting Filter: Making Every Bit Count	2017	SIGMOD	8.0202739e-05
3,363	Lethe: A Tunable Delete-Aware LSM Engine	2020	SIGMOD	7.1680649e-05
3,545	Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores	2020	SIGMOD	6.9831585e-05
3,567	Accordion: Better Memory Organization for LSM Key-Value Stores	2018	VLDB	6.9606988e-05
4,157	Performance-Optimal Filtering: Bloom Overtakes Cuckoo at High Throughput	2019	VLDB	6.3935343e-05
4,920	On Performance Stability in LSM-based Storage Systems	2020	VLDB	5.8262404e-05
4,993	Stacked Filters: Learning to Filter by Structure	2021	VLDB	5.7749409e-05
5,118	Design Tradeoffs of Data Access Methods	2016	SIGMOD	5.6781464e-05
5,156	Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes	2018	VLDB	5.6534878e-05
5,313	Key-Value Storage Engines	2020	SIGMOD	5.5711707e-05
5,401	The Necessary Death of the Block Device Interface	2013	CIDR	5.5260995e-05
6,440	From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems	2019	SIGMOD	5.0546781e-05
7,173	Coconut Palm: Static and Streaming Data Series Exploration Now in your Palm	2019	SIGMOD	4.8068454e-05
7,467	GeckoFTL: Scalable Flash Translation Techniques For Very Large Flash Devices	2016	SIGMOD	4.7174456e-05
13,458	EagleTree: Exploring the Design Space of SSD-Based Algorithms	2013	VLDB	-

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
9,932	ChainedFilter: Combining Membership Filters by Chain Rule	2023	SIGMOD	4.2461158e-05
6,829	Prefix Filter: Practically and Theoretically Better Than Bloom	2022	VLDB	4.9083308e-05
8,499	Conditional Cuckoo Filters	2021	SIGMOD	4.4929219e-05
2,848	A General-Purpose Counting Filter: Making Every Bit Count	2017	SIGMOD	8.0202739e-05
608	Monkey: Optimal Navigable Key-Value Store	2017	SIGMOD	0.00019233548
11,224	A Learned Cuckoo Filter for Approximate Membership Queries over Variable-sized Sliding Windows on Data Streams	2023	SIGMOD	4.1905499e-05
11,358	Workload-Adaptive Filtering in Storage Engines	2022	SIGMOD	4.1905499e-05
4,157	Performance-Optimal Filtering: Bloom Overtakes Cuckoo at High Throughput	2019	VLDB	6.3935343e-05
5,316	Cuckoo Index: A Lightweight Secondary Index Structure	2020	VLDB	5.5688295e-05
2,469	Morton Filters: Faster, Space-Efficient Cuckoo Filters via Biasing, Compression, and Decoupled Logical Sparsity	2018	VLDB	8.7255912e-05