Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine

Summary: Cosine is a self-designing KV store that adapts to workload, budget, and SLAs by exploring LSM, B-tree, and hybrid layouts. A unified I/O model and a learned CPU model predict performance and cost, enabling search; it beats RocksDB, WiredTiger, and FASTER. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 12625
Venue: VLDB
Year: 2022
Pagerank: 6.3381409e-05
Overall Rank: 4,227 | 70.63%
DOI: 10.14778/3485450.3485461

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 20 of 20 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
4,404	TreeLine: An Update-In-Place Key-Value Store for Modern Storage	2023	VLDB	6.2052645e-05
5,801	Dissecting, Designing, and Optimizing LSM-based Data Stores	2022	SIGMOD	5.3217858e-05
5,868	GRF: A Global Range Filter for LSM-Trees with Shape Encoding	2024	SIGMOD	5.2928769e-05
7,623	Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads	2023	SIGMOD	4.6890662e-05
7,842	NOCAP: Near-Optimal Correlation-Aware Partitioning Joins	2023	SIGMOD	4.6336361e-05
8,011	CAMAL: Optimizing LSM-trees via Active Learning	2024	SIGMOD	4.6022693e-05
8,333	How to Grow an LSM-tree? Towards Bridging the Gap Between Theory and Practice	2025	SIGMOD	4.5390511e-05
8,434	SageDB: An Instance-Optimized Data Analytics System	2022	VLDB	4.5077955e-05
8,624	Limousine: Blending Learned and Classical Indexes to Self-Design Larger-than-Memory Cloud Storage Engines	2024	SIGMOD	4.4786127e-05
8,804	ArceKV: Towards Workload-driven LSM-compactions for Key-Value Store Under Dynamic Workloads	2026	VLDB	4.4424232e-05
8,876	MirrorKV: An Efficient Key-Value Store on Hybrid Cloud Storage with Balanced Performance of Compaction and Querying	2023	SIGMOD	4.4261814e-05
9,069	Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space	2024	SIGMOD	4.3983078e-05
9,322	Are Joins over LSM-trees Ready? Take RocksDB as an Example	2025	VLDB	4.351469e-05
9,390	Rethinking The Compaction Policies in LSM-trees	2025	SIGMOD	4.341433e-05
9,469	Database Gyms	2023	CIDR	4.3304872e-05
9,787	The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format	2024	SIGMOD	4.2799988e-05
9,902	Towards Systematic Index Dynamization	2024	VLDB	4.2539423e-05
10,853	AXE: A Task Decomposition Approach to Learned LSM Tuning	2025	VLDB	4.1905499e-05
11,010	Breathing New Life into An Old Tree: Resolving Logging Dilemma of B+-tree on Modern Computational Storage Drives	2024	VLDB	4.1905499e-05
11,358	Workload-Adaptive Filtering in Storage Engines	2022	SIGMOD	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 29 of 29 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
1	Access Path Selection in a Relational Database Management System	1979	SIGMOD	0.0040465394
101	The Case for Learned Index Structures	2018	SIGMOD	0.00049778866
183	Automatic Database Management System Tuning Through Large-scale Machine Learning	2017	SIGMOD	0.00036859633
281	LinkBench: a Database Benchmark Based on the Facebook Social Graph	2013	SIGMOD	0.00029084275
371	Self-Driving Database Management Systems	2017	CIDR	0.00025382677
407	Database Cracking	2007	CIDR	0.00023941779
454	An Overview of Query Optimization in Relational Systems	1998	PODS	0.00022796106
608	Monkey: Optimal Navigable Key-Value Store	2017	SIGMOD	0.00019233548
796	SageDB: A Learned Database System	2019	CIDR	0.00016541749
892	Faster: A Concurrent Key-Value Store with In-Place Updates	2018	SIGMOD	0.00015522869
2,050	Automatically Indexing Millions of Databases in Microsoft Azure SQL Database	2019	SIGMOD	9.6883066e-05
2,128	SQLGraph: An Efficient Relational-Based Property Graph Store	2015	SIGMOD	9.4804485e-05
2,153	The Data Calculator*: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models	2018	SIGMOD	9.418541e-05
2,606	Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn	2019	CIDR	8.4621503e-05
2,977	ForkBase: An Efficient Storage Engine for Blockchain and Forkable Applications	2018	VLDB	7.7850958e-05
3,363	Lethe: A Tunable Delete-Aware LSM Engine	2020	SIGMOD	7.1680649e-05
3,545	Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores	2020	SIGMOD	6.9831585e-05
3,645	Autoscaling Tiered Cloud Storage in Anna	2019	VLDB	6.882432e-05
3,692	iBTune: Individualized Buffer Tuning for Large-scale Cloud Databases	2019	VLDB	6.8328808e-05
3,753	Choosing A Cloud DBMS: Architectures and Tradeoffs	2019	VLDB	6.7850001e-05
3,797	Constructing and Analyzing the LSM Compaction Design Space	2021	VLDB	6.7552936e-05
4,160	Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?	2017	SIGMOD	6.3886736e-05
4,659	Nova-LSM: A Distributed, Component-based LSM-tree Key-value Store	2021	SIGMOD	6.0076537e-05
5,313	Key-Value Storage Engines	2020	SIGMOD	5.5711707e-05
5,367	LogKV: Exploiting Key-Value Stores for Event Log Processing	2013	CIDR	5.5461097e-05
5,847	Order-Preserving Key Compression for In-Memory Search Trees	2020	SIGMOD	5.3040014e-05
6,440	From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems	2019	SIGMOD	5.0546781e-05
7,341	LSM-Trees and B-Trees: The Best of Both Worlds	2019	SIGMOD	4.7522998e-05
7,999	nKV in Action: Accelerating KV-Stores on Native Computational Storage with Near-Data Processing	2020	VLDB	4.6065616e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
4,387	LogStore: A Cloud-Native and Multi-Tenant Log Database	2021	SIGMOD	6.2219283e-05
1,368	SlimDB: A Space-Efficient Key-Value Storage Engine For Semi-Sorted Data	2017	VLDB	0.0001235708
8,876	MirrorKV: An Efficient Key-Value Store on Hybrid Cloud Storage with Balanced Performance of Compaction and Querying	2023	SIGMOD	4.4261814e-05
3,623	Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings	2020	SIGMOD	6.9017341e-05
10,182	Making LSM-Tree-based Key-Value Store Practical and Efficient for Multi-Tenant Serverless Cloud Databases	2026	SIGMOD	4.1905499e-05
12,053	COCCUS: Self-Configured Cost-Based Query Services in the Cloud	2013	SIGMOD	4.1905499e-05
11,573	From Worst-Case to Average-Case Analysis: Accurate Latency Predictions for Key-Value Storage Engines	2020	SIGMOD	4.1905499e-05
7,812	CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure	2024	SIGMOD	4.6411266e-05
5,313	Key-Value Storage Engines	2020	SIGMOD	5.5711707e-05
8,624	Limousine: Blending Learned and Classical Indexes to Self-Design Larger-than-Memory Cloud Storage Engines	2024	SIGMOD	4.4786127e-05