Database Paper Browser

Back to papers

Are Joins over LSM-trees Ready? Take RocksDB as an Example

Summary: Exhaustive study and benchmark of join methods over LSM-trees (RocksDB), defining a configuration space of join algorithms, secondary index designs, and consistency strategies. Theoretical cost analysis plus unified implementations show how LSM read/write trade-offs reorder join performance and yield practical selection guidance. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13779
Venue
VLDB
Year
2025
Pagerank
4.3556432e-05
Overall Rank
9,317 | 35.19%
DOI
10.14778/3717775.3717767

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 45 of 45 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
9 Implementation Techniques For Main Memory Database Systems 1984 SIGMOD 0.0014279444
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
281 LinkBench: a Database Benchmark Based on the Facebook Social Graph 2013 SIGMOD 0.0002906793
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
390 CockroachDB: The Resilient Geo-Distributed SQL Database 2020 SIGMOD 0.00024607299
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
609 Monkey: Optimal Navigable Key-Value Store 2017 SIGMOD 0.0001923446
640 Bao: Making Learned Query Optimization Practical 2021 SIGMOD 0.00018759152
806 An End-to-End Learning-based Cost Estimator 2020 VLDB 0.00016434274
857 The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds 2020 VLDB 0.00015882892
884 Plan-Structured Deep Neural Network Models for Query Performance Prediction 2019 VLDB 0.00015654004
910 NeuroCard: One Cardinality Estimator for All Tables 2021 VLDB 0.00015423056
1,311 Dostoevsky: Better Space-Time Trade-Offs for LSM-Tree Based Key-Value Stores via Adaptive Removal of Superfluous Merging 2018 SIGMOD 0.00012657439
1,438 AsterixDB: A Scalable, Open Source BDMS 2014 VLDB 0.00011973592
1,460 Benchmarking Learned Indexes 2021 VLDB 0.00011887068
1,610 MyRocks: LSM-Tree Database Storage Engine Serving Facebook's Social Graph 2020 VLDB 0.00011148094
1,613 Realtime Data Processing at Facebook 2016 SIGMOD 0.00011140777
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
2,004 X-Engine: An Optimized Storage Engine for Large-scale E-commerce Transaction Processing 2019 SIGMOD 9.811707e-05
2,109 The Log-Structured Merge-Bush & the Wacky Continuum 2019 SIGMOD 9.5318694e-05
2,157 The Data Calculator*: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models 2018 SIGMOD 9.416022e-05
2,606 Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn 2019 CIDR 8.4645832e-05
2,783 Flow-Loss: Learning Cardinality Estimates That Matter 2021 VLDB 8.1293383e-05
3,544 Rosetta: A Robust Space-Time Optimized Range Filter for Key-Value Stores 2020 SIGMOD 6.9898874e-05
3,564 Accordion: Better Memory Organization for LSM Key-Value Stores 2018 VLDB 6.9669032e-05
3,924 A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation 2021 SIGMOD 6.6271553e-05
3,965 Spooky: Granulating LSM-Tree Compactions Correctly 2022 VLDB 6.5820028e-05
3,990 FactorJoin: A New Cardinality Estimation Framework for Join Queries 2023 SIGMOD 6.5581983e-05
4,227 Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine 2022 VLDB 6.3434324e-05
4,245 A Disk-Based Join With Probabilistic Guarantees* 2005 SIGMOD 6.3272687e-05
4,835 Proteus: A Self-Designing Range Filter 2022 SIGMOD 5.8905445e-05
5,356 LogKV: Exploiting Key-Value Stores for Event Log Processing 2013 CIDR 5.5509715e-05
5,535 Lightweight Cardinality Estimation in LSM-based Systems 2018 SIGMOD 5.4539235e-05
5,631 A Comparative Study of Secondary Indexing Techniques in LSM-based NoSQL Databases 2018 SIGMOD 5.4019839e-05
5,762 Oasis: An Optimal Disjoint Segmented Learned Range Filter 2024 VLDB 5.3377299e-05
5,918 Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems 2021 VLDB 5.2737135e-05
6,192 SQLite: Past, Present, and Future 2022 VLDB 5.1641743e-05
6,398 Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty 2022 VLDB 5.0819209e-05
7,620 Learning to Optimize LSM-trees: Towards A Reinforcement Learning based Key-Value Store for Dynamic Workloads 2023 SIGMOD 4.693568e-05
7,743 Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems 2019 VLDB 4.6626575e-05
8,009 CAMAL: Optimizing LSM-trees via Active Learning 2024 SIGMOD 4.6066863e-05
8,417 The Case for Learned In-Memory Joins 2023 VLDB 4.5194164e-05
8,855 A Design Space Exploration and Evaluation for Main-Memory Hash Joins in Storage Class Memory 2023 VLDB 4.4348906e-05
9,071 Structural Designs Meet Optimality: Exploring Optimized LSM-tree Structures in A Colossal Configuration Space 2024 SIGMOD 4.4025274e-05
9,187 POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance 2024 VLDB 4.3780059e-05
Previous Page 1 / 1 Next

Semantically Similar Papers