Database Paper Browser

Back to papers

The Data Calculator*: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models

Summary: Introduces the Data Calculator, a design engine for data structures built from fine-grained layout primitives. It uses learned cost models with first-principles design to synthesize and predict performance of arbitrary structures without building them. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5592
Venue
SIGMOD
Year
2018
Pagerank
9.416022e-05
Overall Rank
2,157 | 85.00%
DOI
10.1145/3183713.3199671

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 35 of 35 citing papers.

Rank Citing Paper Year Venue Pagerank
514 An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning 2019 SIGMOD 0.0002124895
801 SageDB: A Learned Database System 2019 CIDR 0.00016505496
1,478 Learning Multi-dimensional Indexes 2020 SIGMOD 0.00011762542
1,855 AI Meets AI: Leveraging Query Executions to Improve Index Recommendations 2019 SIGMOD 0.00010315245
2,109 The Log-Structured Merge-Bush & the Wacky Continuum 2019 SIGMOD 9.5318694e-05
2,606 Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn 2019 CIDR 8.4645832e-05
2,798 Chucky: A Succinct Cuckoo Filter for LSM-Tree 2021 SIGMOD 8.1080111e-05
3,488 Optimal Column Layout for Hybrid Workloads 2019 VLDB 7.0479329e-05
3,779 Instance-Optimized Data Layouts for Cloud Analytics Workloads 2021 SIGMOD 6.7747205e-05
3,787 White-box Compression: Learning and Exploiting Compact Table Representations 2020 CIDR 6.7674374e-05
3,965 Spooky: Granulating LSM-Tree Compactions Correctly 2022 VLDB 6.5820028e-05
4,227 Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine 2022 VLDB 6.3434324e-05
4,835 Proteus: A Self-Designing Range Filter 2022 SIGMOD 5.8905445e-05
5,308 Key-Value Storage Engines 2020 SIGMOD 5.576303e-05
5,791 Dissecting, Designing, and Optimizing LSM-based Data Stores 2022 SIGMOD 5.3268999e-05
5,924 HMAB: Self-Driving Hierarchy of Bandits for Integrated Physical Database Design Tuning 2023 VLDB 5.2719183e-05
6,221 Charting the Design Space of Query Execution using VOILA 2021 VLDB 5.1512158e-05
6,398 Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty 2022 VLDB 5.0819209e-05
6,456 From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems 2019 SIGMOD 5.0564619e-05
7,343 LSM-Trees and B-Trees: The Best of Both Worlds 2019 SIGMOD 4.7568442e-05
7,470 The Case for Deep Query Optimisation 2020 CIDR 4.7201897e-05
7,995 BP-tree: Overcoming the Point-Range Operation Tradeoff for In-Memory B-trees 2023 VLDB 4.6109825e-05
8,214 Chemistry behind Agreement 2023 CIDR 4.5577562e-05
8,346 Deep Learning: Systems and Responsibility 2021 SIGMOD 4.5420668e-05
8,414 The next 50 Years in Database Indexing or: The Case for Automatically Generated Index Structures 2022 VLDB 4.5203005e-05
8,578 Robust and Budget-Constrained Encoding Configurations for In-Memory Database Systems 2022 VLDB 4.4923477e-05
8,627 Limousine: Blending Learned and Classical Indexes to Self-Design Larger-than-Memory Cloud Storage Engines 2024 SIGMOD 4.4829101e-05
8,774 Tiresias: Enabling Predictive Autonomous Storage and Indexing 2022 VLDB 4.4559995e-05
9,095 AirIndex: Versatile Index Tuning Through Data and Storage 2023 SIGMOD 4.3975034e-05
9,317 Are Joins over LSM-trees Ready? Take RocksDB as an Example 2025 VLDB 4.3556432e-05
9,806 The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format 2024 SIGMOD 4.2805224e-05
10,849 AXE: A Task Decomposition Approach to Learned LSM Tuning 2025 VLDB 4.1945683e-05
10,931 Proactive Resume and Pause of Resources for Microsoft Azure SQL Database Serverless 2024 SIGMOD 4.1945683e-05
11,445 Learning Algorithms for Automatic Data Structure Design 2021 SIGMOD 4.1945683e-05
11,569 From Worst-Case to Average-Case Analysis: Accurate Latency Predictions for Key-Value Storage Engines 2020 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 32 of 32 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
29 Evaluation of Database Access Paths 1978 SIGMOD 0.00080392503
103 Making B+-Trees Cache Conscious in Main Memory 2000 SIGMOD 0.00049150032
183 Automatic Database Management System Tuning Through Large-scale Machine Learning 2017 SIGMOD 0.00036721403
233 A Study of Index Structures for Main Memory Database Management Systems 1986 VLDB 0.00032021526
237 An Efficient, Cost-Driven Index Selection Tool for Microsoft SQL Server 1997 VLDB 0.00031726304
242 Generalized Search Trees for Database Systems (Extended Abstract) 1995 VLDB 0.00031110894
381 FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs 2010 SIGMOD 0.00024873637
408 Database Cracking 2007 CIDR 0.00023953844
527 Rethinking Database System Architecture: Towards a Self-tuning RISC-style Database System 2000 VLDB 0.00020868847
566 Query Optimization by Simulated Annealing 1987 SIGMOD 0.00019970535
609 Monkey: Optimal Navigable Key-Value Store 2017 SIGMOD 0.0001923446
704 Building Efficient Query Engines in a High-Level Language 2014 VLDB 0.00017900583
1,007 Application Of An Analytical Model To Evaluate Storage Structures 1976 SIGMOD 0.00014668015
1,101 Generic Database Cost Models for Hierarchical Memory Systems 2002 VLDB 0.00014070632
1,700 Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads 2016 SIGMOD 0.00010858865
1,780 LLAMA: A Cache/Storage Subsystem for Modern Hardware 2013 VLDB 0.00010580669
1,807 H2O: A Hands-free Adaptive Store 2014 SIGMOD 0.00010487796
1,999 Data Morphing: An Adaptive, Cache-Conscious Storage Technique 2003 VLDB 9.8235392e-05
2,165 Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation 2015 SIGMOD 9.389622e-05
2,229 Self-organizing Tuple Reconstruction in Column-stores 2009 SIGMOD 9.2350274e-05
2,419 Towards a One Size Fits All Database Architecture 2011 CIDR 8.853712e-05
2,516 Concurrency and Recovery in Generalized Search Trees 1997 SIGMOD 8.6106981e-05
2,915 Brainwash: A Data System for Feature Engineering 2013 CIDR 7.9078385e-05
2,987 The Uncracked Pieces in Database Cracking 2014 VLDB 7.7787088e-05
4,161 Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe? 2017 SIGMOD 6.3938006e-05
4,755 Indexing for Interactive Exploration of Big Data Series 2014 SIGMOD 5.946863e-05
5,376 Holistic Indexing in Main-memory Column-stores 2015 SIGMOD 5.5417421e-05
5,390 High-Performance Extensible Indexing 1999 VLDB 5.5346145e-05
6,201 Concurrency Control for Adaptive Indexing 2012 VLDB 5.1600319e-05
6,708 Just-In-Time Data Structures 2015 CIDR 4.953106e-05
7,650 amdb: An Access Method Debugging Tool 1998 SIGMOD 4.6882482e-05
7,819 Main Memory Adaptive Denormalization 2016 SIGMOD 4.6432769e-05
Previous Page 1 / 1 Next

Semantically Similar Papers