Database Paper Browser

The Case for Learned Index Structures

Summary: The paper treats indexes as models and proposes learned indexes as replacements for B-Tree, Hash, and Bitmap indexes. It analyzes the theoretical conditions under which learned indexes outperform traditional ones and reports initial results, pointing to ML-driven data management as a design paradigm. (summarized by gpt-5-nano on Feb 09 2026)
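
To make "indexes treated as models" concrete, here is a minimal sketch, not the paper's implementation: a single straight line is fitted to (key, position) pairs of a sorted array, and a lookup searches only the window around the predicted position. The class name LinearLearnedIndex, the single linear model, and the max-error window are assumptions chosen for illustration; numeric keys are assumed.

```python
import bisect


class LinearLearnedIndex:
    """Toy learned index (hypothetical): one line fitted to key -> position."""

    def __init__(self, keys):
        self.keys = sorted(keys)
        n = len(self.keys)
        if n == 0:
            raise ValueError("need at least one key")
        # Least-squares fit of position ~ slope * key + intercept.
        mean_k = sum(self.keys) / n
        mean_p = (n - 1) / 2
        var_k = sum((k - mean_k) ** 2 for k in self.keys)
        cov = sum((k - mean_k) * (i - mean_p) for i, k in enumerate(self.keys))
        self.slope = cov / var_k if var_k else 0.0
        self.intercept = mean_p - self.slope * mean_k
        # Largest prediction error over the indexed keys; it bounds the search window.
        self.max_err = max(abs(i - self._predict(k)) for i, k in enumerate(self.keys))

    def _predict(self, key):
        # The "model": an estimated position for the key.
        return int(round(self.slope * key + self.intercept))

    def lookup(self, key):
        """Return a position holding key, or None if the key is absent."""
        n = len(self.keys)
        guess = self._predict(key)
        # Clamp the window to valid positions so bisect never sees odd bounds.
        lo = min(max(0, guess - self.max_err), n)
        hi = max(lo, min(n, guess + self.max_err + 1))
        i = bisect.bisect_left(self.keys, key, lo, hi)
        if i < n and self.keys[i] == key:
            return i
        return None
```

With keys [3, 8, 15, 16, 23, 42, 57, 91], lookup(23) returns 4 and lookup(10) returns None. The window is correct for stored keys because every stored key lies within max_err positions of its prediction; how narrow the window is depends on how well a line fits the key distribution.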

Paper ID: 5555
Venue: SIGMOD
Year: 2018
Pagerank: 0.00049545203
Overall Rank: 102 | 99.30%
DOI: 10.1145/3183713.3196909

[Chart: Incoming Non-self Citations Over Time]

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 206 citing papers.

Rank Citing Paper Year Venue Pagerank
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
640 Bao: Making Learned Query Optimization Practical 2021 SIGMOD 0.00018759152
801 SageDB: A Learned Database System 2019 CIDR 0.00016505496
826 ALEX: An Updatable Adaptive Learned Index 2020 SIGMOD 0.00016224841
857 The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds 2020 VLDB 0.00015882892
884 Plan-Structured Deep Neural Network Models for Query Performance Prediction 2019 VLDB 0.00015654004
910 NeuroCard: One Cardinality Estimator for All Tables 2021 VLDB 0.00015423056
1,350 Northstar: An Interactive Data Science System 2018 VLDB 0.00012431059
1,375 FITing-Tree: A Data-aware Index Structure 2019 SIGMOD 0.00012303141
1,460 Benchmarking Learned Indexes 2021 VLDB 0.00011887068
1,478 Learning Multi-dimensional Indexes 2020 SIGMOD 0.00011762542
1,611 Qd-tree: Learning Data Layouts for Big Data Analytics 2020 SIGMOD 0.00011147324
1,643 CodexDB: Synthesizing Code for Query Processing from Natural Language Instructions using GPT-3 Codex 2022 VLDB 0.0001104256
1,703 Are We Ready For Learned Cardinality Estimation? 2021 VLDB 0.00010836769
1,737 QuickSel: Quick Selectivity Learning with Mixture Models 2020 SIGMOD 0.00010720294
1,855 AI Meets AI: Leveraging Query Executions to Improve Index Recommendations 2019 SIGMOD 0.00010315245
1,889 Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads 2021 VLDB 0.00010200865
2,057 From Natural Language Processing to Neural Databases 2021 VLDB 9.6624862e-05
2,083 Towards a Learning Optimizer for Shared Clouds 2019 VLDB 9.5834572e-05
2,115 LISA: A Learned Index Structure for Spatial Data 2020 SIGMOD 9.5257379e-05
2,364 Deep Learning Models for Selectivity Estimation of Multi-Attribute Queries 2020 SIGMOD 8.9554751e-05
2,552 Updatable Learned Index with Precise Positions 2021 VLDB 8.5530411e-05
2,606 Design Continuums and the Path Toward Self-Designing Key-Value Stores that Know and Learn 2019 CIDR 8.4645832e-05
2,678 Effectively Learning Spatial Indices 2020 VLDB 8.3252088e-05
2,732 Efficiently Searching In-Memory Sorted Arrays: Revenge of the Interpolation Search? 2019 SIGMOD 8.2087602e-05
2,762 FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation 2021 VLDB 8.1585394e-05
2,798 Chucky: A Succinct Cuckoo Filter for LSM-Tree 2021 SIGMOD 8.1080111e-05
2,865 Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations 2019 SIGMOD 7.9862595e-05
3,131 FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems 2022 VLDB 7.4985793e-05
3,269 iBTune: Individualized Buffer Tuning for Large-scale Cloud Databases 2019 VLDB 7.2998062e-05
3,416 LeCo: Lightweight Compression via Learning Serial Correlations 2024 SIGMOD 7.1196234e-05
3,473 AI Meets Database: AI4DB and DB4AI 2021 SIGMOD 7.062864e-05
3,488 Optimal Column Layout for Hybrid Workloads 2019 VLDB 7.0479329e-05
3,499 Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation 2021 VLDB 7.0376445e-05
3,611 SNARF: A Learning-Enhanced Range Filter 2022 VLDB 6.9191399e-05
3,625 Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings 2020 SIGMOD 6.9055212e-05
3,658 Towards a Hands-Free Query Optimizer through Deep Learning 2019 CIDR 6.8704209e-05
3,725 Estimating Cardinalities with Deep Sketches 2019 SIGMOD 6.8170734e-05
3,727 Cost-based or Learning-based? A Hybrid Query Optimizer for Query Plan Selection 2022 VLDB 6.8141709e-05
3,779 Instance-Optimized Data Layouts for Cloud Analytics Workloads 2021 SIGMOD 6.7747205e-05
3,885 Density-optimized Intersection-free Mapping and Matrix Multiplication for Join-Project Operations 2022 VLDB 6.6674822e-05
4,084 APEX: A High-Performance Learned Index on Persistent Memory 2022 VLDB 6.4622113e-05
4,097 The Case for a Learned Sorting Algorithm 2020 SIGMOD 6.4551616e-05
4,128 Are Updatable Learned Indexes Ready? 2022 VLDB 6.4292373e-05
4,227 Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine 2022 VLDB 6.3434324e-05
4,278 Similarity Query Processing for High-Dimensional Data 2020 VLDB 6.2953764e-05
4,359 Astrid: Accurate Selectivity Estimation for String Predicates using Deep Learning 2021 VLDB 6.2569955e-05
4,399 HUNTER: An Online Cloud Database Hybrid Tuning System for Personalized Requirements 2022 SIGMOD 6.2225151e-05

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
32 Differential Files: Their Application To The Maintenance Of Large Data Bases 1976 SIGMOD 0.00077486306
44 The Design Of Postgres 1986 SIGMOD 0.00071838587
103 Making B+-Trees Cache Conscious in Main Memory 2000 SIGMOD 0.00049150032
233 A Study of Index Structures for Main Memory Database Management Systems 1986 VLDB 0.00032021526
368 Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing 1998 VLDB 0.000254931
381 FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs 2010 SIGMOD 0.00024873637
720 Building a Database on S3 2008 SIGMOD 0.00017615431
1,213 RDF-3X: a RISC-style Engine for RDF 2008 VLDB 0.0001325231
1,312 Reducing the Storage Overhead of Main-Memory OLTP Databases with Hybrid Indexes 2016 SIGMOD 0.00012652548
1,471 Adaptive Range Filters for Cold Data: Avoiding Trips to Siberia 2013 VLDB 0.00011830111
1,696 A Seven-Dimensional Analysis of Hashing Methods and its Implications on Query Processing 2016 VLDB 0.00010881034
1,711 SCADS: Scale-Independent Storage for Social Computing Applications 2009 CIDR 0.0001080509
1,819 The End of a Myth: Distributed Transactions Can Scale 2017 VLDB 0.00010429773
1,873 An Architecture for Compiling UDF-centric Workflows 2015 VLDB 0.00010253002
1,913 BF-Tree: Approximate Tree Indexing 2014 VLDB 0.00010113937
3,777 A Hybrid B+-tree as Solution for In-Memory Indexing on CPU-GPU Heterogeneous Computing Platforms 2016 SIGMOD 6.7750901e-05
3,912 Two Birds, One Stone: A Fast, yet Lightweight, Indexing Scheme for Modern Database Systems 2017 VLDB 6.6354964e-05
4,897 The Wavelet Trie: Maintaining an Indexed Sequence of Strings in Compressed Space 2012 PODS 5.8469152e-05

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
8,811 Tuning Hierarchical Learned Indexes on Disk and Beyond 2022 SIGMOD 4.4441574e-05
5,157 Hist-Tree: Those Who Ignore It Are Doomed to Learn 2021 CIDR 5.6589595e-05
9,746 Why Are Learned Indexes So Effective but Sometimes Ineffective? 2025 VLDB 4.2897489e-05
6,445 Updatable Learned Indexes Meet Disk-Resident DBMS - From Evaluations to Design Choices 2023 SIGMOD 5.0589805e-05
2,552 Updatable Learned Index with Precise Positions 2021 VLDB 8.5530411e-05
4,128 Are Updatable Learned Indexes Ready? 2022 VLDB 6.4292373e-05
2,678 Effectively Learning Spatial Indices 2020 VLDB 8.3252088e-05
7,390 Making In-Memory Learned Indexes Efficient on Disk 2024 SIGMOD 4.7431654e-05
5,074 Learned Index: A Comprehensive Experimental Evaluation 2023 VLDB 5.7175726e-05
1,460 Benchmarking Learned Indexes 2021 VLDB 0.00011887068