Database Paper Browser

Back to papers

The Case for Learned In-Memory Joins

Summary: First systematic study applying CDF-based learned models and learned indexes (RMI, RadixSpline) to in-memory joins, exploring how model-predicted positions can accelerate INLJ, sort-based, and hash-based joins. Proposes optimized learned variants and shows experimentally that they beat state-of-the-art techniques across many datasets and scenarios. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13033
Venue
VLDB
Year
2023
Pagerank
4.5194164e-05
Overall Rank
8,417 | 41.45%
DOI
10.14778/3587136.3587148

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 35 of 35 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
52 Database Architecture Optimized for the new Bottleneck: Memory Access 1999 VLDB 0.00066474881
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
102 The Case for Learned Index Structures 2018 SIGMOD 0.00049545203
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
351 Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs 2009 VLDB 0.0002636504
404 Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited 2014 VLDB 0.00024143076
540 Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs 2011 SIGMOD 0.0002063443
585 Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems 2012 VLDB 0.00019706145
640 Bao: Making Learned Query Optimization Practical 2021 SIGMOD 0.00018759152
735 Umbra: A Disk-Based System with In-Memory Performance 2020 CIDR 0.00017452467
775 Relational Joins on Graphics Processors 2008 SIGMOD 0.00016823862
826 ALEX: An Updatable Adaptive Learned Index 2020 SIGMOD 0.00016224841
857 The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds 2020 VLDB 0.00015882892
1,079 What happens during a Join? Dissecting CPU and Memory Optimization Effects 2000 VLDB 0.00014233415
1,195 Buffering Accesses to Memory-Resident Index Structures 2003 VLDB 0.00013406526
1,460 Benchmarking Learned Indexes 2021 VLDB 0.00011887068
1,478 Learning Multi-dimensional Indexes 2020 SIGMOD 0.00011762542
1,543 NUMA-aware algorithms: the case of data shuffling 2013 CIDR 0.0001145318
1,611 Qd-tree: Learning Data Layouts for Big Data Analytics 2020 SIGMOD 0.00011147324
1,804 An Experimental Comparison of Thirteen Relational Equi-Joins in Main Memory 2016 SIGMOD 0.00010501185
2,552 Updatable Learned Index with Precise Positions 2021 VLDB 8.5530411e-05
2,772 Quickstep: A Data Platform Based on the Scaling-Up Approach 2018 VLDB 8.1401661e-05
3,175 Asynchronous Memory Access Chaining 2016 VLDB 7.438501e-05
3,193 Exploiting Coroutines to Attack the "Killer Nanoseconds" 2018 VLDB 7.4089674e-05
3,443 Distributed Join Algorithms on Thousands of Cores 2017 VLDB 7.0887214e-05
3,721 To Partition, or Not to Partition, That is the Join Question in a Real System 2021 SIGMOD 6.8179379e-05
4,097 The Case for a Learned Sorting Algorithm 2020 SIGMOD 6.4551616e-05
4,128 Are Updatable Learned Indexes Ready? 2022 VLDB 6.4292373e-05
5,314 Can Learned Models Replace Hash Functions? 2023 VLDB 5.5724608e-05
5,671 LSched: A Workload-Aware Learned Query Scheduler for Analytical Database Systems 2022 SIGMOD 5.3803919e-05
6,297 Towards instance-optimized data systems 2021 VLDB 5.1227886e-05
6,724 A Critical Analysis of Recursive Model Indexes 2022 VLDB 4.9484506e-05
7,378 Cache-Oblivious Query Processing 2007 CIDR 4.7480163e-05
9,785 Is FPGA Useful for Hash Joins? Exploring Hash Joins on Coupled CPU-FPGA Architecture 2020 CIDR 4.284797e-05
Previous Page 1 / 1 Next

Semantically Similar Papers