Database Paper Browser

Back to papers

LeaFi: Data Series Indexes on Steroids with Learned Filters

Summary: LeaFi uses learned filters to boost pruning in tree-based data-series indexes. Models predict tight node-wise distance lower bounds for pruning, with train-time index building and query-time calibration to meet per-query recall targets; up to 20x pruning, 32x search speed at 99% recall.— (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
7046
Venue
SIGMOD
Year
2025
Pagerank
4.3690661e-05
Overall Rank
9,230 | 35.79%
DOI
10.1145/3709701

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 30 of 30 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
65 Fast Subsequence Matching in Time-Series Databases 1994 SIGMOD 0.00062029383
102 The Case for Learned Index Structures 2018 SIGMOD 0.00049545203
243 Locally Adaptive Dimensionality Reduction for Indexing Large Time Series Databases 2001 SIGMOD 0.00031074984
562 Query-Aware Locality-Sensitive Hashing for Approximate Nearest Neighbor Search 2016 VLDB 0.00020091752
1,157 A Data-adaptive and Dynamic Segmentation Index for Whole Matching on Time Series 2013 VLDB 0.00013610658
1,161 Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures 2008 VLDB 0.00013585236
1,364 Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination 2020 SIGMOD 0.00012370117
1,478 Learning Multi-dimensional Indexes 2020 SIGMOD 0.00011762542
1,516 k-Shape: Efficient and Accurate Clustering of Time Series 2015 SIGMOD 0.00011586255
1,889 Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads 2021 VLDB 0.00010200865
2,381 TSB-UAD: An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection 2022 VLDB 8.9327638e-05
3,183 Return of the Lernaean Hydra: Experimental Evaluation of Data Series Approximate Similarity Search 2020 VLDB 7.4228241e-05
3,266 Learned Cardinality Estimation: An In-depth Study 2022 SIGMOD 7.3074684e-05
3,400 ELPIS: Graph-Based Similarity Search for Scalable Data Science 2023 VLDB 7.1405533e-05
3,473 AI Meets Database: AI4DB and DB4AI 2021 SIGMOD 7.062864e-05
3,540 Scalable, Variable-Length Similarity Search in Data Series: The ULISSE Approach 2018 VLDB 6.9943185e-05
3,629 The Lernaean Hydra of Data Series Similarity Search: An Experimental Evaluation of the State of the Art 2019 VLDB 6.902069e-05
3,990 FactorJoin: A New Cardinality Estimation Framework for Join Queries 2023 SIGMOD 6.5581983e-05
4,462 LOGER: A Learned Optimizer towards Generating Efficient and Robust Query Execution Plans 2023 VLDB 6.1611784e-05
4,536 Data Series Progressive Similarity Search with Probabilistic Quality Guarantees 2020 SIGMOD 6.104642e-05
4,593 Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift 2023 SIGMOD 6.0606891e-05
4,731 Graph-Based Vector Search: An Experimental Evaluation of the State-of-the-Art 2025 SIGMOD 5.966659e-05
4,755 Indexing for Interactive Exploration of Big Data Series 2014 SIGMOD 5.946863e-05
5,158 Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes 2018 VLDB 5.6588553e-05
5,469 Learned Cardinality Estimation for Similarity Queries 2021 SIGMOD 5.4898192e-05
5,738 Hercules Against Data Series Similarity Search 2022 VLDB 5.3478528e-05
6,376 DET-LSH: A Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search 2024 VLDB 5.0916875e-05
7,095 Dumpy: A Compact and Adaptive Index for Large Data Series Collections 2023 SIGMOD 4.8350023e-05
9,206 Odyssey: A Journey in the Land of Distributed Data Series Similarity Search 2023 VLDB 4.373492e-05
9,247 iEDeaL: A Deep Learning Framework for Detecting Highly Imbalanced Interictal Epileptiform Discharges 2023 VLDB 4.3690661e-05
Previous Page 1 / 1 Next

Semantically Similar Papers