Database Paper Browser

Back to papers

LES3: Learning-based Exact Set Similarity Search

Summary: LES3 introduces a learning-based exact set similarity search that partitions sets and uses a TGM bitmap index to prune candidates. Analytical partitioning under distributional assumptions informs L2P and PTR, enabling pruning and faster exact search than baselines. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12387
Venue
VLDB
Year
2021
Pagerank
4.1945683e-05
Overall Rank
11,504 | 19.97%
DOI
10.14778/3476249.3476263

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 25 of 25 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
2 R-Trees: A Dynamic Index Structure For Spatial Searching 1984 SIGMOD 0.0032169493
102 The Case for Learned Index Structures 2018 SIGMOD 0.00049545203
266 Efficient Exact Set-Similarity Joins 2006 VLDB 0.00029718727
826 ALEX: An Updatable Adaptive Learned Index 2020 SIGMOD 0.00016224841
1,305 Bayesian Locality Sensitive Hashing for Fast Similarity Search 2012 VLDB 0.00012687101
1,375 FITing-Tree: A Data-aware Index Structure 2019 SIGMOD 0.00012303141
1,396 Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search 2012 SIGMOD 0.00012204748
1,478 Learning Multi-dimensional Indexes 2020 SIGMOD 0.00011762542
1,715 V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors 2012 VLDB 0.00010803271
2,115 LISA: A Learned Index Structure for Spatial Data 2020 SIGMOD 9.5257379e-05
2,158 Uni-Detect: A Unified Approach to Automated Error Detection in Tables 2019 SIGMOD 9.4141354e-05
2,779 Hashed Samples: Selectivity Estimators For Set Similarity Selection Queries 2008 VLDB 8.1320575e-05
3,141 ClusterJoin: A Similarity Joins Framework using Map-Reduce 2014 VLDB 7.4829448e-05
3,437 Speculative Distributed CSV Data Parsing for Big Data Analytics 2019 SIGMOD 7.0942161e-05
3,459 An Empirical Evaluation of Set Similarity Join Techniques 2016 VLDB 7.072508e-05
3,490 Leveraging Set Relations in Exact Set Similarity Join 2017 VLDB 7.0465856e-05
3,514 Spatio-Textual Similarity Joins 2013 VLDB 7.0226998e-05
4,050 An Efficient Partition Based Method for Exact Set Similarity Joins 2016 VLDB 6.4953612e-05
4,097 The Case for a Learned Sorting Algorithm 2020 SIGMOD 6.4551616e-05
4,353 Overlap Set Similarity Joins with Theoretical Guarantees 2018 SIGMOD 6.263585e-05
4,607 Data Integration and Machine Learning: A Natural Synergy 2018 SIGMOD 6.0538827e-05
5,749 BinDex: A Two-Layered Index for Fast and Robust Scans 2020 SIGMOD 5.3418923e-05
6,891 Analysis of Indexing Structures for Immutable Data 2020 SIGMOD 4.8927093e-05
8,430 Tree-Encoded Bitmaps 2020 SIGMOD 4.5154973e-05
11,655 Top-k Queries over Digital Traces 2019 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Semantically Similar Papers