Database Paper Browser

Back to papers

Pigeonring: A Principle for Faster Thresholded Similarity Search

Summary: Introduces the pigeonring principle, organizing boxes in a ring to constrain multiple boxes, yielding stronger filtering for thresholded similarity search. Shows pigeonhole is a special case, presents a universal filtering framework, and demonstrates faster, minimally invasive integration with existing algorithms on real data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11965
Venue
VLDB
Year
2019
Pagerank
5.2242306e-05
Overall Rank
6,074 | 57.75%
DOI
10.14778/3275536.3275539

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 7 of 7 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 50 of 52 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
34 Similarity Search in High Dimensions via Hashing 1999 VLDB 0.00076637636
91 M-tree: An Efficient Access Method for Similarity Search in Metric Spaces 1997 VLDB 0.0005181666
125 Approximate String Joins in a Database (Almost) for Free 2001 VLDB 0.00044847972
155 Robust and Efficient Fuzzy Match for Online Data Cleaning 2003 SIGMOD 0.00040637896
250 Efficient set joins on similarity predicates 2004 SIGMOD 0.00030661988
251 Robust and Fast Similarity Search for Moving Object Trajectories 2005 SIGMOD 0.00030644658
266 Efficient Exact Set-Similarity Joins 2006 VLDB 0.00029718727
358 On The Marriage of Lp-norms and Edit Distance 2004 VLDB 0.0002599481
400 Multi-Probe LSH: Efficient Indexing for High-Dimensional Similarity Search 2007 VLDB 0.0002427237
547 An Efficient Algorithm for Mining Association Rules in Large Databases 1995 VLDB 0.00020420717
572 Substructure Similarity Search in Graph Databases 2005 SIGMOD 0.00019887011
605 Locality-Sensitive Hashing Scheme Based on Dynamic Collision Counting 2012 SIGMOD 0.000193396
867 SRS: Solving c-Approximate Nearest Neighbor Queries in High Dimensional Euclidean Space with a Tiny Index 2015 VLDB 0.00015792021
931 The Pyramid-Technique: Towards Breaking the Curse of Dimensionality 1998 SIGMOD 0.00015238406
951 Comparing Stars: On Approximating Graph Edit Distance 2009 VLDB 0.00015106325
1,061 Warping Indexes with Envelope Transforms for Query by Humming 2003 SIGMOD 0.00014368716
1,161 Querying and Mining of Time Series Data: Experimental Comparison of Representations and Distance Measures 2008 VLDB 0.00013585236
1,202 VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams 2007 VLDB 0.00013326298
1,234 Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints 2008 VLDB 0.00013122499
1,305 Bayesian Locality Sensitive Hashing for Fast Similarity Search 2012 VLDB 0.00012687101
1,396 Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search 2012 SIGMOD 0.00012204748
1,776 Distributed Trajectory Similarity Search 2017 VLDB 0.00010593716
2,024 ATLAS: A Probabilistic Algorithm for High Dimensional Similarity Search 2011 SIGMOD 9.7519678e-05
2,192 DITA: Distributed In-Memory Trajectory Analytics 2018 SIGMOD 9.3185895e-05
2,193 Cost-Based Variable-Length-Gram Selection for String Collections to Support Approximate Queries Efficiently 2008 SIGMOD 9.3178557e-05
2,376 Bed-Tree: An All-Purpose Index Structure for String Similarity Search Based on Edit Distance 2010 SIGMOD 8.9424361e-05
2,497 OASIS: An Online and Accurate Technique for Local-alignment Searches on Biological Sequences 2003 VLDB 8.6472036e-05
2,525 Connected Substructure Similarity Search 2010 SIGMOD 8.5981082e-05
2,592 Pass-Join: A Partition-based Method for Similarity Joins 2012 VLDB 8.4795761e-05
2,740 String Similarity Joins: An Experimental Evaluation 2014 VLDB 8.1980628e-05
3,459 An Empirical Evaluation of Set Similarity Join Techniques 2016 VLDB 7.072508e-05
3,490 Leveraging Set Relations in Exact Set Similarity Join 2017 VLDB 7.0465856e-05
3,514 Spatio-Textual Similarity Joins 2013 VLDB 7.0226998e-05
3,518 FTW: Fast Similarity Search under the Time Warping Distance 2005 PODS 7.0153323e-05
3,578 Efficient Approximate Entity Extraction with Edit Distance Constraints 2009 SIGMOD 6.9503858e-05
3,774 Efficient Exact Edit Similarity Query Processing with the Asymmetric Signature Scheme 2011 SIGMOD 6.7757301e-05
4,050 An Efficient Partition Based Method for Exact Set Similarity Joins 2016 VLDB 6.4953612e-05
4,250 Local Similarity Search for Unstructured Text 2016 SIGMOD 6.3241139e-05
4,353 Overlap Set Similarity Joins with Theoretical Guarantees 2018 SIGMOD 6.263585e-05
4,373 Efficient and Effective Similarity Search over Probabilistic Data based on Earth Mover's Distance 2010 VLDB 6.2443809e-05
5,179 SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints 2017 VLDB 5.6428428e-05
5,326 Earth Mover's Distance based Similarity Search at Scale 2014 VLDB 5.5680074e-05
5,812 Reference-Based Alignment in Large Sequence Databases 2009 VLDB 5.3172025e-05
6,595 Trajectory Similarity Join in Spatial Networks 2017 VLDB 4.9993852e-05
6,726 A Pivotal Prefix Based Filtering Algorithm for String Similarity Search 2014 SIGMOD 4.9484027e-05
6,786 Interactive Time Series Exploration Powered by the Marriage of Similarity Distances 2017 VLDB 4.9257516e-05
7,005 Indexing the Edges – A simple and yet efficient approach to high-dimensional indexing 2000 PODS 4.8654221e-05
7,210 Set-based Similarity Search for Time Series 2016 SIGMOD 4.799457e-05
7,791 Similarity Search on Bregman Divergence: Towards Non-Metric Indexing 2009 VLDB 4.6502309e-05
8,706 ALAE: Accelerating Local Alignment with Affine Gap Exactly in Biosequence Databases 2012 VLDB 4.4642586e-05
Previous Page 1 / 2 Next

Semantically Similar Papers