Database Paper Browser

Back to papers

On-the-Fly Token Similarity Joins in Relational Databases

Summary: Introduces tokenize, a relational operator that generates tokens and embeds them in the plan, enabling optimization without precomputed tokens. Key ideas: algebraic rules, cardinality estimates, and replication-free handling of nested tokenize in PostgreSQL. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4901
Venue
SIGMOD
Year
2014
Pagerank
4.3423824e-05
Overall Rank
9,439 | 34.34%
DOI
10.1145/2588555.2610530

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
4,402 Smurf: Self-Service String Matching Using Random Forests 2019 VLDB 6.2195162e-05
4,775 Set Similarity Joins on MapReduce: An Experimental Survey 2018 VLDB 5.9315784e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 11 of 11 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers