Database Paper Browser

Allign: Aligning All-Pair Near-Duplicate Passages in Long Texts

Summary: Allign uses a min-hash-based method to align all pairs of near-duplicate passages between two texts. It avoids the O(n^2 m^2) enumeration of passage pairs by using compact windows, matches windows that share a min-hash, and reports the longest and sentence-level near-duplicates. It outperforms prior alignment methods on real data. (summarized by gpt-5-nano on Feb 09 2026)
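
The window-matching idea in the summary can be illustrated with a minimal Python sketch. This is not the paper's algorithm: it uses plain fixed-width sliding windows rather than compact windows, and the names token_hash, window_min_hashes, and candidate_window_pairs, as well as the choice of BLAKE2b as the hash function, are assumptions made for illustration only. It shows just the core property that two windows become candidates when their minimum token hash coincides.

```python
import hashlib
from collections import defaultdict


def token_hash(token: str, seed: int = 0) -> int:
    """Deterministic 64-bit hash of a token (hypothetical helper)."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def window_min_hashes(tokens, width, seed=0):
    """Map each min-hash value to the start offsets of the windows that attain it."""
    index = defaultdict(list)
    for start in range(len(tokens) - width + 1):
        window = tokens[start:start + width]
        index[min(token_hash(t, seed) for t in window)].append(start)
    return index


def candidate_window_pairs(tokens_a, tokens_b, width, seed=0):
    """Return (start_a, start_b) pairs of windows that share a min-hash."""
    index_a = window_min_hashes(tokens_a, width, seed)
    index_b = window_min_hashes(tokens_b, width, seed)
    pairs = []
    for h in index_a.keys() & index_b.keys():
        for i in index_a[h]:
            for j in index_b[h]:
                pairs.append((i, j))
    return sorted(pairs)


text_a = "the quick brown fox jumps".split()
text_b = "a quick brown fox sleeps".split()
pairs = candidate_window_pairs(text_a, text_b, width=3)
```

Identical windows always share a min-hash, so the pair of offsets for "quick brown fox" appears in pairs. Windows that merely overlap may or may not match under a single hash function, which is why min-hash schemes typically combine several seeds.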

Paper ID: 6237
Venue: SIGMOD
Year: 2021
Pagerank: 4.6908858e-05
Overall Rank: 7,635 | 46.89%
DOI: 10.1145/3448016.3457548

Incoming Citations (Sorted by Pagerank)

Showing 6 of 6 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 14 of 14 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
