Database Paper Browser


TxtAlign: Efficient Near-Duplicate Text Alignment Search via Bottom-k Sketches for Plagiarism Detection

Summary: TxtAlign uses bottom-k sketches to estimate the similarity between passages, and groups the O(n^2) passages into O(nk) sketch-based groups. This enables corpus-scale source retrieval: near-duplicate passage pairs are found via cross-group sketches, and the grouping takes O(n log n + nk) time. (summarized by gpt-5-nano on Feb 09 2026)
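
To make the idea of estimating passage similarity with bottom-k sketches concrete, here is a minimal Python sketch of the generic technique. It is not TxtAlign's implementation and does not cover its grouping step. The function names (`token_hash`, `bottom_k_sketch`, `estimate_jaccard`), the whitespace tokenization, and the choice of BLAKE2b as the hash function are all assumptions made for illustration.

```python
import hashlib


def token_hash(token: str) -> int:
    """Map a token to a 64-bit integer (hash function chosen for illustration)."""
    digest = hashlib.blake2b(token.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big")


def bottom_k_sketch(tokens, k: int):
    """Return the k smallest distinct hash values of the tokens, in ascending order."""
    return sorted({token_hash(t) for t in tokens})[:k]


def estimate_jaccard(sketch_a, sketch_b, k: int) -> float:
    """Estimate Jaccard similarity from two bottom-k sketches.

    Take the k smallest hashes of the combined sketches, then count how
    many of them appear in both sketches.
    """
    union_bottom = sorted(set(sketch_a) | set(sketch_b))[:k]
    if not union_bottom:
        return 0.0
    shared = set(sketch_a) & set(sketch_b)
    return sum(1 for h in union_bottom if h in shared) / len(union_bottom)


# Hypothetical passages, tokenized on whitespace.
passage_a = "a b c d".split()
passage_b = "c d e f".split()
k = 10
similarity = estimate_jaccard(
    bottom_k_sketch(passage_a, k), bottom_k_sketch(passage_b, k), k
)
```

When k is at least the number of distinct tokens in both passages, as in this toy input, the estimate coincides with the exact Jaccard similarity (here 2 shared tokens out of 6 distinct ones). For smaller k it is an approximation whose accuracy improves as k grows.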

Paper ID: 6487
Venue: SIGMOD
Year: 2022
Pagerank: 4.5435639e-05
Overall Rank: 8,291 | 42.33%
DOI: 10.1145/3514221.3526178

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 4 of 4 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.


Semantically Similar Papers