A Randomized Blocking Structure for Streaming Record Linkage
Summary: ExpBlock: a randomized blocking structure for streaming record linkage that probabilistically retains frequently accessed and recently used blocks in memory while letting inactive/old blocks and records decay to prioritize fresh, promising candidates. Implements lightweight renewal via random choices (no costly sorting), enabling simple, efficient in-memory maintenance and empirically scalable, timely, accurate linkage on high-rate streams. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,631 | On-the-Fly Entity-Aware Query Processing in the Presence of Linkage | 2010 | VLDB | 6.9014378e-05 |
| 4,104 | Online Entity Resolution Using an Oracle | 2016 | VLDB | 6.4493809e-05 |
| 4,383 | Incremental Record Linkage | 2014 | VLDB | 6.2383094e-05 |
| 4,974 | Supervised Meta-blocking | 2014 | VLDB | 5.7903293e-05 |
| 6,175 | Query-Driven Approach to Entity Resolution | 2013 | VLDB | 5.169496e-05 |
| 8,005 | Online Topic-Aware Entity Resolution Over Incomplete Data Streams | 2021 | SIGMOD | 4.6081461e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,324 | Towards Efficient Random-Order Enumeration for Join Queries | 2026 | VLDB | 4.1945683e-05 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |
| 1,064 | Processing Complex Aggregate Queries over Data Streams | 2002 | SIGMOD | 0.00014356481 |
| 8,015 | Streaming Quotient Filter: A Near Optimal Approximate Duplicate Detection Approach for Data Streams | 2013 | VLDB | 4.6051162e-05 |
| 3,665 | Ad-hoc Top-k Query Answering for Data Streams | 2007 | VLDB | 6.8633354e-05 |
| 4,966 | Relative Error Streaming Quantiles | 2021 | PODS | 5.7959749e-05 |
| 4,383 | Incremental Record Linkage | 2014 | VLDB | 6.2383094e-05 |
| 11,833 | Streaming Algorithms for Robust Distinct Elements | 2016 | SIGMOD | 4.1945683e-05 |
| 2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05 |
| 4,905 | Randomized Error Removal for Online Spread Estimation in Data Streaming | 2021 | VLDB | 5.8398332e-05 |