Database Paper Browser

Back to papers

Online Topic-Aware Entity Resolution Over Incomplete Data Streams

Summary: Proposes TER-iDS: online, topic-aware ER over incomplete data streams; imputes missing attributes to link identical entities. Introduces online imputation, pruning, compact indexes/synopsis; efficient TER-iDS index-join algorithm; validated on real data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6129
Venue
SIGMOD
Year
2021
Pagerank
4.6081461e-05
Overall Rank
8,005 | 44.32%
DOI
10.1145/3448016.3457238

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Rank Citing Paper Year Venue Pagerank
9,856 In-Database Data Imputation 2024 SIGMOD 4.269353e-05
11,223 Splitting Tuples of Mismatched Entities 2023 SIGMOD 4.1945683e-05
11,249 A Randomized Blocking Structure for Streaming Record Linkage 2023 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 19 of 19 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
319 Evaluation of entity resolution approaches on real-world match problems 2010 VLDB 0.00027781866
647 Progressive Approximate Aggregate Queries with a Multi-Resolution Tree Structure 2001 SIGMOD 0.00018668224
656 ERACER: A Database Approach for Statistical Inference and Data Cleaning 2010 SIGMOD 0.00018588729
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
1,159 Towards Certain Fixes with Editing Rules and Master Data 2010 VLDB 0.00013592813
1,717 Approximate Join Processing Over Data Streams 2003 SIGMOD 0.00010793312
2,175 Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services 2017 SIGMOD 9.3644117e-05
2,452 Data Fusion – Resolving Data Conflicts for Integration 2009 VLDB 8.7839322e-05
2,460 Combining Quantitative and Logical Data Cleaning 2016 VLDB 8.7617484e-05
3,018 Approximate NN Queries on Streams with Guaranteed Error/performance Bounds 2004 VLDB 7.7002798e-05
3,133 Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing 2017 VLDB 7.4978041e-05
4,104 Online Entity Resolution Using an Oracle 2016 VLDB 6.4493809e-05
5,002 Sequential Data Cleaning: A Statistical Approach 2016 SIGMOD 5.7671075e-05
5,081 Reducing Uncertainty of Schema Matching via Crowdsourcing 2013 VLDB 5.7132042e-05
5,253 Enriching Data Imputation with Extensive Similarity Neighbors 2015 VLDB 5.6014916e-05
5,852 Repairing Vertex Labels under Neighborhood Constraints 2014 VLDB 5.3007132e-05
6,583 SCREEN: Stream Data Cleaning under Speed Constraints 2015 SIGMOD 5.0027988e-05
7,345 Linking Temporal Records for Profiling Entities 2015 SIGMOD 4.756212e-05
8,944 A Probabilistic Model for Linking Named Entities in Web Text with Heterogeneous Information Networks 2014 SIGMOD 4.4258255e-05
Previous Page 1 / 1 Next

Semantically Similar Papers