Record Linkage with Uniqueness Constraints and Erroneous Values
Summary: Models record linkage under uniqueness constraints as a k-partite graph clustering problem that combines value similarity with source co-occurrence. The approach tolerates a limited number of constraint violations, separates incorrect values from true ones, and is reported to be scalable and accurate. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Songtao Guo
2. Xin Luna Dong
3. Divesh Srivastava
4. Remi Zajac
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,823 | Interaction between Record Matching and Data Repairing | 2011 | SIGMOD | 8.0593894e-05 |
| 4,383 | Incremental Record Linkage | 2014 | VLDB | 6.2383094e-05 |
| 6,690 | Parallel Discrepancy Detection and Incremental Detection | 2021 | VLDB | 4.9621556e-05 |
| 9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05 |
| 11,054 | Enriching Relations with Additional Attributes for ER | 2024 | VLDB | 4.1945683e-05 |
| 11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 229 | Reference Reconciliation in Complex Information Spaces | 2005 | SIGMOD | 0.00032242633 |
| 265 | A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification | 2005 | SIGMOD | 0.00029763412 |
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |
| 560 | Dependencies Revisited for Improving Data Quality | 2008 | PODS | 0.00020141923 |
| 702 | Reasoning about Record Matching Rules | 2009 | VLDB | 0.00017918203 |
| 855 | Integrating Conflicting Data: The Role of Source Dependence | 2009 | VLDB | 0.00015906735 |
| 2,386 | Leveraging Aggregate Constraints For Deduplication | 2007 | SIGMOD | 8.9231895e-05 |
| 2,452 | Data Fusion – Resolving Data Conflicts for Integration | 2009 | VLDB | 8.7839322e-05 |
| 7,653 | CORDS: Automatic Generation of Correlation Statistics in DB2 | 2004 | VLDB | 4.6875371e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,549 | LinkDB: A Probabilistic Linkage Database System | 2011 | SIGMOD | 4.4937074e-05 |
| 3,360 | Modeling and Querying Possible Repairs in Duplicate Detection | 2009 | VLDB | 7.1742067e-05 |
| 67 | The Merge/Purge Problem for Large Databases | 1995 | SIGMOD | 0.00061348205 |
| 7,345 | Linking Temporal Records for Profiling Entities | 2015 | SIGMOD | 4.756212e-05 |
| 3,631 | On-the-Fly Entity-Aware Query Processing in the Presence of Linkage | 2010 | VLDB | 6.9014378e-05 |
| 2,405 | Linking Temporal Records | 2011 | VLDB | 8.8815018e-05 |
| 11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05 |
| 3,130 | Behavior Based Record Linkage | 2010 | VLDB | 7.4993061e-05 |
| 4,383 | Incremental Record Linkage | 2014 | VLDB | 6.2383094e-05 |
| 322 | Record Linkage: Similarity Measures and Algorithms | 2006 | SIGMOD | 0.00027518768 |