Large-Scale Collective Entity Matching
Summary: Proposes a principled, neighborhood-based framework to scale any generic Entity Matching (EM) algorithm by running multiple EM instances on small data neighborhoods and exchanging messages to converge on a global solution. It provides formal properties and empirical validation for scalable EM on large real-world datasets. (summarized by gpt-5-nano on Feb 09 2026)
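The idea in the summary can be illustrated with a toy sketch: a black-box matcher is run on small neighborhoods, and newly found matches are passed as evidence to other neighborhoods until nothing changes. This is not the paper's algorithm. The record layout, the `compatible` and `toy_matcher` rules, and the `match_with_messages` loop are all assumptions made up for illustration.

```python
from collections import deque
from itertools import combinations


def compatible(name_a, name_b):
    # Hypothetical rule: same last token and same first initial.
    a, b = name_a.split(), name_b.split()
    return a[-1] == b[-1] and a[0][0] == b[0][0]


def toy_matcher(records, neighborhood, evidence):
    # Stand-in for a generic EM algorithm (an assumption, not a real library).
    # Matches pairs inside one neighborhood, using known matches as evidence.
    matches = set()
    for x, y in combinations(sorted(neighborhood), 2):
        rx, ry = records[x], records[y]
        if rx["name"] == ry["name"]:
            matches.add(frozenset((x, y)))
        elif compatible(rx["name"], ry["name"]):
            # Collective step: similar names match only if some pair of
            # coauthors is identical or already known to match.
            if any(cx == cy or frozenset((cx, cy)) in evidence
                   for cx in rx["coauthors"] for cy in ry["coauthors"]):
                matches.add(frozenset((x, y)))
    return matches


def match_with_messages(records, neighborhoods, matcher):
    evidence = set()                      # all matches found so far
    queue = deque(range(len(neighborhoods)))
    while queue:
        i = queue.popleft()
        new = matcher(records, neighborhoods[i], evidence) - evidence
        if not new:
            continue
        evidence |= new
        touched = set().union(*new)       # records involved in new matches
        # "Message": re-run neighborhoods that contain, or refer to, a touched record.
        for j, nb in enumerate(neighborhoods):
            if j == i or j in queue:
                continue
            if any(r in touched or records[r]["coauthors"] & touched for r in nb):
                queue.append(j)
    return evidence


records = {
    "a1": {"name": "J. Smith", "coauthors": {"b1"}},
    "a2": {"name": "John Smith", "coauthors": {"b2"}},
    "b1": {"name": "Mary Jones", "coauthors": {"a1"}},
    "b2": {"name": "Mary Jones", "coauthors": {"a2"}},
}
neighborhoods = [{"a1", "a2"}, {"b1", "b2"}]
result = match_with_messages(records, neighborhoods, toy_matcher)
```

In this toy run the first neighborhood finds nothing on its own. Once the second neighborhood matches `b1` and `b2`, that match is passed back as evidence and the first neighborhood is re-run, which then matches `a1` and `a2`. Because the evidence set only grows and is bounded by the number of record pairs, the loop terminates.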
Authors
1. Vibhor Rastogi
2. Nilesh Dalvi
3. Minos Garofalakis
Incoming Citations (Sorted by PageRank)
Showing 11 of 11 citing papers.
| Rank | Citing Paper | Year | Venue | PageRank |
|---|---|---|---|---|
| 814 | Entity Resolution: Theory, Practice & Open Challenges | 2012 | VLDB | 1.6370594e-04 |
| 3,694 | Keys for Graphs | 2015 | VLDB | 6.8345712e-05 |
| 3,711 | Saga: A Platform for Continuous Construction and Serving of Knowledge At Scale | 2022 | SIGMOD | 6.823609e-05 |
| 4,126 | Waldo: An Adaptive Human Interface for Crowd Entity Resolution | 2017 | SIGMOD | 6.4314729e-05 |
| 6,690 | Parallel Discrepancy Detection and Incremental Detection | 2021 | VLDB | 4.9621556e-05 |
| 7,668 | Human-in-the-loop Data Integration | 2017 | VLDB | 4.6834075e-05 |
| 8,422 | Deducing Certain Fixes to Graphs | 2019 | VLDB | 4.5167705e-05 |
| 8,436 | A Critical Re-evaluation of Neural Methods for Entity Alignment | 2022 | VLDB | 4.5138915e-05 |
| 9,846 | HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs | 2025 | VLDB | 4.2721228e-05 |
| 11,930 | ConfSeer: Leveraging Customer Support Knowledge Bases for Automated Misconfiguration Detection | 2015 | VLDB | 4.1945683e-05 |
| 12,044 | Knowledge Harvesting in the Big-Data Era | 2013 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by PageRank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | PageRank |
|---|---|---|---|---|
| 229 | Reference Reconciliation in Complex Information Spaces | 2005 | SIGMOD | 0.00032242633 |
| 280 | Eliminating Fuzzy Duplicates in Data Warehouses | 2002 | VLDB | 0.00029113044 |
| 684 | Towards a Robust Query Optimizer: A Principled and Practical Approach | 2005 | SIGMOD | 0.00018179769 |
| 1,410 | Entity Resolution with Iterative Blocking | 2009 | SIGMOD | 0.00012127555 |