Database Paper Browser

Back to papers

Explore or Exploit? Effective Strategies for Disambiguating Large Databases

Summary: Disambiguation in large databases under limited cleaning budget, with uncertain candidate quality and success. The Explore-Exploit (EE) algorithm learns from ongoing cleaning to allocate budget, beating greedy baselines; robust to unknown cleaning probabilities, validated on real and synthetic data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10128
Venue
VLDB
Year
2010
Pagerank
4.9672601e-05
Overall Rank
6,670 | 53.60%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Rank Citing Paper Year Venue Pagerank
2,722 Progressive Approach to Relational Entity Resolution 2014 VLDB 8.2338356e-05
7,185 Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) 2019 VLDB 4.8066159e-05
9,043 Query-Guided Resolution in Uncertain Databases 2023 SIGMOD 4.4039656e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers