Database Paper Browser

In-context Clustering-based Entity Resolution with Large Language Models: A Design Space Exploration

Summary: In-context clustering prompts an LLM to cluster a set of records directly instead of comparing records pairwise, which reduces API calls and running time and improves scalability. LLM-CER maps the design space (set size, diversity, ordering) and adds cluster merging and hallucination mitigation, yielding up to 150% accuracy gains and 5× fewer API calls. (summarized by gpt-5-mini on Feb 11 2026)
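
To make the API-call argument in the summary concrete, here is a minimal counting sketch. It is not the paper's method: the function names `pairwise_calls` and `set_calls` are hypothetical, and the model assumes one LLM call per record pair versus one call per disjoint record set of a fixed size, with no blocking and no extra calls for cluster merging. Real savings depend on those steps, which is why the reported figure (5× fewer calls) differs from this idealized count.

```python
import math


def pairwise_calls(n: int) -> int:
    """Calls needed if every unordered pair of n records gets its own prompt."""
    return n * (n - 1) // 2


def set_calls(n: int, set_size: int) -> int:
    """Calls needed if n records are split into disjoint sets of at most
    set_size records and each set is clustered in a single prompt.
    Assumption: a single pass, with no follow-up merging calls."""
    if set_size < 1:
        raise ValueError("set_size must be at least 1")
    return math.ceil(n / set_size)


if __name__ == "__main__":
    n = 100
    print(pairwise_calls(n))   # 4950
    print(set_calls(n, 10))    # 10
```

A single pass over disjoint sets cannot detect matches that span two sets, so any practical pipeline needs additional calls to merge clusters across sets; the sketch only shows why per-set prompting starts from a much smaller call budget than per-pair prompting.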

Paper ID: 7327
Venue: SIGMOD
Year: 2026
Pagerank: 4.1945683e-05
Overall Rank: 10,022 | 30.28%
DOI: 10.1145/3749170

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 20 of 20 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636
263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413
300 | Deep Learning for Entity Matching: A Design Space Exploration | 2018 | SIGMOD | 0.00028441466
517 | Can Foundation Models Wrangle Your Data? | 2023 | VLDB | 0.00021169035
643 | Corleone: Hands-Off Crowdsourcing for Entity Matching | 2014 | SIGMOD | 0.00018754451
692 | Pay-as-you-go User Feedback for Dataspace Systems | 2008 | SIGMOD | 0.00018083948
712 | Magellan: Toward Building Entity Matching Management Systems | 2016 | VLDB | 0.00017732426
754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211
814 | Entity Resolution: Theory, Practice & Open Challenges | 2012 | VLDB | 0.00016370594
866 | Leveraging Transitive Relations for Crowdsourced Joins | 2013 | SIGMOD | 0.00015801196
936 | Framework for Evaluating Clustering Algorithms in Duplicate Detection | 2009 | VLDB | 0.0001521549
1,242 | Question Selection for Crowd Entity Resolution | 2013 | VLDB | 0.00013096655
1,841 | Crowdsourcing Algorithms for Entity Resolution | 2014 | VLDB | 0.00010348858
2,175 | Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services | 2017 | SIGMOD | 9.3644117e-05
2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05
2,767 | A Comprehensive Benchmark Framework for Active Learning Methods in Entity Matching | 2020 | SIGMOD | 8.1513883e-05
3,640 | Deep Learning for Blocking in Entity Matching: A Design Space Exploration | 2021 | VLDB | 6.8891671e-05
4,619 | Crowd-Based Deduplication: An Adaptive Approach | 2015 | SIGMOD | 6.0444854e-05
5,362 | Cost-Effective Crowdsourced Entity Resolution: A Partial-Order Approach | 2016 | SIGMOD | 5.5473503e-05
5,533 | Dual-Objective Fine-Tuning of BERT for Entity Matching | 2021 | VLDB | 5.4544359e-05

Semantically Similar Papers