Human-in-the-loop Data Integration
Summary: Hybrid human–machine data integration for entity matching uses learned rules and DIMA to propose candidate matches. A crowd-driven selection-inference-refine workflow verifies candidates with transitivity-based inference via a SQL-like CDB on platforms. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Guoliang Li
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 782 | QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning | 2019 | VLDB | 0.00016729063 |
| 2,730 | Open Data Integration | 2018 | VLDB | 8.2126735e-05 |
| 3,473 | AI Meets Database: AI4DB and DB4AI | 2021 | SIGMOD | 7.062864e-05 |
| 6,868 | Cost-Effective Data Annotation using Game-Based Crowdsourcing | 2019 | VLDB | 4.9010083e-05 |
| 8,008 | Entity Resolution On-Demand | 2022 | VLDB | 4.6067684e-05 |
| 9,896 | Towards Interpretable and Learnable Risk Analysis for Entity Resolution | 2020 | SIGMOD | 4.2600049e-05 |
| 10,216 | The Case For Language Model Approximated LIKE Predicate | 2026 | SIGMOD | 4.1945683e-05 |
| 10,617 | Deduplicated Sampling On-Demand | 2025 | VLDB | 4.1945683e-05 |
| 11,707 | A Rating-Ranking Method for Crowdsourced Top-k Computation | 2018 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 43 of 43 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264 |
| 11,699 | (Artificial) Mind over Matter: Integrating Humans and Algorithms in Solving Matching Problems | 2018 | SIGMOD | 4.1945683e-05 |
| 866 | Leveraging Transitive Relations for Crowdsourced Joins | 2013 | SIGMOD | 0.00015801196 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 0.00010109102 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746 |
| 3,935 | CrowdQ: Crowdsourced Query Understanding | 2013 | CIDR | 6.6163464e-05 |
| 5,081 | Reducing Uncertainty of Schema Matching via Crowdsourcing | 2013 | VLDB | 5.7132042e-05 |
| 263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413 |
| 1,841 | Crowdsourcing Algorithms for Entity Resolution | 2014 | VLDB | 0.00010348858 |
| 4,416 | CrowdMatcher: Crowd-Assisted Schema Matching | 2014 | SIGMOD | 6.2039225e-05 |