EasyDR: A Human-in-the-loop Error Detection and Repair Platform for Holistic Table Cleaning
Summary: Holistic cleaning with human-in-the-loop: EasyDR detects errors and repairs via ML-guided crowdsourcing. Generates domain-aware summaries and difficulty-aware ordering; provides a declarative language to tailor targets, scope, and ML-crowd cooperation. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yihai Xi
- 2. Ning Wang
- 3. Xinyu Chen
- 4. Yiyi Zhang
- 5. Zilong Wang
- 6. Zhihong Xu
- 7. Yue Wang
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,723 | UniClean: A Scalable Data Cleaning Solution for Mixed Errors based on Unified Cleaners and Optimized Cleaning Workflow | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 643 | Corleone: Hands-Off Crowdsourcing for Entity Matching | 2014 | SIGMOD | 0.00018754451 |
| 1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851 |
| 3,067 | CrowdFill: Collecting Structured Data from the Crowd | 2014 | SIGMOD | 7.6180371e-05 |
| 7,237 | CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning | 2017 | VLDB | 4.7928651e-05 |
| 7,575 | Human-in-the-loop Outlier Detection | 2020 | SIGMOD | 4.7068909e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,817 | Error Diagnosis and Data Profiling with Data X-Ray | 2015 | VLDB | 4.9171711e-05 |
| 5,729 | KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing | 2015 | VLDB | 5.3506368e-05 |
| 11,111 | Rock: Cleaning Data with both ML and Logic Rules | 2024 | VLDB | 4.1945683e-05 |
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 263 | CrowdER: Crowdsourcing Entity Resolution | 2012 | VLDB | 0.00029862413 |
| 833 | Guided Data Repair | 2011 | VLDB | 0.00016138432 |
| 10,512 | Auto-Test: Learning Semantic-Domain Constraints for Unsupervised Error Detection in Tables | 2025 | SIGMOD | 4.1945683e-05 |
| 9,577 | CoClean: Collaborative Data Cleaning | 2020 | SIGMOD | 4.3248438e-05 |
| 3,713 | GDR: A System for Guided Data Repair | 2010 | SIGMOD | 6.8224341e-05 |
| 10,821 | Demonstrating Matelda for Multi-Table Error Detection | 2025 | VLDB | 4.1945683e-05 |