CoClean: Collaborative Data Cleaning
Summary: CoClean enables crowd-in-the-loop data cleaning for Pandas via Collaborative dataframe (CDF), aggregating annotations from multiple users. Data assignment lay GUI cleaning, power-user Jupyter scripting, repair hints, supports blind-on/off collaboration. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Mashaal Musleh
- 2. Mourad Ouzzani
- 3. Nan Tang
- 4. AnHai Doan
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,434 | Rock: Cleaning Data by Embedding ML in Logic Rules | 2024 | SIGMOD | 4.3430376e-05 |
| 9,856 | In-Database Data Imputation | 2024 | SIGMOD | 4.269353e-05 |
| 10,610 | Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 1,612 | Detecting Data Errors: Where are we and what needs to be done? | 2016 | VLDB | 0.00011142794 |
| 2,968 | Raha: A Configuration-Free Error Detection System | 2019 | SIGMOD | 7.7985097e-05 |
Previous
Page 1 / 1
Next