Potter's Wheel: An Interactive Data Cleaning System
Summary: Potter's Wheel is an interactive data-cleaning system that tightly couples transformation with discrepancy detection, enabling progressive fixes in a spreadsheet-like interface. Transforms are defined graphically or by example, with immediate visual feedback; the system infers value structures and checks constraints in the background.
(summarized by gpt-5-nano on Feb 09 2026)
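To make the idea of inferring value structures and flagging discrepancies concrete, here is a minimal sketch. It is an illustration under simplifying assumptions, not the algorithm from the paper: the helper names `shape` and `find_discrepancies` are hypothetical, the structure of a value is reduced to runs of digits and letters, and the most common structure in a column is simply treated as the expected one.

```python
from collections import Counter
from itertools import groupby


def shape(value: str) -> str:
    """Reduce a string to a coarse structure (hypothetical helper).

    Runs of digits become 'd+', runs of letters become 'a+',
    and every other character is kept literally.
    """
    classes = []
    for ch in value:
        if ch.isdigit():
            classes.append("d")
        elif ch.isalpha():
            classes.append("a")
        else:
            classes.append(ch)

    parts = []
    for cls, run in groupby(classes):
        run_length = len(list(run))
        if cls in ("d", "a"):
            parts.append(cls + "+")        # collapse a digit/letter run
        else:
            parts.append(cls * run_length)  # keep punctuation as written
    return "".join(parts)


def find_discrepancies(column):
    """Return the most common structure and the values that deviate from it.

    Assumption: the majority structure is the intended one. A real system
    would use a more principled criterion than a simple majority vote.
    """
    if not column:
        return None, []
    shapes = [shape(v) for v in column]
    dominant, _ = Counter(shapes).most_common(1)[0]
    outliers = [v for v, s in zip(column, shapes) if s != dominant]
    return dominant, outliers


dates = ["2001-09-11", "2001-09-12", "Sept 13, 2001", "2001-09-14"]
dominant, outliers = find_discrepancies(dates)
print(dominant)   # structure shared by most values
print(outliers)   # values that do not fit that structure
```

In an interactive setting, the flagged values would be the ones surfaced to the user, who could then apply a transform (for example, reformatting the date) and see the column re-checked.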
- Paper ID: 8762
- Venue: VLDB
- Year: 2001
- Pagerank: 0.00047045036
- Overall Rank: 112 | 99.23%
- DOI:
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 16 of 66 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 7,777 | Indexing Mixed Types for Approximate Retrieval | 2005 | VLDB | 4.653704e-05 |
| 7,812 | Foofah: A Programming-By-Example System for Synthesizing Data Transformation Programs | 2017 | SIGMOD | 4.6443197e-05 |
| 7,838 | Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes | 2021 | SIGMOD | 4.6377995e-05 |
| 8,092 | Saga: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications | 2023 | SIGMOD | 4.587921e-05 |
| 8,579 | RECA: Related Tables Enhanced Column Semantic Type Annotation Framework | 2023 | VLDB | 4.4922446e-05 |
| 9,049 | JENNER: Just-in-time Enrichment in Query Processing | 2022 | VLDB | 4.4039656e-05 |
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 9,301 | Repairing Data through Regular Expressions | 2016 | VLDB | 4.3587281e-05 |
| 9,389 | DataVinci: Learning Syntactic and Semantic String Repairs | 2025 | SIGMOD | 4.3441378e-05 |
| 9,984 | Towards Scalable Visual Data Wrangling via Direct Manipulation | 2026 | CIDR | 4.1945683e-05 |
| 10,216 | The Case For Language Model Approximated LIKE Predicate | 2026 | SIGMOD | 4.1945683e-05 |
| 10,610 | Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation | 2025 | VLDB | 4.1945683e-05 |
| 11,691 | Enabling Data Science for the Majority | 2019 | VLDB | 4.1945683e-05 |
| 12,202 | Microsoft Codename "Montego" - Data Import, Transformation, and Publication for Information Workers | 2011 | VLDB | 4.1945683e-05 |
| 12,425 | XClean in Action: A Demonstration of Declarative XML Data Cleaning | 2007 | CIDR | 4.1945683e-05 |
| 12,524 | Quality Views: Capturing and Exploiting the User Perspective on Data Quality | 2006 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851 |
| 2,097 | Predictive Interaction for Data Transformation | 2015 | CIDR | 9.5489822e-05 |
| 7,926 | CoCo: Interactive Exploration of Conformance Constraints for Data Understanding and Data Cleaning | 2021 | SIGMOD | 4.6144554e-05 |
| 6,384 | A Demonstration of DBWipes: Clean as You Query | 2012 | VLDB | 5.0880333e-05 |
| 10,821 | Demonstrating Matelda for Multi-Table Error Detection | 2025 | VLDB | 4.1945683e-05 |
| 5,929 | ActiveClean: An Interactive Data Cleaning Framework For Modern Machine Learning | 2016 | SIGMOD | 5.2682177e-05 |
| 199 | Declarative Data Cleaning: Language, Model, and Algorithms | 2001 | VLDB | 0.00035041015 |
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 9,221 | VisClean: Interactive Cleaning for Progressive Visualization | 2020 | VLDB | 4.3699444e-05 |
| 7,564 | PIClean: A Probabilistic and Interactive Data Cleaning System | 2019 | SIGMOD | 4.7093702e-05 |