Back to papers
Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples
Summary: Falcon is an interactive, deterministic, declarative data-cleaning system that repairs data with SQL UPDATEs, avoiding predefined quality rules. From one user update, it searches a lattice of repairs to yield a minimal, high-coverage set, via multi-hop search and user-guided validation.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 5269
- Venue
- SIGMOD
- Year
- 2016
- Pagerank
- 4.3639892e-05
- Overall Rank
- 9,278 | 35.46%
- DOI
-
10.1145/2882903.2915242
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 26 of 26 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 112 |
Potter's Wheel: An Interactive Data Cleaning System |
2001 |
VLDB |
0.00047045036 |
| 199 |
Declarative Data Cleaning: Language, Model, and Algorithms |
2001 |
VLDB |
0.00035041015 |
| 224 |
CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies |
2004 |
SIGMOD |
0.00032746205 |
| 265 |
A Cost-Based Model and Effective Heuristic for Repairing Constraints by Value Modification |
2005 |
SIGMOD |
0.00029763412 |
| 555 |
Discovering Denial Constraints |
2013 |
VLDB |
0.00020254908 |
| 732 |
Discovering Data Quality Rules |
2008 |
VLDB |
0.00017465093 |
| 833 |
Guided Data Repair |
2011 |
VLDB |
0.00016138432 |
| 881 |
Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes |
2013 |
SIGMOD |
0.00015661103 |
| 1,012 |
NADEEF: A Commodity Data Cleaning System |
2013 |
SIGMOD |
0.0001464733 |
| 1,159 |
Towards Certain Fixes with Editing Rules and Master Data |
2010 |
VLDB |
0.00013592813 |
| 1,197 |
The LLUNATIC Data-Cleaning Framework |
2013 |
VLDB |
0.00013390321 |
| 1,509 |
Discovering Queries based on Example Tuples |
2014 |
SIGMOD |
0.00011612727 |
| 1,572 |
Reverse Engineering Complex Join Queries |
2013 |
SIGMOD |
0.00011298251 |
| 2,078 |
Sample-Driven Schema Mapping |
2012 |
SIGMOD |
9.599707e-05 |
| 2,097 |
Predictive Interaction for Data Transformation |
2015 |
CIDR |
9.5489822e-05 |
| 2,638 |
Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms |
2016 |
VLDB |
8.399764e-05 |
| 2,823 |
Interaction between Record Matching and Data Repairing |
2011 |
SIGMOD |
8.0593894e-05 |
| 2,847 |
Building, Maintaining, and Using Knowledge Bases: A Report from the Trenches |
2013 |
SIGMOD |
8.0224023e-05 |
| 2,946 |
BigDansing: A System for Big Data Cleansing |
2015 |
SIGMOD |
7.8372441e-05 |
| 3,192 |
Towards Dependable Data Repairing with Fixing Rules |
2014 |
SIGMOD |
7.4095761e-05 |
| 4,422 |
Interactive Join Query Inference with JIM |
2014 |
VLDB |
6.2008389e-05 |
| 4,599 |
Playful Query Specification with DataPlay |
2012 |
VLDB |
6.0583418e-05 |
| 4,682 |
Scalable Discovery of Unique Column Combinations |
2014 |
VLDB |
6.0022412e-05 |
| 5,032 |
Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration |
2013 |
VLDB |
5.748807e-05 |
| 5,382 |
That's All Folks! LLUNATIC Goes Open Source |
2014 |
VLDB |
5.5397633e-05 |
| 6,350 |
NADEEF: A Generalized Data Cleaning System |
2013 |
VLDB |
5.101815e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 5,445 |
QFix: Diagnosing Errors through Query Histories |
2017 |
SIGMOD |
5.5020909e-05 |
| 1,197 |
The LLUNATIC Data-Cleaning Framework |
2013 |
VLDB |
0.00013390321 |
| 11,837 |
QFix: Demonstrating Error Diagnosis in Query Histories |
2016 |
SIGMOD |
4.1945683e-05 |
| 2,823 |
Interaction between Record Matching and Data Repairing |
2011 |
SIGMOD |
8.0593894e-05 |
| 1,624 |
Sampling the Repairs of Functional Dependency Violations under Hard Constraints |
2010 |
VLDB |
0.00011099222 |
| 2,460 |
Combining Quantitative and Logical Data Cleaning |
2016 |
VLDB |
8.7617484e-05 |
| 5,660 |
Descriptive and Prescriptive Data Cleaning |
2014 |
SIGMOD |
5.3847321e-05 |
| 199 |
Declarative Data Cleaning: Language, Model, and Algorithms |
2001 |
VLDB |
0.00035041015 |
| 623 |
Improving Data Quality: Consistency and Accuracy |
2007 |
VLDB |
0.00018996374 |
| 3,192 |
Towards Dependable Data Repairing with Fixing Rules |
2014 |
SIGMOD |
7.4095761e-05 |