Cleaning Denial Constraint Violations through Relaxation
Summary: Daisy performs probabilistic repair of denial-constraint violations on demand, as part of analytical query processing. It weaves cleaning operators into query plans and relaxes query results over dirty data, adapting to the workload and outperforming offline cleaning on both synthetic and real data.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 5990
- Venue: SIGMOD
- Year: 2020
- Pagerank: 6.3003864e-05
- Overall Rank: 4,273 | 70.28%
- DOI: 10.1145/3318464.3389775
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 17 of 17 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,396 | Automatic Data Repair: Are We Ready to Deploy? | 2024 | VLDB | 7.1455126e-05 |
| 6,477 | Fast Algorithms for Denial Constraint Discovery | 2023 | VLDB | 5.0488285e-05 |
| 7,667 | Fast Detection of Denial Constraint Violations | 2022 | VLDB | 4.683767e-05 |
| 8,472 | Rapidash: Efficient Detection of Constraint Violations | 2024 | VLDB | 4.5036378e-05 |
| 8,745 | Sparcle: Boosting the Accuracy of Data Cleaning Systems through Spatial Awareness | 2024 | VLDB | 4.456315e-05 |
| 8,836 | Fast Approximate Denial Constraint Discovery | 2023 | VLDB | 4.4393184e-05 |
| 9,049 | JENNER: Just-in-time Enrichment in Query Processing | 2022 | VLDB | 4.4039656e-05 |
| 9,240 | ZIP: Lazy Imputation during Query Processing | 2024 | VLDB | 4.3690661e-05 |
| 9,348 | GIDCL: A Graph-Enhanced Interpretable Data Cleaning Framework with Large Language Models | 2024 | SIGMOD | 4.3526427e-05 |
| 9,749 | Efficient Differential Dependency Discovery | 2024 | VLDB | 4.2897489e-05 |
| 9,849 | Reptile: Aggregation-level Explanations for Hierarchical Data | 2022 | SIGMOD | 4.2721228e-05 |
| 10,617 | Deduplicated Sampling On-Demand | 2025 | VLDB | 4.1945683e-05 |
| 10,679 | How and Why False Denial Constraints are Discovered | 2025 | VLDB | 4.1945683e-05 |
| 10,723 | UniClean: A Scalable Data Cleaning Solution for Mixed Errors based on Unified Cleaners and Optimized Cleaning Workflow | 2025 | VLDB | 4.1945683e-05 |
| 11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05 |
| 11,507 | TQEL: Framework for Query-Driven Linking of Top-K Entities in Social Media Blogs | 2021 | VLDB | 4.1945683e-05 |
| 11,536 | LOCATER: Cleaning WiFi Connectivity Datasets for Semantic Localization | 2021 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 17 of 17 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 49 | Consistent Query Answers in Inconsistent Databases | 1999 | PODS | 0.00067660624 |
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 791 | ActiveClean: Interactive Data Cleaning For Statistical Modeling | 2016 | VLDB | 0.00016629664 |
| 881 | Don’t be SCAREd: Use SCalable Automatic REpairing with Maximal Likelihood and Bounded Changes | 2013 | SIGMOD | 0.00015661103 |
| 1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733 |
| 1,074 | Processing Theta-Joins using MapReduce* | 2011 | SIGMOD | 0.00014260096 |
| 1,197 | The LLUNATIC Data-Cleaning Framework | 2013 | VLDB | 0.00013390321 |
| 2,184 | A Sample-and-Clean Framework for Fast and Accurate Query Processing on Dirty Data | 2014 | SIGMOD | 9.3429789e-05 |
| 2,514 | Comparative Analysis of Approximate Blocking Techniques for Entity Resolution | 2016 | VLDB | 8.6139012e-05 |
| 2,573 | Query Optimization for Dynamic Imputation | 2017 | VLDB | 8.518235e-05 |
| 2,638 | Messing Up with BART: Error Generation for Evaluating Data-Cleaning Algorithms | 2016 | VLDB | 8.399764e-05 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 3,524 | Efficient Querying of Inconsistent Databases with Binary Integer Programming | 2013 | VLDB | 7.0087032e-05 |
| 5,586 | QuERy: A Framework for Integrating Entity Resolution with Query Processing | 2016 | VLDB | 5.4219548e-05 |
| 5,857 | Making SQL Queries Correct on Incomplete Databases: A Feasibility Study | 2016 | PODS | 5.3000054e-05 |
| 7,237 | CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning | 2017 | VLDB | 4.7928651e-05 |
| 8,166 | CAvSAT: A System for Query Answering over Inconsistent Databases | 2019 | SIGMOD | 4.5712945e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 7,867 | Learning Over Dirty Data Without Cleaning | 2020 | SIGMOD | 4.6320452e-05 |
| 9,478 | Incremental Detection of Denial Constraint Violations | 2025 | VLDB | 4.3341665e-05 |
| 1,197 | The LLUNATIC Data-Cleaning Framework | 2013 | VLDB | 0.00013390321 |
| 9,369 | Constraint-Variance Tolerant Data Repairing | 2016 | SIGMOD | 4.3481081e-05 |
| 1,624 | Sampling the Repairs of Functional Dependency Violations under Hard Constraints | 2010 | VLDB | 0.00011099222 |
| 5,660 | Descriptive and Prescriptive Data Cleaning | 2014 | SIGMOD | 5.3847321e-05 |
| 2,483 | Discovery of Approximate (and Exact) Denial Constraints | 2020 | VLDB | 8.6864916e-05 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 7,667 | Fast Detection of Denial Constraint Violations | 2022 | VLDB | 4.683767e-05 |