Estimating the Confidence of Conditional Functional Dependencies
Summary: One- or two-pass, small-space estimation of CFD confidence via sampling and sketching. Works with known or discovered pattern tableaux, offering additive-error guarantees (no relative guarantees) and strong empirical accuracy with compact summaries on real and synthetic data. (summarized by gpt-5-nano on Feb 09 2026)
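As an illustration of the quantity being estimated, the following is a minimal sketch, not the paper's algorithm. It assumes the common definition of confidence as the largest fraction of matching tuples that can be kept without any violation, which means keeping the most frequent right-hand-side value within each left-hand-side group. The estimator shown is a simple hash-based sample of left-hand-side keys, swapped in for the paper's sampling and sketching summaries. The names `exact_confidence`, `sampled_confidence`, `rate`, `salt`, and `pattern` are hypothetical and used only for this example.

```python
import hashlib
from collections import Counter, defaultdict


def exact_confidence(rows, lhs, rhs, pattern=None):
    """Fraction of pattern-matching rows kept when each LHS group keeps
    only its most frequent RHS value (assumed definition of confidence)."""
    pattern = pattern or {}
    groups = defaultdict(Counter)
    for row in rows:
        # A single constant pattern stands in for a pattern tableau here.
        if all(row[a] == v for a, v in pattern.items()):
            key = tuple(row[a] for a in lhs)
            groups[key][row[rhs]] += 1
    total = sum(sum(c.values()) for c in groups.values())
    if total == 0:
        return 1.0  # no matching rows: the dependency holds vacuously
    kept = sum(max(c.values()) for c in groups.values())
    return kept / total


def _key_sampled(key, rate, salt):
    """Deterministically keep roughly a `rate` fraction of LHS keys."""
    digest = hashlib.sha256((salt + repr(key)).encode()).digest()
    return int.from_bytes(digest[:8], "big") < rate * 2**64


def sampled_confidence(rows, lhs, rhs, rate, pattern=None, salt="demo"):
    """One-pass estimate: count only rows whose LHS key is in the sample,
    so space grows with the number of sampled keys, not all keys."""
    sampled_rows = (
        row for row in rows
        if _key_sampled(tuple(row[a] for a in lhs), rate, salt)
    )
    return exact_confidence(sampled_rows, lhs, rhs, pattern)


rows = [
    {"zip": "1", "city": "A"},
    {"zip": "1", "city": "A"},
    {"zip": "1", "city": "B"},
    {"zip": "2", "city": "C"},
]
print(exact_confidence(rows, ["zip"], "city"))
print(sampled_confidence(rows, ["zip"], "city", rate=1.0))
```

Sampling whole left-hand-side groups rather than individual rows is a deliberate choice in this sketch: sampling rows uniformly tends to thin each group out, which makes violations harder to see and pushes the estimate upward.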
Authors
1. Graham Cormode
2. Lukasz Golab
3. Flip Korn
4. Andrew McGregor
5. Divesh Srivastava
6. Xi Zhang
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 894 | A Hybrid Approach to Functional Dependency Discovery | 2016 | SIGMOD | 0.00015556428 |
| 2,059 | Stream Warehousing with DataDepot | 2009 | SIGMOD | 9.6582554e-05 |
| 3,713 | GDR: A System for Guided Data Repair | 2010 | SIGMOD | 6.8224341e-05 |
| 4,682 | Scalable Discovery of Unique Column Combinations | 2014 | VLDB | 6.0022412e-05 |
| 5,153 | Horizon: Scalable Dependency-driven Data Cleaning | 2021 | VLDB | 5.6607963e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 224 | CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies | 2004 | SIGMOD | 0.00032746205 |
| 473 | Sampling Large Databases for Association Rules | 1996 | VLDB | 0.0002233798 |
| 623 | Improving Data Quality: Consistency and Accuracy | 2007 | VLDB | 0.00018996374 |
| 732 | Discovering Data Quality Rules | 2008 | VLDB | 0.00017465093 |
| 1,188 | On Generating Near-Optimal Tableaux for Conditional Functional Dependencies | 2008 | VLDB | 0.00013441729 |
| 1,401 | Extending Dependencies with Conditions | 2007 | VLDB | 0.00012187775 |
| 1,472 | Space Efficient Mining of Multigraph Streams | 2005 | PODS | 0.00011828662 |
| 1,974 | BHUNT: Automatic Discovery of Fuzzy Algebraic Constraints in Relational Data | 2003 | VLDB | 9.8866171e-05 |
| 4,253 | The Power of Sampling in Knowledge Discovery | 1994 | PODS | 6.323083e-05 |
| 6,286 | A Dip in the Reservoir: Maintaining Sample Synopses of Evolving Datasets | 2006 | VLDB | 5.1280225e-05 |
| 6,385 | Propagating Functional Dependencies with Conditions | 2008 | VLDB | 5.0875028e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,159 | Sequential Dependencies | 2009 | VLDB | 9.4130956e-05 |
| 3,167 | Relational Confidence Bounds Are Easy With The Bootstrap | 2005 | SIGMOD | 7.4523397e-05 |
| 1,624 | Sampling the Repairs of Functional Dependency Violations under Hard Constraints | 2010 | VLDB | 0.00011099222 |
| 5,192 | Pattern Functional Dependencies for Data Cleaning | 2020 | VLDB | 5.6375087e-05 |
| 4,442 | Approximating Predicates and Expressive Queries on Probabilistic Databases | 2008 | PODS | 6.186154e-05 |
| 25 | Dependency Inference (Extended Abstract) | 1987 | VLDB | 0.00083101742 |
| 2,450 | Functional Dependencies for Graphs | 2016 | SIGMOD | 8.7882979e-05 |
| 6,385 | Propagating Functional Dependencies with Conditions | 2008 | VLDB | 5.0875028e-05 |
| 10,587 | Efficient Discovery of Relaxed Functional Dependencies | 2025 | VLDB | 4.1945683e-05 |
| 1,188 | On Generating Near-Optimal Tableaux for Conditional Functional Dependencies | 2008 | VLDB | 0.00013441729 |