Database Paper Browser

Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms

Summary: The paper re-implements seven functional dependency (FD) discovery algorithms and benchmarks them on synthetic and real data; grouping the algorithms into three categories clarifies the design space. Results: none of the FD discovery methods scales well, and the characteristics of the data dictate which algorithm to choose; the paper provides selection guidelines and highlights open gaps. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10988
Venue: VLDB
Year: 2015
Pagerank: 0.00014459715
Overall Rank: 1,047 | 92.72%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 44 of 44 citing papers.

Rank Citing Paper Year Venue Pagerank
894 A Hybrid Approach to Functional Dependency Discovery 2016 SIGMOD 0.00015556428
1,625 Data Profiling with Metanome 2015 VLDB 0.00011094926
1,940 SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging 2021 SIGMOD 0.00010020173
2,077 Efficient Discovery of Approximate Dependencies 2018 VLDB 9.6001836e-05
2,253 Efficient Denial Constraint Discovery with Hydra 2018 VLDB 9.1937209e-05
2,483 Discovery of Approximate (and Exact) Denial Constraints 2020 VLDB 8.6864916e-05
2,574 Discovery of Genuine Functional Dependencies from Relational Data with Missing Values 2018 VLDB 8.5173637e-05
2,865 Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations 2019 SIGMOD 7.9862595e-05
3,396 Automatic Data Repair: Are We Ready to Deploy? 2024 VLDB 7.1455126e-05
3,440 Approximate Denial Constraints 2020 VLDB 7.0918817e-05
3,501 MT-TeQL: Evaluating and Augmenting Neural NLIDB on Real-world Linguistic and Schema Variations 2022 VLDB 7.0366785e-05
3,702 Every Row Counts: Combining Sketches and Sampling for Accurate Group-By Result Estimates 2019 CIDR 6.8295759e-05
3,976 UGuide – User-Guided Discovery of FD-Detectable Errors 2017 SIGMOD 6.5736462e-05
4,127 A Statistical Perspective on Discovering Functional Dependencies in Noisy Data 2020 SIGMOD 6.4310458e-05
4,744 Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization 2017 VLDB 5.957936e-05
5,191 Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances 2019 SIGMOD 5.6378768e-05
5,192 Pattern Functional Dependencies for Data Cleaning 2020 VLDB 5.6375087e-05
5,280 Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V 2023 VLDB 5.5896735e-05
5,383 Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search 2021 VLDB 5.5393038e-05
5,613 Distributed implementations of dependency discovery algorithms 2019 VLDB 5.4102298e-05
5,618 Explaining Repaired Data with CFDs 2018 VLDB 5.4079415e-05
5,910 Normalizing Property Graphs 2023 VLDB 5.2768691e-05
6,280 Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks 2023 VLDB 5.1290457e-05
6,944 DataPrism: Exposing Disconnect between Data and Systems 2022 SIGMOD 4.8912787e-05
7,185 Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) 2019 VLDB 4.8066159e-05
7,202 Conformance Constraint Discovery: Measuring Trust in Data-Driven Systems 2021 SIGMOD 4.8023314e-05
7,287 Discovering Association Rules from Big Graphs 2022 VLDB 4.7762276e-05
7,366 Discovery Algorithms for Embedded Functional Dependencies 2020 SIGMOD 4.7515248e-05
7,714 Identifying Insufficient Data Coverage in Databases with Multiple Relations 2020 VLDB 4.6700455e-05
7,926 CoCo: Interactive Exploration of Conformance Constraints for Data Understanding and Data Cleaning 2021 SIGMOD 4.6144554e-05
8,085 Discovery and Ranking of Embedded Uniqueness Constraints 2019 VLDB 4.5902231e-05
8,743 CtxPipe: Context-aware Data Preparation Pipeline Construction for Machine Learning 2024 SIGMOD 4.456315e-05
9,355 Discovering Top-k Rules using Subjective and Objective Criteria 2023 SIGMOD 4.3514328e-05
9,646 Discovering Functional Dependencies through Hitting Set Enumeration 2024 SIGMOD 4.3109001e-05
9,749 Efficient Differential Dependency Discovery 2024 VLDB 4.2897489e-05
9,847 Discovering Top-k Relevant and Diversified Rules 2024 SIGMOD 4.2721228e-05
10,019 Guardrail: Automated Integrity Constraint Synthesis From Noisy Data 2026 SIGMOD 4.1945683e-05
10,587 Efficient Discovery of Relaxed Functional Dependencies 2025 VLDB 4.1945683e-05
10,791 FDepHunter: Harnessing Negative Examples to Expose Fakes and Reveal Ghosts 2025 VLDB 4.1945683e-05
11,001 Capturing More Associations by Referencing External Graphs 2024 VLDB 4.1945683e-05
11,010 Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates 2024 VLDB 4.1945683e-05
11,024 SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis 2024 VLDB 4.1945683e-05
11,187 Regularized Pairwise Relationship based Analytics for Structured Data 2023 SIGMOD 4.1945683e-05
11,490 Logical Schema Design that Quantifies Update Inefficiency and Join Efficiency 2021 SIGMOD 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 2 of 2 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
224 CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies 2004 SIGMOD 0.00032746205
751 Partition Semantics for Relations 1985 PODS 0.0001721247

Semantically Similar Papers