Database Paper Browser

Back to papers

A Hybrid Approach to Functional Dependency Discovery

Summary: HyFD, a hybrid FD discovery algorithm, combines fast approximation with efficient validation to uncover all minimal functional dependencies. It uses compact data structures to scale to 50+ attributes and millions of records, outperforming prior methods. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5230
Venue
SIGMOD
Year
2016
Pagerank
0.00015556428
Overall Rank
894 | 93.79%
DOI
10.1145/2882903.2915203

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 46 of 46 citing papers.

Rank Citing Paper Year Venue Pagerank
1,482 Automating Large-Scale Data Quality Verification 2018 VLDB 0.00011725533
1,894 Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning 2020 VLDB 0.0001018378
2,077 Efficient Discovery of Approximate Dependencies 2018 VLDB 9.6001836e-05
2,253 Efficient Denial Constraint Discovery with Hydra 2018 VLDB 9.1937209e-05
2,483 Discovery of Approximate (and Exact) Denial Constraints 2020 VLDB 8.6864916e-05
2,650 Detecting Logic Bugs of Join Optimizations in DBMS 2023 SIGMOD 8.3708191e-05
2,968 Raha: A Configuration-Free Error Detection System 2019 SIGMOD 7.7985097e-05
2,982 FastQRE: Fast Query Reverse Engineering 2018 SIGMOD 7.7801984e-05
3,467 Data Profiling – A Tutorial 2017 SIGMOD 7.069081e-05
3,797 Stitching Web Tables for Improving Matching Quality 2017 VLDB 6.7597149e-05
4,127 A Statistical Perspective on Discovering Functional Dependencies in Noisy Data 2020 SIGMOD 6.4310458e-05
5,383 Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search 2021 VLDB 5.5393038e-05
5,613 Distributed implementations of dependency discovery algorithms 2019 VLDB 5.4102298e-05
6,092 Observatory: Characterizing Embeddings of Relational Tables 2024 VLDB 5.2138566e-05
7,076 Mining Approximate Acyclic Schemes from Relations 2020 SIGMOD 4.8426354e-05
7,185 Certus: An Effective Entity Resolution Approach with Graph Differential Dependencies (GDDs) 2019 VLDB 4.8066159e-05
7,287 Discovering Association Rules from Big Graphs 2022 VLDB 4.7762276e-05
7,366 Discovery Algorithms for Embedded Functional Dependencies 2020 SIGMOD 4.7515248e-05
7,838 Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes 2021 SIGMOD 4.6377995e-05
8,085 Discovery and Ranking of Embedded Uniqueness Constraints 2019 VLDB 4.5902231e-05
8,472 Rapidash: Efficient Detection of Constraint Violations 2024 VLDB 4.5036378e-05
8,475 DataProf: Semantic Profiling for Iterative Data Cleansing and Business Rule Acquisition 2018 SIGMOD 4.5028904e-05
8,703 Workload-driven, Lazy Discovery of Data Dependencies for Query Optimization 2022 CIDR 4.4647237e-05
8,836 Fast Approximate Denial Constraint Discovery 2023 VLDB 4.4393184e-05
8,850 Hitting Set Enumeration with Partial Information for Unique Column Combination Discovery 2020 VLDB 4.4364648e-05
9,355 Discovering Top-k Rules using Subjective and Objective Criteria 2023 SIGMOD 4.3514328e-05
9,410 Leveraging Application Data Constraints to Optimize Database-Backed Web Applications 2023 VLDB 4.3441378e-05
9,487 Making It Tractable to Catch Duplicates and Conflicts in Graphs 2023 SIGMOD 4.3341665e-05
9,646 Discovering Functional Dependencies through Hitting Set Enumeration 2024 SIGMOD 4.3109001e-05
9,649 DAFDiscover: Robust Mining Algorithm for Dynamic Approximate Functional Dependencies on Dirty Data 2024 VLDB 4.3109001e-05
9,749 Efficient Differential Dependency Discovery 2024 VLDB 4.2897489e-05
9,847 Discovering Top-k Relevant and Diversified Rules 2024 SIGMOD 4.2721228e-05
9,963 Parallel Rule Discovery from Large Datasets by Sampling 2022 SIGMOD 4.2294678e-05
10,029 Outliers: The Good, the Bad and the Ugly 2026 SIGMOD 4.1945683e-05
10,489 Incremental Rule Discovery in Response to Parameter Updates 2025 SIGMOD 4.1945683e-05
10,508 Synthesizing Third Normal Form Schemata that Minimize Integrity Maintenance and Update Overheads: Parameterizing 3NF by the Numbers of Minimal Keys and Functional Dependencies 2025 SIGMOD 4.1945683e-05
10,587 Efficient Discovery of Relaxed Functional Dependencies 2025 VLDB 4.1945683e-05
10,679 How and Why False Denial Constraints are Discovered 2025 VLDB 4.1945683e-05
10,791 FDepHunter: Harnessing Negative Examples to Expose Fakes and Reveal Ghosts 2025 VLDB 4.1945683e-05
11,001 Capturing More Associations by Referencing External Graphs 2024 VLDB 4.1945683e-05
11,010 Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates 2024 VLDB 4.1945683e-05
11,024 SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis 2024 VLDB 4.1945683e-05
11,173 Composite Object Normal Forms: Parameterizing Boyce-Codd Normal Form by the Number of Minimal Keys 2023 SIGMOD 4.1945683e-05
11,366 Statistical Schema Learning using Occam's Razor 2022 SIGMOD 4.1945683e-05
11,490 Logical Schema Design that Quantifies Update Inefficiency and Join Efficiency 2021 SIGMOD 4.1945683e-05
11,546 Making DBMSes Dependency-Aware 2020 CIDR 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
751 Partition Semantics for Relations 1985 PODS 0.0001721247
1,047 Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms 2015 VLDB 0.00014459715
1,625 Data Profiling with Metanome 2015 VLDB 0.00011094926
2,266 Estimating the Confidence of Conditional Functional Dependencies 2009 SIGMOD 9.1540815e-05
Previous Page 1 / 1 Next

Semantically Similar Papers