Back to papers
Efficient Discovery of Approximate Dependencies
Summary: Efficient discovery of approximate FDs and approximate UCCs. Pyro combines separate-and-conquer search with sampling-guided verification to quickly propose and validate candidates; scales to large datasets with minimal memory, outperforming prior methods by up to 33×.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 11781
- Venue
- VLDB
- Year
- 2018
- Pagerank
- 9.6001836e-05
- Overall Rank
- 2,077 | 85.56%
- DOI
-
10.14778/3192965.3192968
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 26 of 26 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 1,889 |
Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads |
2021 |
VLDB |
0.00010200865 |
| 2,483 |
Discovery of Approximate (and Exact) Denial Constraints |
2020 |
VLDB |
8.6864916e-05 |
| 2,865 |
Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations |
2019 |
SIGMOD |
7.9862595e-05 |
| 3,499 |
Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation |
2021 |
VLDB |
7.0376445e-05 |
| 4,127 |
A Statistical Perspective on Discovering Functional Dependencies in Noisy Data |
2020 |
SIGMOD |
6.4310458e-05 |
| 4,567 |
Optimizing Video Analytics with Declarative Model Relationships |
2023 |
VLDB |
6.080526e-05 |
| 4,641 |
VIVA: An End-to-End System for Interactive Video Analytics |
2022 |
CIDR |
6.027004e-05 |
| 6,466 |
Pando: Enhanced Data Skipping with Logical Data Partitioning |
2023 |
VLDB |
5.0528281e-05 |
| 6,756 |
Fast Incremental Discovery of Pointwise Order Dependencies |
2020 |
VLDB |
4.9379361e-05 |
| 7,076 |
Mining Approximate Acyclic Schemes from Relations |
2020 |
SIGMOD |
4.8426354e-05 |
| 7,202 |
Conformance Constraint Discovery: Measuring Trust in Data-Driven Systems |
2021 |
SIGMOD |
4.8023314e-05 |
| 7,366 |
Discovery Algorithms for Embedded Functional Dependencies |
2020 |
SIGMOD |
4.7515248e-05 |
| 8,085 |
Discovery and Ranking of Embedded Uniqueness Constraints |
2019 |
VLDB |
4.5902231e-05 |
| 8,836 |
Fast Approximate Denial Constraint Discovery |
2023 |
VLDB |
4.4393184e-05 |
| 9,355 |
Discovering Top-k Rules using Subjective and Objective Criteria |
2023 |
SIGMOD |
4.3514328e-05 |
| 9,649 |
DAFDiscover: Robust Mining Algorithm for Dynamic Approximate Functional Dependencies on Dirty Data |
2024 |
VLDB |
4.3109001e-05 |
| 9,749 |
Efficient Differential Dependency Discovery |
2024 |
VLDB |
4.2897489e-05 |
| 9,847 |
Discovering Top-k Relevant and Diversified Rules |
2024 |
SIGMOD |
4.2721228e-05 |
| 9,963 |
Parallel Rule Discovery from Large Datasets by Sampling |
2022 |
SIGMOD |
4.2294678e-05 |
| 10,489 |
Incremental Rule Discovery in Response to Parameter Updates |
2025 |
SIGMOD |
4.1945683e-05 |
| 10,540 |
Discovering Approximate Inclusion Dependencies |
2025 |
VLDB |
4.1945683e-05 |
| 10,587 |
Efficient Discovery of Relaxed Functional Dependencies |
2025 |
VLDB |
4.1945683e-05 |
| 10,679 |
How and Why False Denial Constraints are Discovered |
2025 |
VLDB |
4.1945683e-05 |
| 11,024 |
SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis |
2024 |
VLDB |
4.1945683e-05 |
| 11,490 |
Logical Schema Design that Quantifies Update Inefficiency and Join Efficiency |
2021 |
SIGMOD |
4.1945683e-05 |
| 11,579 |
AUDITOR: A System Designed for Automatic Discovery of Complex Integrity Constraints in Relational Databases |
2020 |
SIGMOD |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 71 |
How Good Are Query Optimizers, Really? |
2016 |
VLDB |
0.00059038975 |
| 181 |
Mining Frequent Patterns without Candidate Generation |
2000 |
SIGMOD |
0.00036992674 |
| 224 |
CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies |
2004 |
SIGMOD |
0.00032746205 |
| 894 |
A Hybrid Approach to Functional Dependency Discovery |
2016 |
SIGMOD |
0.00015556428 |
| 1,047 |
Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms |
2015 |
VLDB |
0.00014459715 |
| 1,197 |
The LLUNATIC Data-Cleaning Framework |
2013 |
VLDB |
0.00013390321 |
| 1,624 |
Sampling the Repairs of Functional Dependency Violations under Hard Constraints |
2010 |
VLDB |
0.00011099222 |
| 1,625 |
Data Profiling with Metanome |
2015 |
VLDB |
0.00011094926 |
| 2,549 |
GORDIAN: Efficient and Scalable Discovery of Composite Keys |
2006 |
VLDB |
8.5641554e-05 |
| 2,946 |
BigDansing: A System for Big Data Cleansing |
2015 |
SIGMOD |
7.8372441e-05 |
| 3,976 |
UGuide – User-Guided Discovery of FD-Detectable Errors |
2017 |
SIGMOD |
6.5736462e-05 |
| 4,499 |
Possible and Certain SQL Keys |
2015 |
VLDB |
6.1385333e-05 |
| 4,682 |
Scalable Discovery of Unique Column Combinations |
2014 |
VLDB |
6.0022412e-05 |
Semantically Similar Papers