Database Paper Browser

CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies

Summary: CORDS automatically discovers correlations and soft functional dependencies between columns using sampling and chi-squared tests. It outputs a dependency graph and column-group statistics that guide the query optimizer, improving selectivity estimates and speeding up queries. (summarized by gpt-5-nano on Feb 09 2026)
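
To make the summary concrete, here is a minimal Python sketch of the two ideas it names: a chi-squared test for correlation on a sample of column pairs, and a distinct-count ratio as a signal for a soft functional dependency. It is an illustration under stated assumptions, not the paper's implementation. The function names, the fixed sample size, and the normalization by n * (min(d1, d2) - 1) are choices made here for illustration, and thresholds for declaring a correlation are left out.

```python
import random
from collections import Counter


def sample_rows(rows, k, seed=0):
    """Return up to k rows drawn uniformly without replacement (hypothetical helper)."""
    rng = random.Random(seed)
    return list(rows) if len(rows) <= k else rng.sample(rows, k)


def chi_squared_statistic(pairs):
    """Pearson chi-squared statistic for a list of (c1, c2) value pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    left = Counter(a for a, _ in pairs)
    right = Counter(b for _, b in pairs)
    stat = 0.0
    for a, count_a in left.items():
        for b, count_b in right.items():
            expected = count_a * count_b / n  # count expected under independence
            observed = joint.get((a, b), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def normalized_contingency(pairs):
    """Chi-squared scaled to [0, 1]: 0 = independent, 1 = fully dependent."""
    n = len(pairs)
    d1 = len({a for a, _ in pairs})
    d2 = len({b for _, b in pairs})
    k = min(d1, d2) - 1
    if n == 0 or k <= 0:
        return 0.0  # a constant column carries no correlation signal
    return chi_squared_statistic(pairs) / (n * k)


def soft_fd_strength(pairs):
    """|C1| / |C1, C2| over the sample; 1.0 means C1 determines C2 in the sample."""
    d1 = len({a for a, _ in pairs})
    d12 = len(set(pairs))
    return d1 / d12 if d12 else 0.0
```

On a sample where the first column fully determines the second, both measures reach 1.0; on a sample where the columns are independent, the normalized contingency drops to 0.0. A real system would also need a significance threshold and a way to bucket high-cardinality columns, neither of which is sketched here.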

Paper ID: 3556
Venue: SIGMOD
Year: 2004
Pagerank: 0.00032746205
Overall Rank: 224 | 98.45%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 50 of 80 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975
333 | Neo: A Learned Query Optimizer | 2019 | VLDB | 0.00027206884
514 | An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning | 2019 | SIGMOD | 0.0002124895
555 | Discovering Denial Constraints | 2013 | VLDB | 0.00020254908
627 | Management of Probabilistic Data: Foundations and Challenges | 2007 | PODS | 0.00018959005
706 | MYSTIQ: A system for finding more answers by using probabilities | 2005 | SIGMOD | 0.00017845469
1,047 | Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms | 2015 | VLDB | 0.00014459715
1,188 | On Generating Near-Optimal Tableaux for Conditional Functional Dependencies | 2008 | VLDB | 0.00013441729
1,254 | Selectivity Estimation for Range Predicates using Lightweight Models | 2019 | VLDB | 0.00013027411
1,546 | KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing | 2015 | SIGMOD | 0.00011446851
1,547 | Lightweight Graphical Models for Selectivity Estimation Without Independence Assumptions | 2011 | VLDB | 0.00011442359
1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787
1,737 | QuickSel: Quick Selectivity Learning with Mixture Models | 2020 | SIGMOD | 0.00010720294
1,758 | Sampling-Based Query Re-Optimization | 2016 | SIGMOD | 0.00010655546
1,889 | Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads | 2021 | VLDB | 0.00010200865
2,077 | Efficient Discovery of Approximate Dependencies | 2018 | VLDB | 9.6001836e-05
2,154 | DIFF: A Relational Interface for Large-Scale Data Explanation | 2019 | VLDB | 9.4208667e-05
2,158 | Uni-Detect: A Unified Approach to Automated Error Detection in Tables | 2019 | SIGMOD | 9.4141354e-05
2,266 | Estimating the Confidence of Conditional Functional Dependencies | 2009 | SIGMOD | 9.1540815e-05
2,356 | Consistently Estimating the Selectivity of Conjuncts of Predicates | 2005 | VLDB | 8.9620762e-05
2,506 | Auto-Detect: Data-Driven Error Detection in Tables | 2018 | SIGMOD | 8.6335464e-05
2,590 | Answering Queries from Statistics and Probabilistic Views | 2005 | VLDB | 8.483194e-05
2,837 | Correlation Maps: A Compressed Access Method for Exploiting Soft Functional Dependencies | 2009 | VLDB | 8.0414149e-05
2,865 | Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations | 2019 | SIGMOD | 7.9862595e-05
2,969 | Estimating Join Selectivities using Bandwidth-Optimized Kernel Density Models | 2017 | VLDB | 7.7974762e-05
2,994 | Data Generation for Application-Specific Benchmarking | 2011 | VLDB | 7.761114e-05
3,013 | Cardinality Estimation Using Sample Views with Quality Assurance | 2007 | SIGMOD | 7.7137441e-05
3,207 | Predicting Cost Amortization for Query Services | 2011 | SIGMOD | 7.3818982e-05
3,299 | SCODED: Statistical Constraint Oriented Data Error Detection | 2020 | SIGMOD | 7.2546659e-05
3,467 | Data Profiling – A Tutorial | 2017 | SIGMOD | 7.069081e-05
3,702 | Every Row Counts: Combining Sketches and Sampling for Accurate Group-By Result Estimates | 2019 | CIDR | 6.8295759e-05
3,867 | CORADD: Correlation Aware Database Designer for Materialized Views and Indexes | 2010 | VLDB | 6.683173e-05
3,922 | Pushing Data-Induced Predicates Through Joins in Big-Data Clusters | 2020 | VLDB | 6.6291079e-05
3,924 | A Unified Deep Model of Learning from both Data and Queries for Cardinality Estimation | 2021 | SIGMOD | 6.6271553e-05
4,014 | Exploiting Correlations for Expensive Predicate Evaluation | 2015 | SIGMOD | 6.5273084e-05
4,127 | A Statistical Perspective on Discovering Functional Dependencies in Noisy Data | 2020 | SIGMOD | 6.4310458e-05
4,489 | Automatic Generation of Normalized Relational Schemas from Nested Key-Value Data | 2016 | SIGMOD | 6.1434237e-05
4,499 | Possible and Certain SQL Keys | 2015 | VLDB | 6.1385333e-05
4,567 | Optimizing Video Analytics with Declarative Model Relationships | 2023 | VLDB | 6.080526e-05
4,641 | VIVA: An End-to-End System for Interactive Video Analytics | 2022 | CIDR | 6.027004e-05
4,682 | Scalable Discovery of Unique Column Combinations | 2014 | VLDB | 6.0022412e-05
4,883 | Content-Based Routing: Different Plans for Different Data | 2005 | VLDB | 5.8545658e-05
4,904 | Temporal Rules Discovery for Web Data Cleaning | 2016 | VLDB | 5.8399195e-05
5,014 | Dynamically Optimizing Queries over Large Scale Data Platforms | 2014 | SIGMOD | 5.7586174e-05
5,025 | Automated Statistics Collection in DB2 UDB | 2004 | VLDB | 5.7533741e-05
5,072 | Optimizing Machine Learning Inference Queries with Correlative Proxy Models | 2022 | VLDB | 5.7185674e-05
5,096 | Auto-Transform: Learning-to-Transform by Patterns | 2020 | VLDB | 5.7011825e-05
5,509 | Can Large Language Models Predict Data Correlations from Column Names? | 2023 | VLDB | 5.4703368e-05
5,815 | StatAdvisor: Recommending Statistical Views | 2009 | VLDB | 5.3165295e-05
6,173 | Exploiting Soft and Hard Correlations in Big Data Query Optimization | 2016 | VLDB | 5.1699414e-05

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
