Database Paper Browser

Back to papers

Scalable Discovery of Unique Column Combinations

Summary: Ducc scales discovery of all unique and non-unique column combinations by framing it as a graph-coloring problem and applying a hybrid column-based DFS/random-walk pruning. Row-based pruning with scale-out deployment yields up to 631× faster than Gordian and 398× faster than HCA on multi-million-row datasets. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10908
Venue
VLDB
Year
2014
Pagerank
6.0022412e-05
Overall Rank
4,682 | 67.43%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 17 of 17 citing papers.

Rank Citing Paper Year Venue Pagerank
1,625 Data Profiling with Metanome 2015 VLDB 0.00011094926
2,077 Efficient Discovery of Approximate Dependencies 2018 VLDB 9.6001836e-05
2,253 Efficient Denial Constraint Discovery with Hydra 2018 VLDB 9.1937209e-05
2,483 Discovery of Approximate (and Exact) Denial Constraints 2020 VLDB 8.6864916e-05
3,467 Data Profiling – A Tutorial 2017 SIGMOD 7.069081e-05
3,976 UGuide – User-Guided Discovery of FD-Detectable Errors 2017 SIGMOD 6.5736462e-05
4,499 Possible and Certain SQL Keys 2015 VLDB 6.1385333e-05
6,270 MATE: Multi-Attribute Table Extraction 2022 VLDB 5.1337451e-05
6,756 Fast Incremental Discovery of Pointwise Order Dependencies 2020 VLDB 4.9379361e-05
8,085 Discovery and Ranking of Embedded Uniqueness Constraints 2019 VLDB 4.5902231e-05
8,590 Exploratory Training: When Annotators Learn About Data 2023 SIGMOD 4.4896282e-05
8,743 CtxPipe: Context-aware Data Preparation Pipeline Construction for Machine Learning 2024 SIGMOD 4.456315e-05
8,836 Fast Approximate Denial Constraint Discovery 2023 VLDB 4.4393184e-05
8,850 Hitting Set Enumeration with Partial Information for Unique Column Combination Discovery 2020 VLDB 4.4364648e-05
9,278 Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples 2016 SIGMOD 4.3639892e-05
10,679 How and Why False Denial Constraints are Discovered 2025 VLDB 4.1945683e-05
11,518 A Demonstration of RELIC: A System for REtrospective Lineage InferenCe of Data Workflows 2021 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers