Database Paper Browser

Back to papers

Data Profiling with Metanome

Summary: Metanome is an extensible data profiling platform that automatically discovers metadata beyond simple statistics. It integrates state-of-the-art profiling algorithms, supports benchmarking and ranking, and provides visualization to compare approaches. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11067
Venue
VLDB
Year
2015
Pagerank
0.00011094926
Overall Rank
1,625 | 88.70%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 26 of 26 citing papers.

Rank Citing Paper Year Venue Pagerank
894 A Hybrid Approach to Functional Dependency Discovery 2016 SIGMOD 0.00015556428
1,683 Cardinality Estimation: An Experimental Survey 2018 VLDB 0.00010922679
2,077 Efficient Discovery of Approximate Dependencies 2018 VLDB 9.6001836e-05
2,158 Uni-Detect: A Unified Approach to Automated Error Detection in Tables 2019 SIGMOD 9.4141354e-05
2,253 Efficient Denial Constraint Discovery with Hydra 2018 VLDB 9.1937209e-05
2,280 SMOKE: Fine-grained Lineage at Interactive Speed 2018 VLDB 9.1111033e-05
2,483 Discovery of Approximate (and Exact) Denial Constraints 2020 VLDB 8.6864916e-05
3,467 Data Profiling – A Tutorial 2017 SIGMOD 7.069081e-05
3,976 UGuide – User-Guided Discovery of FD-Detectable Errors 2017 SIGMOD 6.5736462e-05
5,192 Pattern Functional Dependencies for Data Cleaning 2020 VLDB 5.6375087e-05
5,361 Efficient Estimation of Inclusion Coefficient using HyperLogLog Sketches 2018 VLDB 5.547935e-05
5,509 Can Large Language Models Predict Data Correlations from Column Names? 2023 VLDB 5.4703368e-05
5,981 DataPrep.EDA: Task-Centric Exploratory Data Analysis for Statistical Modeling in Python 2021 SIGMOD 5.2448986e-05
7,076 Mining Approximate Acyclic Schemes from Relations 2020 SIGMOD 4.8426354e-05
7,745 Crossing the finish line faster when paddling the Data Lake with KAYAK 2017 VLDB 4.6618625e-05
8,949 Discovering Similarity Inclusion Dependencies 2023 SIGMOD 4.4234478e-05
8,974 DataLoom: Simplifying Data Loading with LLMs 2024 VLDB 4.4184286e-05
9,646 Discovering Functional Dependencies through Hitting Set Enumeration 2024 SIGMOD 4.3109001e-05
9,673 Don’t Be a Tattle-Tale: Preventing Leakages through Data Dependencies on Access Control Protected Data 2022 VLDB 4.3055474e-05
10,540 Discovering Approximate Inclusion Dependencies 2025 VLDB 4.1945683e-05
10,817 Mining Meaningful Keys and Foreign Keys with High Precision and Recall 2025 VLDB 4.1945683e-05
11,024 SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis 2024 VLDB 4.1945683e-05
11,366 Statistical Schema Learning using Occam's Razor 2022 SIGMOD 4.1945683e-05
11,515 From Papers to Practice: The openclean Open-Source Data Cleaning Library 2021 VLDB 4.1945683e-05
11,546 Making DBMSes Dependency-Aware 2020 CIDR 4.1945683e-05
11,710 Demonstration of Smoke: A Deep Breath of Data-Intensive Lineage Applications 2018 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,047 Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms 2015 VLDB 0.00014459715
4,682 Scalable Discovery of Unique Column Combinations 2014 VLDB 6.0022412e-05
4,784 Divide & Conquer-based Inclusion Dependency Discovery 2015 VLDB 5.9240851e-05
Previous Page 1 / 1 Next

Semantically Similar Papers