Database Paper Browser

Back to papers

Mining Database Structure; Or, How to Build a Data Quality Browser

Summary: Bellman mines database structure to build a data quality browser for federated schemas. Techniques identify similar fields, join paths, and estimate join directions and cardinalities to reveal structure, aiding schema mapping and data preparation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
3353
Venue
SIGMOD
Year
2002
Pagerank
0.00022303253
Overall Rank
475 | 96.70%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 34 of 34 citing papers.

Rank Citing Paper Year Venue Pagerank
383 An Optimal Algorithm for the Distinct Elements Problem 2010 PODS 0.00024820873
555 Discovering Denial Constraints 2013 VLDB 0.00020254908
727 On Synopses for Distinct-Value Estimation Under Multiset Operations 2007 SIGMOD 0.00017508726
732 Discovering Data Quality Rules 2008 VLDB 0.00017465093
1,482 Automating Large-Scale Data Quality Verification 2018 VLDB 0.00011725533
1,664 On Multi-Column Foreign Key Discovery 2010 VLDB 0.00010976887
1,796 Summary Graphs for Relational Database Schemas 2011 VLDB 0.00010524897
1,908 Information-Theoretic Tools for Mining Database Structure from Large Data Sets 2004 SIGMOD 0.00010126101
2,158 Uni-Detect: A Unified Approach to Automated Error Detection in Tables 2019 SIGMOD 9.4141354e-05
2,174 iMAP: Discovering Complex Semantic Matches between Database Schemas 2004 SIGMOD 9.3672342e-05
2,549 GORDIAN: Efficient and Scalable Discovery of Composite Keys 2006 VLDB 8.5641554e-05
2,982 FastQRE: Fast Query Reverse Engineering 2018 SIGMOD 7.7801984e-05
3,015 Chorus: Foundation Models for Unified Data Discovery and Exploration 2024 VLDB 7.7092391e-05
3,050 Comparing Data Streams Using Hamming Norms (How to Zero In) 2002 VLDB 7.6512619e-05
3,252 Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks 2020 SIGMOD 7.3178277e-05
3,426 Discovering Topical Structures of Databases 2008 SIGMOD 7.1063105e-05
3,467 Data Profiling – A Tutorial 2017 SIGMOD 7.069081e-05
3,735 Auto-Join: Joining Tables by Leveraging Transformations 2017 VLDB 6.8061318e-05
3,928 Tighter Estimation using Bottom-k Sketches 2008 VLDB 6.6254568e-05
4,435 Sampling Dirty Data for Matching Attributes 2010 SIGMOD 6.1918164e-05
4,873 Power-Law Based Estimation of Set Similarity Join Size 2009 VLDB 5.8602304e-05
4,929 Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux 2010 VLDB 5.8217296e-05
5,096 Auto-Transform: Learning-to-Transform by Patterns 2020 VLDB 5.7011825e-05
5,200 SetSketch: Filling the Gap between MinHash and HyperLogLog 2021 VLDB 5.6337581e-05
5,415 Coordinated Weighted Sampling for Estimating Aggregates Over Multiple Weight Assignments 2009 VLDB 5.5196338e-05
5,506 Exploring Change – A New Dimension of Data Analytics 2019 VLDB 5.473324e-05
6,713 Query Relaxation Using Malleable Schemas 2007 SIGMOD 4.951387e-05
7,838 Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes 2021 SIGMOD 4.6377995e-05
8,338 DBChEx: Interactive Exploration of Data and Schema Change 2019 CIDR 4.5434254e-05
8,921 Leveraging Similarity Joins for Signal Reconstruction 2018 VLDB 4.427232e-05
10,019 Guardrail: Automated Integrity Constraint Synthesis From Noisy Data 2026 SIGMOD 4.1945683e-05
10,946 An LDP Compatible Sketch for Securely Approximating Set Intersection Cardinalities 2024 SIGMOD 4.1945683e-05
11,332 The White-Box Adversarial Data Stream Model 2022 PODS 4.1945683e-05
12,166 Get the Most out of Your Sample: Optimal Unbiased Estimators using Partial Information 2011 PODS 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers