Automatic Discovery of Attributes in Relational Databases
Summary: Data-driven clustering of relational columns to discover attributes, without external knowledge. Treats the database as a column-relationship graph and partitions it into connected components to form attribute clusters; experiments on real and synthetic data validate schema matching. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,517 | Annotating Columns with Pre-trained Language Models | 2022 | SIGMOD | 8.6092139e-05 |
| 2,982 | FastQRE: Fast Query Reverse Engineering | 2018 | SIGMOD | 7.7801984e-05 |
| 4,859 | Integrating Data Lake Tables | 2023 | VLDB | 5.8732433e-05 |
| 5,486 | Fast Foreign-Key Detection in Microsoft SQL Server PowerPivot for Excel | 2014 | VLDB | 5.4811603e-05 |
| 7,048 | Magneto: Combining Small and Large Language Models for Schema Matching | 2025 | VLDB | 4.8520651e-05 |
| 8,499 | Synthesizing Mapping Relationships Using Table Corpus | 2017 | SIGMOD | 4.4975851e-05 |
| 10,645 | OpenForge: Probabilistic Metadata Integration | 2025 | VLDB | 4.1945683e-05 |
| 10,754 | OmniMatch: Joinability Discovery in Data Products | 2025 | VLDB | 4.1945683e-05 |
| 11,205 | Steered Training Data Generation for Learned Semantic Type Detection | 2023 | SIGMOD | 4.1945683e-05 |
| 11,230 | VersaMatch: Ontology Matching with Weak Supervision | 2023 | VLDB | 4.1945683e-05 |
| 11,775 | Building Structured Databases of Factual Knowledge from Massive Text Corpora | 2017 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636 |
| 294 | Using Schema Matching to Simplify Heterogeneous Data Translation | 1998 | VLDB | 0.00028669519 |
| 303 | Generic Schema Matching with Cupid | 2001 | VLDB | 0.00028301477 |
| 382 | COMA - A system for flexible combination of schema matching approaches | 2002 | VLDB | 0.00024823252 |
| 916 | On Schema Matching with Opaque Column Names and Data Values | 2003 | SIGMOD | 0.00015379422 |
| 1,664 | On Multi-Column Foreign Key Discovery | 2010 | VLDB | 0.00010976887 |
| 2,114 | Rondo: A Programming Platform for Generic Model Management | 2003 | SIGMOD | 9.5268855e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,921 | On the Complexity of Deriving Schema Mappings from Database Instances | 2008 | PODS | 6.6301252e-05 |
| 1,849 | Improving Database Schemes by Adding Attributes | 1983 | PODS | 0.00010329397 |
| 10,924 | Improved Approximation Algorithms for Relational Clustering | 2024 | PODS | 4.1945683e-05 |
| 3,047 | Comprehensive Approach to the Design of Relational Database Schemes | 1984 | VLDB | 7.6561027e-05 |
| 1,664 | On Multi-Column Foreign Key Discovery | 2010 | VLDB | 0.00010976887 |
| 1,796 | Summary Graphs for Relational Database Schemas | 2011 | VLDB | 0.00010524897 |
| 916 | On Schema Matching with Opaque Column Names and Data Values | 2003 | SIGMOD | 0.00015379422 |
| 3,143 | Extracting and Analyzing Hidden Graphs from Relational Databases | 2017 | SIGMOD | 7.4804326e-05 |
| 1,510 | Summarizing Relational Databases | 2009 | VLDB | 0.00011606901 |
| 146 | Knowledge Discovery in Databases: An Attribute-Oriented Approach | 1992 | VLDB | 0.00041315295 |