Multi-column Substring Matching for Database Schema Translation
Summary: Unsupervised method for translating schemas from substrings in columns, without training data. Iterative algorithm deduces the substring concatenation to map between schemas; evaluated on real and synthetic data for fixed- and variable-length fields. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,230 | Learning Semantic String Transformations from Examples | 2012 | VLDB | 7.339123e-05 |
| 3,735 | Auto-Join: Joining Tables by Leveraging Transformations | 2017 | VLDB | 6.8061318e-05 |
| 3,992 | Discovering Linkage Points over Web Data | 2013 | VLDB | 6.5544834e-05 |
| 4,850 | SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora | 2015 | VLDB | 5.8768452e-05 |
| 5,434 | Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples | 2021 | SIGMOD | 5.5045402e-05 |
| 9,490 | Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph | 2023 | VLDB | 4.3341665e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 155 | Robust and Efficient Fuzzy Match for Online Data Cleaning | 2003 | SIGMOD | 0.00040637896 |
| 208 | Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach | 2001 | SIGMOD | 0.0003460594 |
| 303 | Generic Schema Matching with Cupid | 2001 | VLDB | 0.00028301477 |
| 1,065 | Data-Driven Understanding and Refinement of Schema Mappings | 2001 | SIGMOD | 0.00014338146 |
| 2,174 | iMAP: Discovering Complex Semantic Matches between Database Schemas | 2004 | SIGMOD | 9.3672342e-05 |
| 4,026 | Flexible String Matching Against Large Databases in Practice | 2004 | VLDB | 6.5169976e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,104 | Data Sharing Through Query Translation in Autonomous Sources | 2004 | VLDB | 4.3954618e-05 |
| 5,509 | Can Large Language Models Predict Data Correlations from Column Names? | 2023 | VLDB | 5.4703368e-05 |
| 2,945 | Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning | 2023 | SIGMOD | 7.8377395e-05 |
| 3,823 | Automatic Discovery of Attributes in Relational Databases | 2011 | SIGMOD | 6.7261168e-05 |
| 2,425 | Instance-based Schema Matching for Web Databases by Domain-specific Query Probing | 2004 | VLDB | 8.8376569e-05 |
| 1,664 | On Multi-Column Foreign Key Discovery | 2010 | VLDB | 0.00010976887 |
| 6,290 | Putting Context into Schema Matching | 2006 | VLDB | 5.1271647e-05 |
| 3,921 | On the Complexity of Deriving Schema Mappings from Database Instances | 2008 | PODS | 6.6301252e-05 |
| 916 | On Schema Matching with Opaque Column Names and Data Values | 2003 | SIGMOD | 0.00015379422 |
| 294 | Using Schema Matching to Simplify Heterogeneous Data Translation | 1998 | VLDB | 0.00028669519 |