Database Paper Browser

Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach

Summary: Introduces LSD, a semi-automatic data integration system that learns semantic mappings from source schemas to a mediated schema, starting from a small set of seed mappings. It combines multiple learners (using schema, data, domain constraints, and XML structure) with a meta-learner to map new sources. (summarized by gpt-5-nano on Feb 09 2026)
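
The multi-learner idea in the summary can be illustrated with a minimal sketch. This is not the paper's implementation: the label set, the two toy base learners, and the fixed weights are all hypothetical, chosen only to show how per-learner confidence scores might be merged into one prediction. A real meta-learner would learn its weights from training data rather than take them as constants.

```python
from typing import Dict, List, Sequence

# Hypothetical mediated-schema labels (illustrative only, not from the paper).
LABELS = ["address", "price", "phone"]


def normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Scale scores so they sum to 1."""
    total = sum(scores.values())
    return {label: score / total for label, score in scores.items()}


def name_learner(tag: str, value: str) -> Dict[str, float]:
    """Toy base learner: scores labels from the source element name."""
    scores = {label: 0.1 for label in LABELS}
    for label in LABELS:
        if label in tag.lower():
            scores[label] = 0.8
    return normalize(scores)


def content_learner(tag: str, value: str) -> Dict[str, float]:
    """Toy base learner: scores labels from the data value."""
    scores = {label: 0.1 for label in LABELS}
    if value.startswith("$"):
        scores["price"] = 0.8
    elif value.replace("-", "").replace(" ", "").isdigit():
        scores["phone"] = 0.8
    else:
        scores["address"] = 0.4
    return normalize(scores)


def meta_combine(predictions: List[Dict[str, float]],
                 weights: Sequence[float]) -> Dict[str, float]:
    """Weighted sum of base-learner scores (weights are assumed, not learned)."""
    combined = {
        label: sum(w * p[label] for w, p in zip(weights, predictions))
        for label in LABELS
    }
    return normalize(combined)


def predict_label(tag: str, value: str,
                  weights: Sequence[float] = (0.5, 0.5)) -> str:
    """Return the highest-scoring mediated-schema label for one source element."""
    predictions = [name_learner(tag, value), content_learner(tag, value)]
    combined = meta_combine(predictions, weights)
    return max(combined, key=combined.get)


if __name__ == "__main__":
    print(predict_label("cost", "$250,000"))
    print(predict_label("contact-phone", "206 555 0100"))
```

In this sketch the element name `cost` gives the name learner nothing to go on, so the content learner's score for the `$` prefix decides the label. That is the point of combining learners: each one covers cases where another has no signal.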

Paper ID: 3295
Venue: SIGMOD
Year: 2001
Pagerank: 0.0003460594
Overall Rank: 208 | 98.56%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 50 of 52 citing papers.

Rank Citing Paper Year Venue Pagerank
107 WebTables: Exploring the Power of Tables on the Web 2008 VLDB 0.00048377684
303 Generic Schema Matching with Cupid 2001 VLDB 0.00028301477
382 COMA - A system for flexible combination of schema matching approaches 2002 VLDB 0.00024823252
420 InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables 2012 SIGMOD 0.00023719065
475 Mining Database Structure; Or, How to Build a Data Quality Browser 2002 SIGMOD 0.00022303253
518 Data Integration for the Relational Web 2009 VLDB 0.00021158934
672 An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web 2004 SIGMOD 0.00018355746
692 Pay-as-you-go User Feedback for Dataspace Systems 2008 SIGMOD 0.00018083948
893 Data Integration: The Teenage Years 2006 VLDB 0.00015558352
902 Statistical Schema Matching across Web Query Interfaces 2003 SIGMOD 0.00015486247
916 On Schema Matching with Opaque Column Names and Data Values 2003 SIGMOD 0.00015379422
1,147 Web-scale Data Integration: You can only afford to Pay As You Go 2007 CIDR 0.00013677658
1,198 Crossing the Structure Chasm 2003 CIDR 0.00013366708
1,252 Principles of Dataspace Systems 2006 PODS 0.00013033186
1,527 Generic Schema Matching, Ten Years Later 2011 VLDB 0.00011499442
1,537 Google's Deep-Web Crawl 2008 VLDB 0.00011465704
1,693 Merging Models Based on Given Correspondences 2003 VLDB 0.00010900382
1,762 Tuning Schema Matching Software using Synthetic Scenarios 2005 VLDB 0.00010646894
2,078 Sample-Driven Schema Mapping 2012 SIGMOD 9.599707e-05
2,174 iMAP: Discovering Complex Semantic Matches between Database Schemas 2004 SIGMOD 9.3672342e-05
2,333 A Platform for Personal Information Management and Integration 2005 CIDR 9.0169986e-05
2,425 Instance-based Schema Matching for Web Databases by Domain-specific Query Probing 2004 VLDB 8.8376569e-05
2,447 WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce 2003 VLDB 8.8037197e-05
2,479 Efficient Query Reformulation in Peer Data Management Systems 2004 SIGMOD 8.6909119e-05
2,984 Efficiently Incorporating User Feedback into Information Extraction and Integration Programs 2009 SIGMOD 7.7796344e-05
3,328 Multi-column Substring Matching for Database Schema Translation 2006 VLDB 7.2174278e-05
3,426 Discovering Topical Structures of Databases 2008 SIGMOD 7.1063105e-05
3,724 Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web 2005 CIDR 6.8173288e-05
5,032 Actively Soliciting Feedback for Query Answers in Keyword Search-Based Data Integration 2013 VLDB 5.748807e-05
5,174 Mapping Maintenance for Data Integration Systems 2005 VLDB 5.6443463e-05
5,289 Exchanging Intensional XML Data 2003 SIGMOD 5.5831013e-05
5,549 Query Processing over Incomplete Autonomous Databases 2007 VLDB 5.4428494e-05
5,571 HAMSTER: Using Search Clicklogs for Schema and Taxonomy Matching 2009 VLDB 5.4283499e-05
6,024 Similarity Search for Web Services 2004 VLDB 5.2415551e-05
6,290 Putting Context into Schema Matching 2006 VLDB 5.1271647e-05
6,652 Information Preserving XML Schema Embedding 2005 VLDB 4.9761854e-05
6,676 A Gauss Function Based Approach for Unbalanced Ontology Matching 2009 SIGMOD 4.9659764e-05
6,713 Query Relaxation Using Malleable Schemas 2007 SIGMOD 4.951387e-05
6,758 Data Migration using Datalog Program Synthesis 2020 VLDB 4.937199e-05
6,792 Automatically Incorporating New Sources in Keyword Search-Based Data Integration 2010 SIGMOD 4.9249098e-05
6,827 Light-weight Domain-based Form Assistant: Querying Web Databases On the Fly 2005 VLDB 4.9137918e-05
7,006 Synthesizing Products for Online Catalogs 2011 VLDB 4.8653916e-05
7,661 Just-In-Time Query Retrieval Over Partially Indexed Data on Structured P2P Overlays 2008 SIGMOD 4.6862657e-05
8,824 Analyzing and Revising Data Integration Schemas to Improve Their Matchability 2008 VLDB 4.4415658e-05
9,777 Data Augmentation for ML-driven Data Preparation and Integration 2021 VLDB 4.2856106e-05
9,818 Structures, Semantics and Statistics 2004 VLDB 4.2777808e-05
11,770 Staging User Feedback toward Rapid Conflict Resolution in Data Fusion 2017 SIGMOD 4.1945683e-05
12,170 Schema-As-You-Go: On Probabilistic Tagging and Querying of Wide Tables 2011 SIGMOD 4.1945683e-05
12,218 A Learning Algorithm for Top-Down XML Transformations 2010 PODS 4.1945683e-05
12,326 Kosmix: High-Performance Topic Exploration using the Deep Web 2009 VLDB 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers