Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach
Summary: Introduces LSD, a semi-automatic data integration system that learns semantic mappings to a schema from seed mappings. It combines multiple learners—using schema, data, domain constraints, and XML structure—with a meta-learner to map sources. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. AnHai Doan
- 2. Pedro Domingos
- 3. Alon Halevy
Incoming Citations (Sorted by Pagerank)
Showing 2 of 52 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,546 | SMART: A Tool for Semantic-Driven Creation of Complex XML Mappings | 2005 | SIGMOD | 4.1945683e-05 |
| 12,633 | Schema-driven Customization of Web Services | 2003 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 127 | Querying Heterogeneous Information Sources Using Source Descriptions | 1996 | VLDB | 0.00044642203 |
| 151 | Optimizing Queries across Diverse Data Sources | 1997 | VLDB | 0.00041016476 |
| 173 | Schema Mapping as Query Discovery | 2000 | VLDB | 0.00038627829 |
| 294 | Using Schema Matching to Simplify Heterogeneous Data Translation | 1998 | VLDB | 0.00028669519 |
| 394 | An Adaptive Query Execution System for Data Integration* | 1999 | SIGMOD | 0.00024460855 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,529 | Data-Driven Domain Discovery for Structured Datasets | 2020 | VLDB | 5.4566641e-05 |
| 3,110 | Learning to Create Data-Integrating Queries | 2008 | VLDB | 7.5475982e-05 |
| 5,564 | Interactive Generation of Integrated Schemas | 2008 | SIGMOD | 5.4320854e-05 |
| 1,858 | Bootstrapping Pay-As-You-Go Data Integration Systems | 2008 | SIGMOD | 0.00010301124 |
| 3,992 | Discovering Linkage Points over Web Data | 2013 | VLDB | 6.5544834e-05 |
| 6,792 | Automatically Incorporating New Sources in Keyword Search-Based Data Integration | 2010 | SIGMOD | 4.9249098e-05 |
| 3,637 | Semantic Integration in Heterogeneous Databases Using Neural Networks | 1994 | VLDB | 6.8959519e-05 |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 902 | Statistical Schema Matching across Web Query Interfaces | 2003 | SIGMOD | 0.00015486247 |
| 8,824 | Analyzing and Revising Data Integration Schemas to Improve Their Matchability | 2008 | VLDB | 4.4415658e-05 |