Statistical Schema Matching across Web Query Interfaces
Summary: MGS is a statistical schema-matching framework that discovers a hidden generative schema model in order to match many input schemas holistically rather than pair by pair. Its specialization, MGS_sd, targets synonym discovery. It was tested on hundreds of Web sources across four domains and showed robust accuracy, given the vocabulary convergence observed among deep-Web sources.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 3434
- Venue: SIGMOD
- Year: 2003
- Pagerank: 0.00015486247
- Overall Rank: 902 | 93.73%
- DOI:
Incoming Citations (Sorted by Pagerank)
Showing 29 of 29 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 420 | InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables | 2012 | SIGMOD | 0.00023719065 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746 |
| 893 | Data Integration: The Teenage Years | 2006 | VLDB | 0.00015558352 |
| 1,178 | Table Union Search on Open Data | 2018 | VLDB | 0.00013468118 |
| 1,527 | Generic Schema Matching, Ten Years Later | 2011 | VLDB | 0.00011499442 |
| 1,762 | Tuning Schema Matching Software using Synthetic Scenarios | 2005 | VLDB | 0.00010646894 |
| 1,858 | Bootstrapping Pay-As-You-Go Data Integration Systems | 2008 | SIGMOD | 0.00010301124 |
| 2,095 | Knocking the Door to the Deep Web: Integrating Web Query Interfaces | 2004 | SIGMOD | 9.5505068e-05 |
| 2,174 | iMAP: Discovering Complex Semantic Matches between Database Schemas | 2004 | SIGMOD | 9.3672342e-05 |
| 2,362 | Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax | 2004 | SIGMOD | 8.9582251e-05 |
| 2,425 | Instance-based Schema Matching for Web Databases by Domain-specific Query Probing | 2004 | VLDB | 8.8376569e-05 |
| 2,730 | Open Data Integration | 2018 | VLDB | 8.2126735e-05 |
| 3,724 | Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web | 2005 | CIDR | 6.8173288e-05 |
| 3,797 | Stitching Web Tables for Improving Matching Quality | 2017 | VLDB | 6.7597149e-05 |
| 4,229 | Harnessing the Deep Web: Present and Future | 2009 | CIDR | 6.3399547e-05 |
| 5,174 | Mapping Maintenance for Data Integration Systems | 2005 | VLDB | 5.6443463e-05 |
| 6,713 | Query Relaxation Using Malleable Schemas | 2007 | SIGMOD | 4.951387e-05 |
| 6,827 | Light-weight Domain-based Form Assistant: Querying Web Databases On the Fly | 2005 | VLDB | 4.9137918e-05 |
| 8,154 | MetaQuerier: Querying Structured Web Sources On-the-fly | 2005 | SIGMOD | 4.5745458e-05 |
| 8,460 | WISE-Integrator: A System for Extracting and Integrating Complex Web Search Interfaces of the Deep Web | 2005 | VLDB | 4.5061526e-05 |
| 8,499 | Synthesizing Mapping Relationships Using Table Corpus | 2017 | SIGMOD | 4.4975851e-05 |
| 8,823 | The Role of Schema Matching in Large Enterprises | 2009 | CIDR | 4.4415658e-05 |
| 8,878 | Learning to Extract Form Labels | 2008 | VLDB | 4.4302126e-05 |
| 9,548 | Optimal Algorithms for Crawling a Hidden Database in the Web | 2012 | VLDB | 4.3258142e-05 |
| 9,549 | Attribute Domain Discovery for Hidden Web Databases | 2011 | SIGMOD | 4.3258142e-05 |
| 9,818 | Structures, Semantics and Statistics | 2004 | VLDB | 4.2777808e-05 |
| 9,943 | Stop Word and Related Problems in Web Interface Integration | 2009 | VLDB | 4.2456408e-05 |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 12,478 | Randomized Algorithms for Data Reconciliation in Wide Area Aggregate Query Processing | 2007 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.