Database Paper Browser

Statistical Schema Matching across Web Query Interfaces

Summary: Statistical schema matching aligns many input schemas holistically by discovering a hidden generative schema model, rather than relying on pairwise attribute matches. MGS is the general framework; MGS_sd specializes it for synonym discovery. The approach was tested on hundreds of Web sources across four domains and showed robust accuracy; it is motivated by the observation that schema vocabulary converges across deep-Web sources. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 3434
Venue: SIGMOD
Year: 2003
Pagerank: 0.00015486247
Overall Rank: 902 | 93.73%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 29 of 29 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
420 | InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables | 2012 | SIGMOD | 0.00023719065
672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746
893 | Data Integration: The Teenage Years | 2006 | VLDB | 0.00015558352
1,178 | Table Union Search on Open Data | 2018 | VLDB | 0.00013468118
1,527 | Generic Schema Matching, Ten Years Later | 2011 | VLDB | 0.00011499442
1,762 | Tuning Schema Matching Software using Synthetic Scenarios | 2005 | VLDB | 0.00010646894
1,858 | Bootstrapping Pay-As-You-Go Data Integration Systems | 2008 | SIGMOD | 0.00010301124
2,095 | Knocking the Door to the Deep Web: Integrating Web Query Interfaces | 2004 | SIGMOD | 9.5505068e-05
2,174 | iMAP: Discovering Complex Semantic Matches between Database Schemas | 2004 | SIGMOD | 9.3672342e-05
2,362 | Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax | 2004 | SIGMOD | 8.9582251e-05
2,425 | Instance-based Schema Matching for Web Databases by Domain-specific Query Probing | 2004 | VLDB | 8.8376569e-05
2,730 | Open Data Integration | 2018 | VLDB | 8.2126735e-05
3,724 | Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web | 2005 | CIDR | 6.8173288e-05
3,797 | Stitching Web Tables for Improving Matching Quality | 2017 | VLDB | 6.7597149e-05
4,229 | Harnessing the Deep Web: Present and Future | 2009 | CIDR | 6.3399547e-05
5,174 | Mapping Maintenance for Data Integration Systems | 2005 | VLDB | 5.6443463e-05
6,713 | Query Relaxation Using Malleable Schemas | 2007 | SIGMOD | 4.951387e-05
6,827 | Light-weight Domain-based Form Assistant: Querying Web Databases On the Fly | 2005 | VLDB | 4.9137918e-05
8,154 | MetaQuerier: Querying Structured Web Sources On-the-fly | 2005 | SIGMOD | 4.5745458e-05
8,460 | WISE-Integrator: A System for Extracting and Integrating Complex Web Search Interfaces of the Deep Web | 2005 | VLDB | 4.5061526e-05
8,499 | Synthesizing Mapping Relationships Using Table Corpus | 2017 | SIGMOD | 4.4975851e-05
8,823 | The Role of Schema Matching in Large Enterprises | 2009 | CIDR | 4.4415658e-05
8,878 | Learning to Extract Form Labels | 2008 | VLDB | 4.4302126e-05
9,548 | Optimal Algorithms for Crawling a Hidden Database in the Web | 2012 | VLDB | 4.3258142e-05
9,549 | Attribute Domain Discovery for Hidden Web Databases | 2011 | SIGMOD | 4.3258142e-05
9,818 | Structures, Semantics and Statistics | 2004 | VLDB | 4.2777808e-05
9,943 | Stop Word and Related Problems in Web Interface Integration | 2009 | VLDB | 4.2456408e-05
12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05
12,478 | Randomized Algorithms for Data Reconciliation in Wide Area Aggregate Query Processing | 2007 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers