Instance-based Schema Matching for Web Databases by Domain-specific Query Probing
Summary: Proposes a domain-specific model that separates the interface schema from the result schema of a web database, with instance-based matching guided by query probing. Addresses intra-site extraction and inter-site integration with cross-validation, and reports strong precision and recall on web data. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Jiying Wang
2. Ji-Rong Wen
3. Fred Lochovsky
4. Wei-Ying Ma
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,537 | Google's Deep-Web Crawl | 2008 | VLDB | 0.00011465704 |
| 1,858 | Bootstrapping Pay-As-You-Go Data Integration Systems | 2008 | SIGMOD | 0.00010301124 |
| 3,797 | Stitching Web Tables for Improving Matching Quality | 2017 | VLDB | 0.000067597149 |
| 5,774 | A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration | 2009 | VLDB | 0.000053313642 |
| 7,397 | A Probabilistic Approach for Automatically Filling Form-Based Web Interfaces | 2011 | VLDB | 0.000047417648 |
| 8,460 | WISE-Integrator: A System for Extracting and Integrating Complex Web Search Interfaces of the Deep Web | 2005 | VLDB | 0.000045061526 |
| 9,943 | Stop Word and Related Problems in Web Interface Integration | 2009 | VLDB | 0.000042456408 |
| 12,326 | Kosmix: High-Performance Topic Exploration using the Deep Web | 2009 | VLDB | 0.000041945683 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 208 | Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach | 2001 | SIGMOD | 0.0003460594 |
| 234 | Crawling the Hidden Web | 2001 | VLDB | 0.00032018108 |
| 303 | Generic Schema Matching with Cupid | 2001 | VLDB | 0.00028301477 |
| 382 | COMA - A system for flexible combination of schema matching approaches | 2002 | VLDB | 0.00024823252 |
| 533 | RoadRunner: Towards Automatic Data Extraction from Large Web Sites | 2001 | VLDB | 0.00020757722 |
| 587 | Extracting Structured Data from Web Pages | 2003 | SIGMOD | 0.00019648348 |
| 902 | Statistical Schema Matching across Web Query Interfaces | 2003 | SIGMOD | 0.00015486247 |
| 1,131 | Automatic Discovery of Language Models for Text Databases | 1999 | SIGMOD | 0.00013777757 |
| 2,447 | WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-Commerce | 2003 | VLDB | 0.000088037197 |
| 3,637 | Semantic Integration in Heterogeneous Databases Using Neural Networks | 1994 | VLDB | 0.000068959519 |