A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration
Summary: The paper extracts Web query interfaces hierarchically into schema trees for integration, merging text and field tokens based on layout. Domain rules map interfaces to schemas. The approach was tested on 3 corpora with 500+ interfaces across 15 domains and reports accuracy gains of ~6.5%. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Eduard C. Dragut
2. Thomas Kabisch
3. Clement Yu
4. Ulf Leser
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,133 | DIADEM: Thousands of Websites to a Single Database | 2014 | VLDB | 5.1954702e-05 |
| 9,248 | Web Record Extraction with Invariants | 2023 | VLDB | 4.3690661e-05 |
| 9,432 | Aggregate Estimation Over Dynamic Hidden Web Databases | 2014 | VLDB | 4.3431757e-05 |
| 9,548 | Optimal Algorithms for Crawling a Hidden Database in the Web | 2012 | VLDB | 4.3258142e-05 |
| 9,549 | Attribute Domain Discovery for Hidden Web Databases | 2011 | SIGMOD | 4.3258142e-05 |
| 9,943 | Stop Word and Related Problems in Web Interface Integration | 2009 | VLDB | 4.2456408e-05 |
| 12,260 | Deep Web Integration with VisQI | 2010 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 234 | Crawling the Hidden Web | 2001 | VLDB | 0.00032018108 |
| 382 | COMA - A system for flexible combination of schema matching approaches | 2002 | VLDB | 0.00024823252 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746 |
| 1,147 | Web-scale Data Integration: You can only afford to Pay As You Go | 2007 | CIDR | 0.00013677658 |
| 2,362 | Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax | 2004 | SIGMOD | 8.9582251e-05 |
| 2,425 | Instance-based Schema Matching for Web Databases by Domain-specific Query Probing | 2004 | VLDB | 8.8376569e-05 |
| 3,551 | Data Management Projects at Google | 2006 | SIGMOD | 6.9812665e-05 |
| 7,422 | Meaningful Labeling of Integrated Query Interfaces | 2006 | VLDB | 4.7343948e-05 |
| 8,154 | MetaQuerier: Querying Structured Web Sources On-the-fly | 2005 | SIGMOD | 4.5745458e-05 |