Attribute Domain Discovery for Hidden Web Databases
Summary: Addresses attribute domain discovery for hidden web databases through their form-based interfaces, where achievability is guaranteed solely by interface design. Proposes novel techniques with guarantees on discovery completeness, supported by theoretical analysis and extensive experiments. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Xin Jin
2. Nan Zhang
3. Gautam Das
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,678 | Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment | 2019 | SIGMOD | 4.4702119e-05 |
| 9,548 | Optimal Algorithms for Crawling a Hidden Database in the Web | 2012 | VLDB | 4.3258142e-05 |
| 12,181 | MOBIES: Mobile-Interface Enhancement Service for Hidden Web Database | 2011 | SIGMOD | 4.1945683e-05 |
| 12,189 | Randomized Generalization for Aggregate Suppression Over Hidden Web Databases | 2011 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 234 | Crawling the Hidden Web | 2001 | VLDB | 0.00032018108 |
| 902 | Statistical Schema Matching across Web Query Interfaces | 2003 | SIGMOD | 0.00015486247 |
| 1,492 | Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection | 2002 | VLDB | 0.00011694396 |
| 2,362 | Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax | 2004 | SIGMOD | 8.9582251e-05 |
| 2,813 | Mining Search Engine Query Logs via Suggestion Sampling | 2008 | VLDB | 8.0773142e-05 |
| 3,246 | Accessing the Web: From Search to Integration | 2006 | SIGMOD | 7.326175e-05 |
| 5,140 | A Random Walk Approach to Sampling Hidden Databases | 2007 | SIGMOD | 5.668209e-05 |
| 5,774 | A Hierarchical Approach to Model Web Query Interfaces for Web Source Integration | 2009 | VLDB | 5.3313642e-05 |
| 7,422 | Meaningful Labeling of Integrated Query Interfaces | 2006 | VLDB | 4.7343948e-05 |
| 8,684 | Unbiased Estimation of Size and Other Aggregates Over Hidden Web Databases | 2010 | SIGMOD | 4.4677591e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,301 | Privacy Preservation of Aggregates in Hidden Databases: Why and How? | 2009 | SIGMOD | 4.1945683e-05 |
| 146 | Knowledge Discovery in Databases: An Attribute-Oriented Approach | 1992 | VLDB | 0.00041315295 |
| 9,548 | Optimal Algorithms for Crawling a Hidden Database in the Web | 2012 | VLDB | 4.3258142e-05 |
| 8,684 | Unbiased Estimation of Size and Other Aggregates Over Hidden Web Databases | 2010 | SIGMOD | 4.4677591e-05 |
| 3,823 | Automatic Discovery of Attributes in Relational Databases | 2011 | SIGMOD | 6.7261168e-05 |
| 12,223 | Schema Clustering and Retrieval for Multi-domain Pay-As-You-Go Data Integration Systems | 2010 | SIGMOD | 4.1945683e-05 |
| 7,422 | Meaningful Labeling of Integrated Query Interfaces | 2006 | VLDB | 4.7343948e-05 |
| 2,425 | Instance-based Schema Matching for Web Databases by Domain-specific Query Probing | 2004 | VLDB | 8.8376569e-05 |
| 8,129 | Discovering the Skyline of Web Databases | 2016 | VLDB | 4.5784968e-05 |
| 5,529 | Data-Driven Domain Discovery for Structured Datasets | 2020 | VLDB | 5.4566641e-05 |