Distributed Hypertext Resource Discovery Through Examples
Summary: Relational-DB based hypertext resource discovery architecture that queries page content, metadata, and hyperlink structure. A crawler guided by a relevance classifier and a distiller, integrated in IBM Universal DB, yields substantial I/O savings and SQL-driven adaptive strategies; experiments demonstrate efficiency, effectiveness, and robustness. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 409 | Focused Crawling Using Context Graphs | 2000 | VLDB | 0.00023944056 |
| 7,768 | Accurate and Efficient Crawling for Relevant Websites | 2004 | VLDB | 4.6563056e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 186 | Proximity Search in Databases | 1998 | VLDB | 0.00036215179 |
| 1,606 | Enhanced hypertext categorization using hyperlinks | 1998 | SIGMOD | 0.00011174873 |
| 1,759 | Information Translation, Mediation, and Mosaic-Based Browsing in the TSIMMIS System | 1995 | SIGMOD | 0.00010653127 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,691 | Efficient and Effective Metasearch for Text Databases Incorporating Linkages among Documents | 2001 | SIGMOD | 4.466355e-05 |
| 1,616 | Relational link-based ranking | 2004 | VLDB | 0.00011128652 |
| 12,928 | Indexing in a Hypertext Database | 1990 | VLDB | 4.1945683e-05 |
| 12,669 | Self-similarity in the web | 2001 | VLDB | 4.1945683e-05 |
| 13,602 | Information Discovery in Loosely Integrated Data | 2007 | SIGMOD | - |
| 1,606 | Enhanced hypertext categorization using hyperlinks | 1998 | SIGMOD | 0.00011174873 |
| 3,950 | Probe, Count, and Classify: Categorizing Hidden-Web Databases | 2001 | SIGMOD | 6.5953844e-05 |
| 5,672 | Effective Keyword-based Selection of Relational Databases | 2007 | SIGMOD | 5.3784128e-05 |
| 9,548 | Optimal Algorithms for Crawling a Hidden Database in the Web | 2012 | VLDB | 4.3258142e-05 |
| 1,492 | Distributed Search over the Hidden Web: Hierarchical Database Sampling and Selection | 2002 | VLDB | 0.00011694396 |