From Focused Crawling to Expert Information: an Application Framework for Web Exploration and Portal Generation
Summary: BINGO! is a focused crawling framework for building theme-specific web directories and portals. Starting from seed pages, it covers both the Surface Web and the Deep Web. The crawler is adaptive: it uses HITS link analysis and a linear SVM classifier to select crawl targets, retrains, and exposes hidden sources as ontology-based Semantic Web services. Also covered: HIP and MIPS. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Sergej Sizov
2. Jens Graupmann
3. Martin Theobald
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 234 | Crawling the Hidden Web | 2001 | VLDB | 0.00032018108 |
| 1,132 | Building light-weight wrappers for legacy Web data-sources using W4F | 1999 | VLDB | 0.00013777657 |
| 3,950 | Probe, Count, and Classify: Categorizing Hidden-Web Databases | 2001 | SIGMOD | 6.5953844e-05 |
| 12,615 | The BINGO! System for Information Portal Generation and Expert Web Search | 2003 | CIDR | 4.1945683e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,260 | Deep Web Integration with VisQI | 2010 | VLDB | 4.1945683e-05 |
| 2,095 | Knocking the Door to the Deep Web: Integrating Web Query Interfaces | 2004 | SIGMOD | 9.5505068e-05 |
| 1,537 | Google's Deep-Web Crawl | 2008 | VLDB | 0.00011465704 |
| 3,950 | Probe, Count, and Classify: Categorizing Hidden-Web Databases | 2001 | SIGMOD | 6.5953844e-05 |
| 409 | Focused Crawling Using Context Graphs | 2000 | VLDB | 0.00023944056 |
| 672 | An Interactive Clustering-based Approach to Integrating Source Query Interfaces on the Deep Web | 2004 | SIGMOD | 0.00018355746 |
| 8,678 | Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment | 2019 | SIGMOD | 4.4702119e-05 |
| 7,768 | Accurate and Efficient Crawling for Relevant Websites | 2004 | VLDB | 4.6563056e-05 |
| 234 | Crawling the Hidden Web | 2001 | VLDB | 0.00032018108 |
| 12,615 | The BINGO! System for Information Portal Generation and Expert Web Search | 2003 | CIDR | 4.1945683e-05 |