Database Paper Browser

Back to papers

Accurate and Efficient Crawling for Relevant Websites

Summary: Two-level focused website crawler with graph-based external site selection and per-site focused page crawling. Models websites as first-class units and beats prior site-adapted focused crawlers by efficient, targeted intra-site discovery. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9151
Venue
VLDB
Year
2004
Pagerank
4.6563056e-05
Overall Rank
7,768 | 45.96%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank Citing Paper Year Venue Pagerank
5,442 RankMass Crawler: A Crawler with High Personalized PageRank Coverage Guarantee 2007 VLDB 5.5026403e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
409 Focused Crawling Using Context Graphs 2000 VLDB 0.00023944056
771 Distributed Hypertext Resource Discovery Through Examples 1999 VLDB 0.00016887664
1,606 Enhanced hypertext categorization using hyperlinks 1998 SIGMOD 0.00011174873
Previous Page 1 / 1 Next

Semantically Similar Papers