Database Paper Browser

Back to papers

The Evolution of the Web and Implications for an Incremental Crawler

Summary: Incremental, selective updating of crawls to keep index and local collections fresh, not batch refresh. Empirical study over 0.5M pages in 4 months characterizes page evolution, compares crawl strategies, and proposes a hybrid architecture combining best choices for timelier updates. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
8647
Venue
VLDB
Year
2000
Pagerank
4.8925595e-05
Overall Rank
6,928 | 51.81%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 4 of 4 citing papers.

Rank Citing Paper Year Venue Pagerank
234 Crawling the Hidden Web 2001 VLDB 0.00032018108
8,320 Effective Change Detection Using Sampling 2002 VLDB 4.5435639e-05
12,333 NEAR-Miner: Mining Evolution Associations of Web Site Directories for Efficient Maintenance of Web Archives 2009 VLDB 4.1945683e-05
13,527 Dealing with Web Data: History and Look ahead 2010 VLDB -
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 1 of 1 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,304 Synchronizing a database to Improve Freshness 2000 SIGMOD 0.00012691283
Previous Page 1 / 1 Next

Semantically Similar Papers