Database Paper Browser

Back to papers

RoadRunner: Automatic Data Extraction from Data-Intensive Web Sites

Summary: RoadRunner applies a matching technique to auto-generate a common wrapper/schema from pages in the same class. Uses Approximate and Partial Matching to cope with irregular HTML, yielding a single wrapper/schema for varied pages. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
3399
Venue
SIGMOD
Year
2002
Pagerank
5.0797045e-05
Overall Rank
6,403 | 55.46%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 1 of 1 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
533 RoadRunner: Towards Automatic Data Extraction from Large Web Sites 2001 VLDB 0.00020757722
Previous Page 1 / 1 Next

Semantically Similar Papers