Database Paper Browser

Back to papers

Automatic Wrappers for Large Scale Web Extraction

Summary: Unsupervised wrapper induction tolerant to noisy training data enables scalable information extraction from data without site-level supervision. Generic framework yields accuracy surpassing unsupervised methods; deployed at Yahoo powering live apps. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10251
Venue
VLDB
Year
2011
Pagerank
6.8517545e-05
Overall Rank
3,678 | 74.42%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 11 of 11 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers