Database Paper Browser

Back to papers

WADaR: Joint Wrapper and Data Repair

Summary: WADaR is a scalable tool for joint wrapper and data repair in web-scraped relations. It uses off-the-shelf entity recognizers to locate targets and Markov-chain repairs to fix data and wrappers; yields 15–60% quality gains and full wrapper repair in >50% without site knowledge. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11102
Venue
VLDB
Year
2015
Pagerank
5.1618114e-05
Overall Rank
6,195 | 56.91%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 4 of 4 citing papers.

Rank Citing Paper Year Venue Pagerank
3,252 Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks 2020 SIGMOD 7.3178277e-05
6,412 CERES: Distantly Supervised Relation Extraction from the Semi-Structured Web 2018 VLDB 5.0740036e-05
7,826 The Smallest Extraction Problem 2021 VLDB 4.6416742e-05
9,248 Web Record Extraction with Invariants 2023 VLDB 4.3690661e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers