Database Paper Browser

Schema Extraction for Tabular Data on the Web

Summary: Presents a CRF-based method for extracting schemas from tabular data on the Web, using a novel feature encoding, logarithmic binning, to infer a table's structure from nearby rows. The method extends beyond HTML tables to full tables (including spreadsheets), extracts row groups, and surpasses WebTables in schema accuracy. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10707
Venue: VLDB
Year: 2013
Pagerank: 8.4063569e-05
Overall Rank: 2,633 | 81.69%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
