Ten Years of WebTables
Summary: Retrospective on WebTables: exploiting accessible HTML tables as informal structured data. Surveys a decade of progress across academia and industry, situates WebTables in the data-management landscape, and sketches a forward-looking agenda for informal data research. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Michael Cafarella
2. Alon Halevy
3. Hongrae Lee
4. Jayant Madhavan
5. Cong Yu
6. Daisy Zhe Wang
7. Eugene Wu
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 3,520 | GitTables: A Large-Scale Corpus of Relational Tables | 2023 | SIGMOD | 7.0131061e-05 |
| 4,630 | Knowledge Graphs 2021: A Data Odyssey | 2021 | VLDB | 6.0348379e-05 |
| 7,424 | Table Extraction and Understanding for Scientific and Enterprise Applications | 2020 | VLDB | 4.7339251e-05 |
| 8,787 | QuTE: Answering Quantity Queries from Web Tables | 2021 | SIGMOD | 4.4520613e-05 |
| 9,161 | Automatically Generating Interesting Facts from Wikipedia Tables | 2019 | SIGMOD | 4.3849295e-05 |
| 11,063 | Searching Data Lakes for Nested and Joined Data | 2024 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,401 | Large-Scale Collaborative Analysis and Extraction of Web Data | 2008 | VLDB | 4.1945683e-05 |
| 13,986 | Is Web-site Management a Database Problem? | 1998 | VLDB | - |
| 14,014 | Future Directions and Research Problems in the World Wide Web | 1996 | PODS | - |
| 13,820 | Data Management: Lasting Impact of the Wild, Wild, Web | 2001 | SIGMOD | - |
| 2,633 | Schema Extraction for Tabular Data on the Web | 2013 | VLDB | 8.4063569e-05 |
| 13,588 | Databases on the Web | 2007 | SIGMOD | - |
| 6,586 | Web Data Management | 2011 | SIGMOD | 5.0023398e-05 |
| 1,851 | An Analysis of Structured Data on the Web | 2012 | VLDB | 0.00010327871 |
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 8,135 | Applying WebTables in Practice | 2015 | CIDR | 4.5777549e-05 |