Table Extraction and Understanding for Scientific and Enterprise Applications
Summary: Survey and tutorial on table extraction and understanding for scientific and enterprise docs. From border/cell segmentation to semantic linking with headers, units, captions, and surrounding text, it surveys methods, open problems, and applications. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,115 | ST-Raptor: LLM-Powered Semi-Structured Table Question Answering | 2026 | SIGMOD | 4.1945683e-05 |
| 10,117 | AixelAsk: A Stepwise-Guided Retrieval and Reasoning Framework for Large Table QA | 2026 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 1,001 | Recovering Semantics of Tables on the Web | 2011 | VLDB | 0.00014706505 |
| 3,155 | Ten Years of WebTables | 2018 | VLDB | 7.4672742e-05 |
| 3,285 | Using the Structure of Web Sites for Automatic Segmentation of Tables | 2004 | SIGMOD | 7.2759001e-05 |
| 3,303 | Fonduer: Knowledge Base Construction from Richly Formatted Data | 2018 | SIGMOD | 7.2487486e-05 |
| 8,467 | Creation and Interaction with Large-scale Domain-Specific Knowledge Bases | 2017 | VLDB | 4.504802e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,973 | Unstructured Data Fusion for Schema and Data Extraction | 2024 | SIGMOD | 4.1945683e-05 |
| 1,001 | Recovering Semantics of Tables on the Web | 2011 | VLDB | 0.00014706505 |
| 10,109 | Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations | 2026 | SIGMOD | 4.1945683e-05 |
| 2,633 | Schema Extraction for Tabular Data on the Web | 2013 | VLDB | 8.4063569e-05 |
| 3,742 | TEGRA: Table Extraction by Global Record Alignment | 2015 | SIGMOD | 6.7966898e-05 |
| 9,423 | Database Principles in Information Extraction | 2014 | PODS | 4.3441378e-05 |
| 8,135 | Applying WebTables in Practice | 2015 | CIDR | 4.5777549e-05 |
| 5,449 | Transformers for Tabular Data Representation: A Tutorial on Models and Applications | 2022 | VLDB | 5.5008652e-05 |
| 1,317 | Harvesting Relational Tables from Lists on the Web | 2009 | VLDB | 0.00012625853 |
| 8,913 | Making Table Understanding Work in Practice | 2022 | CIDR | 4.427232e-05 |