Improving Information Extraction from Visually Rich Documents using Visual Span Representations
Summary: Artemis, a visually aware IE method for heterogeneous visually rich documents, encodes visual+textual+layout context into fixed-length span representations. Minimal supervision for visual-span boundaries; multimodal embeddings boost IE, up to 17 F1 points on four datasets. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ritesh Sarkhel
- 2. Arnab Nandi
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,126 | Visual Template Inference for Data Extraction from Documents | 2026 | SIGMOD | 4.1945683e-05 |
| 11,256 | Self-Training for Label-Efficient Information Extraction from Semi-Structured Web-Pages | 2023 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 254 | Snorkel: Rapid Training Data Creation with Weak Supervision | 2018 | VLDB | 0.00030540555 |
| 3,303 | Fonduer: Knowledge Base Construction from Richly Formatted Data | 2018 | SIGMOD | 7.2487486e-05 |
| 4,087 | Snorkel: Fast Training Set Generation for Information Extraction | 2017 | SIGMOD | 6.4607746e-05 |
| 6,135 | Extracting Logical Hierarchical Structure of HTML Documents Based on Headings | 2015 | VLDB | 5.1930114e-05 |
| 6,412 | CERES: Distantly Supervised Relation Extraction from the Semi-Structured Web | 2018 | VLDB | 5.0740036e-05 |
| 8,461 | Visual Segmentation for Information Extraction from Heterogeneous Visually Rich Documents | 2019 | SIGMOD | 4.5061205e-05 |
Previous
Page 1 / 1
Next