Enterprise Information Extraction: Recent Developments and Open Challenges
Summary: Survey of enterprise information extraction: declarative languages, scalable infrastructure, and development tooling for DB-style optimization. Identifies open challenges and opportunities for the database community to advance enterprise IE. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Laura Chiticariu
- 2. Yunyao Li
- 3. Sriram Raghavan
- 4. Frederick R. Reiss
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,102 | Mondrian: Spreadsheet Layout Detection | 2022 | SIGMOD | 4.8307982e-05 |
| 8,461 | Visual Segmentation for Information Extraction from Heterogeneous Visually Rich Documents | 2019 | SIGMOD | 4.5061205e-05 |
| 11,420 | Detecting Layout Templates in Complex Multiregion Files | 2022 | VLDB | 4.1945683e-05 |
| 11,975 | Which Concepts Are Worth Extracting? | 2014 | SIGMOD | 4.1945683e-05 |
| 13,491 | The SystemT IDE: An Integrated Development Environment for Information Extraction Rules | 2011 | SIGMOD | - |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,669 | Contextual Insight in Search: Enabling Technologies and Applications | 2005 | VLDB | - |
| 12,115 | Just-in-Time Information Extraction using Extraction Views | 2012 | SIGMOD | 4.1945683e-05 |
| 8,457 | The Continued Saga of DB-IR Integration | 2004 | VLDB | 4.5073797e-05 |
| 1,552 | Overview of Data Exploration Techniques | 2015 | SIGMOD | 0.00011408814 |
| 13,491 | The SystemT IDE: An Integrated Development Environment for Information Extraction Rules | 2011 | SIGMOD | - |
| 13,720 | QXtract: A Building Block for Efficient Information Extraction from Text Databases | 2003 | SIGMOD | - |
| 287 | Declarative Information Extraction Using Datalog with Embedded Extraction Predicates | 2007 | VLDB | 0.00028971272 |
| 1,395 | Structured Querying of Web Text: A Technical Challenge | 2007 | CIDR | 0.00012207039 |
| 13,626 | Managing Information Extraction [Tutorial Outline] | 2006 | SIGMOD | - |
| 9,423 | Database Principles in Information Extraction | 2014 | PODS | 4.3441378e-05 |