Database Paper Browser

Back to papers

Automatic Rule Refinement for Information Extraction

Summary: Uses data-provenance lineage techniques to guide automatic refinement of rule-based information extraction. Given labeled correct/incorrect extractions, it produces a ranked list of rule modifications for expert refinement; implemented in SystemT and demonstrated effective improvement. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10107
Venue
VLDB
Year
2010
Pagerank
5.0244622e-05
Overall Rank
6,534 | 54.55%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 8 of 8 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
31 Provenance Semirings 2007 PODS 0.0007857786
287 Declarative Information Extraction Using Datalog with Embedded Extraction Predicates 2007 VLDB 0.00028971272
487 Why Not? 2009 SIGMOD 0.00022050218
652 On the Provenance of Non-Answers to Queries over Extracted Data 2008 VLDB 0.00018634477
2,562 Explaining Missing Answers to SPJUA Queries 2010 VLDB 8.5386194e-05
3,477 Toward Best-Effort Information Extraction 2008 SIGMOD 7.0583481e-05
7,280 I4E: Interactive Investigation of Iterative Information Extraction 2010 SIGMOD 4.778826e-05
Previous Page 1 / 1 Next

Semantically Similar Papers