Database Paper Browser

Cleaning Inconsistencies in Information Extraction via Prioritized Repairs

Summary: Declarative framework for cleaning inconsistent IE outputs by integrating prioritized repairs into document spanners, enabling user-declared conflict-resolution policies that capture industrial cleaning operations and POSIX regex semantics. Analyzes unambiguity and expressive power of such policies, with both positive and negative (decidability/complexity) results. (summarized by gpt-5-mini on Feb 09 2026)
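The idea of a user-declared conflict-resolution policy can be illustrated with a small sketch. It is not the paper's formal framework: it assumes that extracted spans are half-open character ranges, that two spans conflict when they overlap, and that the (hypothetical) policy prefers longer spans and breaks ties by earlier start. The names `overlaps` and `clean` are illustrative only.

```python
from typing import List, Tuple

# Assumption: a span is a half-open [start, end) range of character offsets.
Span = Tuple[int, int]


def overlaps(a: Span, b: Span) -> bool:
    """Two spans conflict (in this sketch) when their ranges intersect."""
    return a[0] < b[1] and b[0] < a[1]


def clean(spans: List[Span]) -> List[Span]:
    """Greedily keep the most-preferred spans that do not conflict.

    Hypothetical policy: longer spans win; ties go to the earlier start.
    Because this policy totally orders distinct spans, the result is unique.
    """
    ordered = sorted(set(spans), key=lambda s: (-(s[1] - s[0]), s[0]))
    kept: List[Span] = []
    for s in ordered:
        # Keep s only if it does not conflict with an already-kept span.
        if not any(overlaps(s, k) for k in kept):
            kept.append(s)
    return sorted(kept)


if __name__ == "__main__":
    print(clean([(0, 5), (3, 10), (10, 12), (11, 12)]))
```

Swapping the sort key changes the policy (for example, preferring the leftmost span first), which is the kind of choice the summary describes as user-declared.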

Paper ID: 1616
Venue: PODS
Year: 2014
Pagerank: 5.5295577e-05
Overall Rank: 5,398 | 62.45%
DOI: 10.1145/2595438.2594540

Authors

Incoming Citations (Sorted by Pagerank)

Showing 6 of 6 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
2,483 | Discovery of Approximate (and Exact) Denial Constraints | 2020 | VLDB | 8.6864916e-05
3,042 | Dichotomies in the Complexity of Preferred Repairs | 2015 | PODS | 7.669374e-05
7,702 | Counting and Enumerating (Preferred) Database Repairs | 2017 | PODS | 4.6736471e-05
8,722 | Preference-aware Integration of Temporal Data | 2015 | VLDB | 4.4606662e-05
9,423 | Database Principles in Information Extraction | 2014 | PODS | 4.3441378e-05
11,240 | Autonomously Computable Information Extraction | 2023 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 9 of 9 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers