Database Paper Browser

Back to papers

Computational Aspects of Resilient Data Extraction from Semistructured Sources

Summary: Formalizes resilient extraction using "unambiguous extraction expressions" (regular expressions with extra structure), defining resilience as producing maximal such expressions. Derives characterization theorems, complexity bounds for testing, and synthesis algorithms for maximal extractors. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
1203
Venue
PODS
Year
2000
Pagerank
4.8857878e-05
Overall Rank
6,958 | 51.60%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank Citing Paper Year Venue Pagerank
2,698 Visual Web Information Extraction with Lixto* 2001 VLDB 8.2753317e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
114 A Query Language and Optimization Techniques for Unstructured Data 1996 SIGMOD 0.00046339735
385 NoDoSE - A Tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents. 1998 SIGMOD 0.00024795739
1,314 Semistructured Data 1997 PODS 0.0001263326
1,919 Cut and Paste 1997 PODS 0.00010094755
3,150 Template-Based Wrappers in the TSIMMIS System 1997 SIGMOD 7.4736975e-05
Previous Page 1 / 1 Next

Semantically Similar Papers