Cut and Paste
Summary: EDITOR: a search-and-cut&paste DSL to select and restructure semi-structured Web documents; Java implementation used in ARANEUS to create DB views over sites. Proves EDITOR is computationally complete and identifies a 'safe' fragment that exactly captures polynomial-time restructurings. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 533 | RoadRunner: Towards Automatic Data Extraction from Large Web Sites | 2001 | VLDB | 0.00020757722 |
| 1,370 | Monadic Datalog and the Expressive Power of Languages for Web Information Extraction | 2002 | PODS | 0.00012338027 |
| 2,005 | Record-Boundary Discovery in Web Documents | 1999 | SIGMOD | 9.8112591e-05 |
| 2,204 | To Weave the Web | 1997 | VLDB | 9.2970809e-05 |
| 2,698 | Visual Web Information Extraction with Lixto* | 2001 | VLDB | 8.2753317e-05 |
| 3,657 | The ARANEUS Web-Base Management System | 1998 | SIGMOD | 6.8713279e-05 |
| 6,958 | Computational Aspects of Resilient Data Extraction from Semistructured Sources | 2000 | PODS | 4.8857878e-05 |
| 12,663 | Querying Websites Using Compact Skeletons | 2001 | PODS | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 114 | A Query Language and Optimization Techniques for Unstructured Data | 1996 | SIGMOD | 0.00046339735 |
| 393 | From Structured Documents to Novel Query Facilities | 1994 | SIGMOD | 0.00024524092 |
| 437 | W3QS: A Query System for the World-Wide Web | 1995 | VLDB | 0.00023240203 |
| 466 | Querying and Updating the File* | 1993 | VLDB | 0.00022453592 |
| 922 | Mind Your Grammar: a New Approach to Modelling Text | 1987 | VLDB | 0.00015297648 |
| 1,599 | Formal Models of Web Queries | 1997 | PODS | 0.00011202032 |
| 2,593 | A Database Interface for File Update | 1995 | SIGMOD | 8.4746844e-05 |
| 4,285 | A Query Language for List-Based Complex Objects | 1994 | PODS | 6.2913523e-05 |
| 6,129 | Sequences, Datalog and Transducers | 1995 | PODS | 5.1974539e-05 |
| 6,420 | Text Dominated Databases, Theory Practice and Experience | 1994 | PODS | 5.0689272e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,752 | Query Evaluation Over SLP-Represented Document Databases With Complex Document Editing | 2022 | PODS | 4.456315e-05 |
| 1,498 | A Language for Manipulating Arrays | 1997 | VLDB | 0.0001168534 |
| 2,342 | Rewriting of Regular Expressions and Regular Path Queries | 1999 | PODS | 9.0015589e-05 |
| 2,319 | Expressive and Flexible Access to Web-Extracted Data: A Keyword-based Structured Query Language | 2010 | SIGMOD | 9.0387108e-05 |
| 385 | NoDoSE - A Tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents. | 1998 | SIGMOD | 0.00024795739 |
| 14,300 | Unstructured Data Bases or Very Efficient Text Searching | 1983 | PODS | - |
| 6,958 | Computational Aspects of Resilient Data Extraction from Semistructured Sources | 2000 | PODS | 4.8857878e-05 |
| 2,204 | To Weave the Web | 1997 | VLDB | 9.2970809e-05 |
| 5,198 | Algebras for Querying Text Regions (Extended Abstract) | 1995 | PODS | 5.6346171e-05 |
| 3,845 | On Repairing Structural Problems In Semi-structured Data | 2013 | VLDB | 6.7073366e-05 |