OXPath: A Language for Scalable, Memory-efficient Data Extraction from Web Applications
Summary: OXPath extends XPath to interact with rich web apps and precisely capture relevant data for scalable, embedded web extraction. Page-at-a-time evaluation yields memory usage independent of visited pages (polynomial time), with rendering cost dominating; it outperforms existing tools and is open source. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Tim Furche
- 2. Georg Gottlob
- 3. Giovanni Grasso
- 4. Christian Schallhart
- 5. Andrew Sellers
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,880 | High-Performance Complex Event Processing over XML Streams | 2012 | SIGMOD | 5.8573822e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 287 | Declarative Information Extraction Using Datalog with Embedded Extraction Predicates | 2007 | VLDB | 0.00028971272 |
| 1,132 | Building light-weight wrappers for legacy Web data-sources using W4F | 1999 | VLDB | 0.00013777657 |
| 2,698 | Visual Web Information Extraction with Lixto* | 2001 | VLDB | 8.2753317e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,931 | Efficient Evaluation of XQuery over Streaming Data | 2005 | VLDB | 5.8207617e-05 |
| 3,084 | On the minimization of Xpath queries | 2003 | VLDB | 7.6011919e-05 |
| 869 | APEX: An Adaptive Path Index for XML Data | 2002 | SIGMOD | 0.00015788339 |
| 4,880 | High-Performance Complex Event Processing over XML Streams | 2012 | SIGMOD | 5.8573822e-05 |
| 987 | XPath Queries on Streaming Data | 2003 | SIGMOD | 0.00014819204 |
| 2,977 | A Framework for Using Materialized XPath Views in XML Query Processing | 2004 | VLDB | 7.7876083e-05 |
| 3,695 | On the Memory Requirements of XPath Evaluation over XML Streams | 2004 | PODS | 6.8345021e-05 |
| 4,660 | XPathLearner: An On-Line Self-Tuning Markov Histogram for XML Path Selectivity Estimation | 2002 | VLDB | 6.014625e-05 |
| 713 | Efficient Algorithms for Processing XPath Queries | 2002 | VLDB | 0.00017731096 |
| 7,681 | SXPath - Extending XPath towards Spatial Querying on Web Documents | 2011 | VLDB | 4.6804276e-05 |