Database Paper Browser

Back to papers

Scalable Web Data Extraction for Online Market Intelligence

Summary: Cloud-enabled, scalable web data extraction for online market intelligence; parameterized navigation with on-the-fly deduplication. Orchestrates streams into a data warehouse for analytics; Lixto-based cloud deployment with a computers/electronics case. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
9834
Venue
VLDB
Year
2009
Pagerank
4.6617126e-05
Overall Rank
7,746 | 46.12%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank Citing Paper Year Venue Pagerank
7,681 SXPath - Extending XPath towards Spatial Querying on Web Documents 2011 VLDB 4.6804276e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 2 of 2 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,370 Monadic Datalog and the Expressive Power of Languages for Web Information Extraction 2002 PODS 0.00012338027
2,698 Visual Web Information Extraction with Lixto* 2001 VLDB 8.2753317e-05
Previous Page 1 / 1 Next

Semantically Similar Papers