Database Paper Browser

Visual Web Information Extraction with Lixto

Summary: Lixto is a system for visual, interactive wrapper generation that semi-automatically translates relevant parts of HTML pages into XML. Internally, extraction is driven by Elog, a declarative language, but users need not know Elog or HTML. Lixto can also produce an "XML-Companion" for an HTML page with changing content, holding a continually updated XML translation of the relevant information. (summarized by gpt-5-nano on Feb 09 2026)
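
To make the notion of a wrapper concrete, the following is a minimal hand-written sketch of HTML-to-XML translation using only the Python standard library. It is not Lixto and does not use Elog; the class name `PriceWrapper`, the function `html_to_xml`, the `title`/`price` CSS classes, and the `items`/`item` output elements are all hypothetical choices made for illustration. A system like Lixto would generate the extraction rules from visual user interaction rather than have them coded by hand.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class PriceWrapper(HTMLParser):
    """Hypothetical wrapper: collects the text of <td> cells whose class
    is "title" or "price", grouped by table row."""

    FIELDS = {"title", "price"}

    def __init__(self):
        super().__init__()
        self.records = []   # one dict per <tr> seen so far
        self._field = None  # name of the field currently being read

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.records.append({})
        elif tag == "td":
            css_class = dict(attrs).get("class")
            if css_class in self.FIELDS and self.records:
                self._field = css_class

    def handle_endtag(self, tag):
        if tag == "td":
            self._field = None

    def handle_data(self, data):
        if self._field is not None:
            record = self.records[-1]
            record[self._field] = record.get(self._field, "") + data.strip()


def html_to_xml(html: str) -> str:
    """Run the wrapper over an HTML string and serialize the result as XML."""
    wrapper = PriceWrapper()
    wrapper.feed(html)
    root = ET.Element("items")
    for record in wrapper.records:
        if not record:
            continue  # skip rows that yielded no extracted field
        item = ET.SubElement(root, "item")
        for name, value in record.items():
            ET.SubElement(item, name).text = value
    return ET.tostring(root, encoding="unicode")
```

Calling `html_to_xml` on a table row with a `title` cell and a `price` cell yields one `<item>` element with matching children; rows without those cells are ignored. The rules here are tied to one page layout, which is the maintenance burden that supervised wrapper generation aims to reduce.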

Paper ID: 8736
Venue: VLDB
Year: 2001
Pagerank: 8.2753317e-05
Overall Rank: 2,698 | 81.24%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 19 of 19 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
1,095 | The Lixto Data Extraction Project - Back and Forth between Theory and Practice | 2004 | PODS | 0.00014126427
1,221 | A Web of Concepts | 2009 | PODS | 0.00013219242
1,370 | Monadic Datalog and the Expressive Power of Languages for Web Information Extraction | 2002 | PODS | 0.00012338027
1,663 | Conjunctive Queries over Trees | 2004 | PODS | 0.00010977096
2,947 | Interactive Data Integration through Smart Copy & Paste | 2009 | CIDR | 7.834316e-05
3,117 | Processing Queries on Tree-Structured Data Efficiently | 2006 | PODS | 7.5407318e-05
4,440 | Robust Web Extraction: An Approach Based on a Probabilistic Tree-Edit Model | 2009 | SIGMOD | 6.187819e-05
5,609 | Documentum ECI Self-Repairing Wrappers: Performance Analysis | 2006 | SIGMOD | 5.4129892e-05
5,652 | From Information to Knowledge: Harvesting Entities and Relationships from Web Sources | 2010 | PODS | 5.3903671e-05
5,705 | Datalog Unchained | 2021 | PODS | 5.3621239e-05
6,133 | DIADEM: Thousands of Websites to a Single Database | 2014 | VLDB | 5.1954702e-05
7,405 | The INFOMIX System for Advanced Integration of Incomplete and Inconsistent Data | 2005 | SIGMOD | 4.7378885e-05
7,746 | Scalable Web Data Extraction for Online Market Intelligence | 2009 | VLDB | 4.6617126e-05
8,603 | OXPath: A Language for Scalable, Memory-efficient Data Extraction from Web Applications | 2011 | VLDB | 4.4866461e-05
8,632 | Measuring the Structural Similarity of Semistructured Documents Using Entropy | 2007 | VLDB | 4.4803734e-05
9,026 | Robust and Noise Resistant Wrapper Induction | 2016 | SIGMOD | 4.4051668e-05
9,717 | Supervised Wrapper Generation with Lixto | 2001 | VLDB | 4.299267e-05
12,525 | Automatic Extraction of Dynamic Record Sections From Search Engine Result Pages | 2006 | VLDB | 4.1945683e-05
12,589 | COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data | 2004 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers