COMMIX: Towards Effective Web Information Extraction, Integration and Query Answering
Summary: Ontology-guided wrappers extract web pages into an XML DB, with ontology-to-XML-DTD data integration for cross-site information. An XML-QL extension (union, join, aggregates) supports multi-document integration, while view-based query answering and a QBE-style GUI expose semantic views to expert users. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Tengjiao Wang
- 2. Shiwei Tang
- 3. Dongqing Yang
- 4. Jun Gao
- 5. Yuqing Wu
- 6. Jian Pei
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 437 | W3QS: A Query System for the World-Wide Web | 1995 | VLDB | 0.00023240203 |
| 1,395 | Structured Querying of Web Text: A Technical Challenge | 2007 | CIDR | 0.00012207039 |
| 12,506 | AQAX: A System for Approximate XML Query Answers | 2006 | VLDB | 4.1945683e-05 |
| 12,558 | MIX: A Meta-data Indexing System for XML | 2005 | VLDB | 4.1945683e-05 |
| 2,694 | XML-Based Information Mediation with MIX | 1999 | SIGMOD | 8.2832701e-05 |
| 3,667 | Querying Structured Text in an XML Database | 2003 | SIGMOD | 6.8602249e-05 |
| 13,720 | QXtract: A Building Block for Efficient Information Extraction from Text Databases | 2003 | SIGMOD | - |
| 7,405 | The INFOMIX System for Advanced Integration of Incomplete and Inconsistent Data | 2005 | SIGMOD | 4.7378885e-05 |
| 3,931 | Extracting and Querying a Comprehensive Web Database | 2009 | CIDR | 6.6193836e-05 |
| 2,319 | Expressive and Flexible Access to Web-Extracted Data: A Keyword-based Structured Query Language | 2010 | SIGMOD | 9.0387108e-05 |