The SphereSearch Engine for Unified Ranked Retrieval of Heterogeneous XML and Web Documents
Summary: SphereSearch: unified ranked retrieval across heterogeneous XML and Web data. Supports vague structure and text criteria, plus concept- and link-aware ranking via IR statistics and ontologies; HTML/PDF pages convert to XML with semantic tagging, enabling rich queries on a richly tagged encyclopedia. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Jens Graupmann
- 2. Ralf Schenkel
- 3. Gerhard Weikum
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 301 | BLINKS: Ranked Keyword Searches on Graphs | 2007 | SIGMOD | 0.00028370644 |
| 2,012 | DB&IR: Both Sides Now (Extended Abstract) | 2007 | SIGMOD | 9.7951657e-05 |
| 2,125 | EASE: An Effective 3-in-1 Keyword Search Method for Unstructured, Semi-structured and Structured Data | 2008 | SIGMOD | 9.4893973e-05 |
| 2,183 | Keyword Search on External Memory Data Graphs | 2008 | VLDB | 9.3439219e-05 |
| 3,341 | Avatar Semantic Search: A Database Approach to Information Retrieval | 2006 | SIGMOD | 7.1972915e-05 |
| 5,385 | Indexing Dataspaces | 2007 | SIGMOD | 5.5381684e-05 |
| 6,135 | Extracting Logical Hierarchical Structure of HTML Documents Based on Headings | 2015 | VLDB | 5.1930114e-05 |
| 8,456 | iDM: A Unified and Versatile Data Model for Personal Dataspace Management | 2006 | VLDB | 4.5073797e-05 |
| 11,975 | Which Concepts Are Worth Extracting? | 2014 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,602 | Information Discovery in Loosely Integrated Data | 2007 | SIGMOD | - |
| 5,712 | Flexible and Efficient XML Search with Complex Full-Text Predicates | 2006 | SIGMOD | 5.3584486e-05 |
| 7,256 | Effective and Efficient Retrieval of Structured Entities | 2020 | VLDB | 4.7869419e-05 |
| 434 | XSEarch: A Semantic Search Engine for XML | 2003 | VLDB | 0.0002328559 |
| 73 | XRANK: Ranked Keyword Search over XML Documents | 2003 | SIGMOD | 0.00058443993 |
| 1,734 | Entity Search Engine: Towards Agile Best-Effort Information Integration over the Web | 2007 | CIDR | 0.00010723542 |
| 12,589 | COMPASS: A Concept-based Web Search Engine for HTML, XML, and Deep Web Data | 2004 | VLDB | 4.1945683e-05 |
| 1,140 | EntityRank: Searching Entities Directly and Holistically | 2007 | VLDB | 0.00013720706 |
| 1,192 | The XXL Search Engine: Ranked Retrieval of XML Data using Indexes and Ontologies | 2002 | SIGMOD | 0.00013432765 |
| 12,455 | The TopX DB&IR Engine | 2007 | SIGMOD | 4.1945683e-05 |