Inferring XML Schema Definitions from XML Data
Summary: Infers XML Schema Definitions (XSDs) from XML data, addressing context-sensitive content models beyond DTDs. Proposes a practically relevant XSD subclass with a theoretically complete algorithm that recovers the true schema from a large corpus, plus a robust variant for incomplete data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Geert Jan Bex
- 2. Frank Neven
- 3. Stijn Vansummeren
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 809 | Curated Databases | 2008 | PODS | 0.00016430384 |
| 3,631 | On-the-Fly Entity-Aware Query Processing in the Presence of Linkage | 2010 | VLDB | 6.9014378e-05 |
| 3,845 | On Repairing Structural Problems In Semi-structured Data | 2013 | VLDB | 6.7073366e-05 |
| 6,894 | TableDC: Deep Clustering for Tabular Data | 2025 | SIGMOD | 4.8925595e-05 |
| 7,571 | Reducing Ambiguity in Json Schema Discovery | 2021 | SIGMOD | 4.7075853e-05 |
| 8,255 | Discovering XSD Keys from XML Data | 2013 | SIGMOD | 4.5491362e-05 |
| 12,217 | Simplifying XML Schema: Single-Type Approximations of Regular Tree Languages | 2010 | PODS | 4.1945683e-05 |
| 12,306 | Simplifying XML Schema: Effortless Handling of Nondeterministic Regular Expressions | 2009 | SIGMOD | 4.1945683e-05 |
| 12,372 | SchemaScope: a System for Inferring and Cleaning XML Schemas | 2008 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,215 | Generating XML Structure Using Examples and Constraints | 2008 | VLDB | 6.3527334e-05 |
| 12,217 | Simplifying XML Schema: Single-Type Approximations of Regular Tree Languages | 2010 | PODS | 4.1945683e-05 |
| 12,306 | Simplifying XML Schema: Effortless Handling of Nondeterministic Regular Expressions | 2009 | SIGMOD | 4.1945683e-05 |
| 8,190 | XML Schema Mappings | 2009 | PODS | 4.5641911e-05 |
| 6,652 | Information Preserving XML Schema Embedding | 2005 | VLDB | 4.9761854e-05 |
| 3,138 | Inference of Concise DTDs from XML Data | 2006 | VLDB | 7.4876241e-05 |
| 8,255 | Discovering XSD Keys from XML Data | 2013 | SIGMOD | 4.5491362e-05 |
| 12,372 | SchemaScope: a System for Inferring and Cleaning XML Schemas | 2008 | SIGMOD | 4.1945683e-05 |
| 882 | DTD Inference for Views of XML Data | 2000 | PODS | 0.00015657456 |
| 992 | XTRACT: A System for Extracting Document Type Descriptors from XML Documents | 2000 | SIGMOD | 0.00014799689 |