TreeScope: Finding Structural Anomalies In Semi-Structured Data
Summary: TreeScope identifies structural anomalies in semi-structured data (XML/JSON) by learning robust structural models with high support. It then concisely summarizes detected structural errors and offers plausible explanations, enabling interactive exploration in a data-quality demo. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Shanshan Ying
- 2. Barna Saha
- 3. Flip Korn
- 4. Divesh Srivastava
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 992 | XTRACT: A System for Extracting Document Type Descriptors from XML Documents | 2000 | SIGMOD | 0.00014799689 |
| 3,138 | Inference of Concise DTDs from XML Data | 2006 | VLDB | 7.4876241e-05 |
| 3,545 | DBLP — Some Lessons Learned | 2009 | VLDB | 6.989355e-05 |
| 7,510 | MESSIAH: Missing Element-Conscious SLCA Nodes Search in XML Data | 2013 | SIGMOD | 4.7180617e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,092 | Structured Annotations of Web Queries | 2010 | SIGMOD | 6.4561959e-05 |
| 7,350 | STEED: An Analytical Database System for TrEE-structured Data | 2017 | VLDB | 4.754748e-05 |
| 7,256 | Effective and Efficient Retrieval of Structured Entities | 2020 | VLDB | 4.7869419e-05 |
| 3,845 | On Repairing Structural Problems In Semi-structured Data | 2013 | VLDB | 6.7073366e-05 |
| 7,571 | Reducing Ambiguity in Json Schema Discovery | 2021 | SIGMOD | 4.7075853e-05 |
| 3,419 | Approximate XML Query Answers | 2004 | SIGMOD | 7.1173416e-05 |
| 11,874 | Graph-based Exploration of Non-graph Datasets | 2016 | VLDB | 4.1945683e-05 |
| 9,750 | ReCG: Bottom-Up JSON Schema Discovery Using a Repetitive Cluster-and-Generalize Framework | 2024 | VLDB | 4.2897489e-05 |
| 6,241 | Scaling Similarity Joins over Tree-Structured Data | 2015 | VLDB | 5.1411469e-05 |
| 12,372 | SchemaScope: a System for Inferring and Cleaning XML Schemas | 2008 | SIGMOD | 4.1945683e-05 |