Database Paper Browser

DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases

Summary: DataGuides are concise structural summaries of semistructured databases that serve as dynamically generated schemas. They support query formulation, store statistics and sample values, and guide query optimization. They are implemented in Lore with incremental maintenance, where they are used for browsing and query planning. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 8444
Venue: VLDB
Year: 1997
Pagerank: 0.00064329285
Overall Rank: 61 | 99.58%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 83 citing papers.

Rank Citing Paper Year Venue Pagerank
203 Graph Indexing: A Frequent Structure-based Approach 2004 SIGMOD 0.00034889335
350 FG-Index: Towards Verification-Free Query Processing on Graph Databases 2007 SIGMOD 0.00026365067
391 Indexing and Querying XML Data for Regular Path Expressions 2001 VLDB 0.00024564567
415 A Fast Index for Semistructured Data 2001 VLDB 0.00023814619
491 Your Mediators Need Data Conversion! 1998 SIGMOD 0.00022011503
501 Query Optimization for XML 1999 VLDB 0.00021530411
817 Covering Indexes for Branching Path Queries 2002 SIGMOD 0.00016352717
869 APEX: An Adaptive Path Index for XML Data 2002 SIGMOD 0.00015788339
882 DTD Inference for Views of XML Data 2000 PODS 0.00015657456
926 XMill: an Efficient Compressor for XML Data 2000 SIGMOD 0.00015251799
992 XTRACT: A System for Extracting Document Type Descriptors from XML Documents 2000 SIGMOD 0.00014799689
993 D(K)-Index: An Adaptive Structural Summary for Graph-Structured Data 2003 SIGMOD 0.00014765816
1,027 Accelerating XPath Location Steps 2002 SIGMOD 0.0001458865
1,046 Estimating the Selectivity of XML Path Expressions for Internet Scale Applications 2001 VLDB 0.00014462307
1,163 Extracting Schema from Semistructured Data 1998 SIGMOD 0.00013577466
1,314 Semistructured Data 1997 PODS 0.0001263326
1,639 Incremental Maintenance for Materialized Views over Semistructured Data 1998 VLDB 0.00011048834
1,662 Querying Network Directories 1999 SIGMOD 0.00010977682
1,815 Indexing XML Data Stored in a Relational Database 2004 VLDB 0.00010455025
1,897 Type Inference for Queries on Semistructured Data (Extended Abstract) 1999 PODS 0.00010178006
2,161 On the Integration of Structure Indexes and Inverted Lists 2004 SIGMOD 9.4002771e-05
2,168 ViST: A Dynamic Index Method for Querying XML Data by Tree Structures 2003 SIGMOD 9.3848723e-05
2,173 Querying Data Provenance 2010 SIGMOD 9.3676609e-05
2,399 Query Rewriting for Semistructured Data 1999 SIGMOD 8.8973689e-05
2,438 Towards Graph Containment Search and Indexing 2007 VLDB 8.8214248e-05
2,507 Path Queries on Compressed XML 2003 VLDB 8.6311009e-05
2,864 Inferring XML Schema Definitions from XML Data 2007 VLDB 7.9863574e-05
2,977 A Framework for Using Materialized XPath Views in XML Query Processing 2004 VLDB 7.7876083e-05
3,113 Structure and Value Synopses for XML Data Graphs 2002 VLDB 7.5469926e-05
3,138 Inference of Concise DTDs from XML Data 2006 VLDB 7.4876241e-05
3,466 Updates for Structure Indexes 2002 VLDB 7.0695018e-05
3,681 Queries with Incomplete Answers over Semistructured Data 1999 PODS 6.8492288e-05
4,010 A Web Odyssey: from Codd to XML 2001 PODS 6.5351699e-05
4,508 iTrails: Pay-as-you-go Information Integration in Dataspaces 2007 VLDB 6.1298098e-05
4,587 On Boosting Holism in XML Twig Pattern Matching Using Structural Indexing Techniques 2005 SIGMOD 6.0658154e-05
5,139 Lazy Query Evaluation for Active XML 2004 SIGMOD 5.6686638e-05
5,385 Indexing Dataspaces 2007 SIGMOD 5.5381684e-05
5,574 Efficient Processing of XML Twig Queries with OR-Predicates 2004 SIGMOD 5.4268403e-05
5,663 Incremental Maintenance of XML Structural Indexes 2004 SIGMOD 5.3832923e-05
5,776 Capturing Topology in Graph Pattern Matching 2012 VLDB 5.3309758e-05
5,820 Efficient Processing of XML Path Queries Using the Disk-based F&B Index 2005 VLDB 5.3135144e-05
5,933 Capturing and Querying Multiple Aspects of Semistructured Data 1999 VLDB 5.2668503e-05
6,231 An LSM-based Tuple Compaction Framework for Apache AsterixDB 2020 VLDB 5.1457863e-05
6,872 XQuery Optimization 2003 VLDB 4.8991822e-05
6,874 ROX: Run-time Optimization of XQueries 2009 SIGMOD 4.8978984e-05
6,954 Indexing Temporal XML Documents 2004 VLDB 4.8864906e-05
7,007 Closing the functional and Performance Gap between SQL and NoSQL 2016 SIGMOD 4.8653116e-05
7,238 A Crash Course on Database Queries 2007 PODS 4.7928464e-05
7,240 Sum-Max Monotonic Ranked Joins for Evaluating Top-K Twig Queries on Weighted Data Graphs 2007 VLDB 4.792172e-05
7,298 Structured Materialized Views for XML Queries 2007 VLDB 4.770411e-05

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
114 A Query Language and Optimization Techniques for Unstructured Data 1996 SIGMOD 0.00046339735
414 Timber: A Sophisticated Relation Browser 1982 VLDB 0.0002388204
437 W3QS: A Query System for the World-Wide Web 1995 VLDB 0.00023240203
1,535 PESTO: An Integrated Query/Browser for Object Databases 1996 VLDB 0.00011467414
1,724 OdeView: The Graphical Interface to Ode 1990 SIGMOD 0.00010750441
2,081 On Index Selection Schemes for Nested Object Hierarchies 1994 VLDB 9.5870732e-05

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
2,676 LORE: A Lightweight Object REpository for Semistructured Data 1996 SIGMOD 8.3274001e-05
14,283 Browsing in a Loosely Structured Database 1984 SIGMOD -
1,271 Schema Summarization 2006 VLDB 0.00012923966
1,163 Extracting Schema from Semistructured Data 1998 SIGMOD 0.00013577466
1,639 Incremental Maintenance for Materialized Views over Semistructured Data 1998 VLDB 0.00011048834
1,065 Data-Driven Understanding and Refinement of Schema Mappings 2001 SIGMOD 0.00014338146
723 GUIDE: Graphical User Interface for Database Exploration 1982 VLDB 0.00017542556
4,038 Querying Complex Structured Databases 2007 VLDB 6.5082212e-05
501 Query Optimization for XML 1999 VLDB 0.00021530411
1,314 Semistructured Data 1997 PODS 0.0001263326