Statistical Learning Techniques for Costing XML Queries
Summary: Comet provides XML query cost modeling via new simple-path statistics and system catalogs. It uses transform regression, not analytic formulas, to predict XML operator costs, enabling self-tuning as workloads evolve; validated on XNav with synthetic, benchmark, and real data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ning Zhang
- 2. Peter J. Haas
- 3. Vanja Josifovski
- 4. Guy M. Lohman
- 5. Chun Zhang
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 806 | An End-to-End Learning-based Cost Estimator | 2020 | VLDB | 0.00016434274 |
| 884 | Plan-Structured Deep Neural Network Models for Query Performance Prediction | 2019 | VLDB | 0.00015654004 |
| 1,019 | Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques | 2012 | VLDB | 0.00014625603 |
| 3,580 | Query Performance Prediction for Concurrent Queries using Graph Embedding | 2020 | VLDB | 6.9500996e-05 |
| 4,152 | openGauss: An Autonomous Database System | 2021 | VLDB | 6.4060406e-05 |
| 5,850 | Active and Accelerated Learning of Cost Models for Optimizing Scientific Applications | 2006 | VLDB | 5.3009887e-05 |
| 7,296 | Multi-Tenant Cloud Data Services: State-of-the-Art, Challenges and Opportunities | 2022 | SIGMOD | 4.7723197e-05 |
| 12,291 | Visualizing the robustness of query execution | 2009 | CIDR | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103 |
| 182 | LEO - DB2's LEarning Optimizer | 2001 | VLDB | 0.00036962631 |
| 240 | Holistic Twig Joins: Optimal XML Pattern Matching | 2002 | SIGMOD | 0.00031603463 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |
| 1,046 | Estimating the Selectivity of XML Path Expressions for Internet Scale Applications | 2001 | VLDB | 0.00014462307 |
| 2,010 | StatiX: Making XML Count | 2002 | SIGMOD | 9.7970026e-05 |
| 3,419 | Approximate XML Query Answers | 2004 | SIGMOD | 7.1173416e-05 |
| 4,207 | Mixed Mode XML Query Processing | 2003 | VLDB | 6.359465e-05 |
| 5,025 | Automated Statistics Collection in DB2 UDB | 2004 | VLDB | 5.7533741e-05 |
| 5,632 | Bloom Histogram: Path Selectivity Estimation for XML Data with Updates | 2004 | VLDB | 5.4014372e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,363 | XML Query Optimization in the Presence of Side Effects | 2008 | SIGMOD | 4.1945683e-05 |
| 1,019 | Robust Estimation of Resource Consumption for SQL Queries using Statistical Techniques | 2012 | VLDB | 0.00014625603 |
| 2,316 | Statistical Synopses for Graph-Structured XML Databases | 2002 | SIGMOD | 9.0419716e-05 |
| 3,828 | Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction | 2022 | VLDB | 6.7208524e-05 |
| 12,362 | Relational-Style XML Query | 2008 | SIGMOD | 4.1945683e-05 |
| 7,727 | Semantic Query Optimization for XQuery over XML Streams | 2005 | VLDB | 4.6663256e-05 |
| 806 | An End-to-End Learning-based Cost Estimator | 2020 | VLDB | 0.00016434274 |
| 6,685 | How Good are Learned Cost Models, Really? Insights from Query Optimization Tasks | 2025 | SIGMOD | 4.9627485e-05 |
| 4,660 | XPathLearner: An On-Line Self-Tuning Markov Histogram for XML Path Selectivity Estimation | 2002 | VLDB | 6.014625e-05 |
| 501 | Query Optimization for XML | 1999 | VLDB | 0.00021530411 |