A Scalable Index for Top-k Subtree Similarity Queries
Summary: An index built on inverted lists for scalable top-k subtree similarity queries. It processes subtrees first, supports incremental updates, and requires linear space. The index needs no tuning, is agnostic to the data type, and outperforms state-of-the-art indexes in both runtime and memory by up to four orders of magnitude. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,259 | AS-Parser: Log Parsing Based on Adaptive Segmentation | 2023 | SIGMOD | 7.3147783e-05 |
| 8,511 | JEDI: These aren't the JSON documents you're looking for... | 2022 | SIGMOD | 4.495029e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7 | Optimal Aggregation Algorithms for Middleware [Extended Abstract] | 2001 | PODS | 0.0015496097 |
| 1,027 | Accelerating XPath Location Steps | 2002 | SIGMOD | 0.0001458865 |
| 1,808 | Top-k Query Evaluation with Probabilistic Guarantees | 2004 | VLDB | 0.00010486213 |
| 2,161 | On the Integration of Structure Indexes and Inverted Lists | 2004 | SIGMOD | 9.4002771e-05 |
| 3,199 | Similarity Evaluation on Tree-structured Data | 2005 | SIGMOD | 7.3927291e-05 |
| 4,186 | Best Position Algorithms for Top-k Queries | 2007 | VLDB | 6.3764858e-05 |
| 4,406 | Approximate Matching of Hierarchical Data Using pq-Grams | 2005 | VLDB | 6.2141638e-05 |
| 6,241 | Scaling Similarity Joins over Tree-Structured Data | 2015 | VLDB | 5.1411469e-05 |
| 6,807 | Indexing for Subtree Similarity-Search using Edit Distance | 2013 | SIGMOD | 4.9217776e-05 |
| 7,815 | DeltaNI: An Efficient Labeling Scheme for Versioned Hierarchical Data | 2013 | SIGMOD | 4.6438721e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,283 | Adaptive Indexing in High-Dimensional Metric Spaces | 2023 | VLDB | 4.3631652e-05 |
| 6,732 | An Incrementally Maintainable Index for Approximate Lookups in Hierarchical Data | 2006 | VLDB | 4.9477058e-05 |
| 3,609 | Similarity search in the blink of an eye with compressed indices | 2023 | VLDB | 6.9215236e-05 |
| 8,505 | Top-K Nearest Keyword Search on Large Graphs | 2013 | VLDB | 4.4958064e-05 |
| 7,109 | Efficient Similarity Join and Search on Multi-Attribute Data | 2015 | SIGMOD | 4.8292998e-05 |
| 7,708 | Efficient Top-k Algorithms for Approximate Substring Matching | 2013 | SIGMOD | 4.6721808e-05 |
| 6,919 | Efficient Indexing and Querying over Syntactically Annotated Trees | 2012 | VLDB | 4.8925595e-05 |
| 6,241 | Scaling Similarity Joins over Tree-Structured Data | 2015 | VLDB | 5.1411469e-05 |
| 6,807 | Indexing for Subtree Similarity-Search using Edit Distance | 2013 | SIGMOD | 4.9217776e-05 |
| 3,199 | Similarity Evaluation on Tree-structured Data | 2005 | SIGMOD | 7.3927291e-05 |