Optimizing Index for Taxonomy Keyword Search
Summary: Proposes an auxiliary index for taxonomy-driven keyword search, complementing the inverted index. Shows NP-hardness of index selection under space constraints; provides a DP pseudo-polynomial algorithm and a 1/4(1-1/e) approximation; experiments show ~10% extra index space substantially reduces query cost. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Bolin Ding
- 2. Haixun Wang
- 3. Ruoming Jin
- 4. Jiawei Han
- 5. Zhongyuan Wang
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,066 | Probase: A Probabilistic Taxonomy for Text Understanding | 2012 | SIGMOD | 0.0001433416 |
| 5,958 | Fine-grained Concept Linking using Neural Networks in Healthcare | 2018 | SIGMOD | 5.2563968e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,066 | Probase: A Probabilistic Taxonomy for Text Understanding | 2012 | SIGMOD | 0.0001433416 |
| 2,464 | Fast Set Intersection in Memory | 2011 | VLDB | 8.7524354e-05 |
| 3,074 | On the Complexity of the View-Selection Problem | 1999 | PODS | 7.6110034e-05 |
| 5,682 | Optimal Indexing Using Near-Minimal Space [Extended Abstract] | 2003 | PODS | 5.372736e-05 |
| 12,194 | Web Scale Taxonomy Cleansing | 2011 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,838 | Finding Patterns in a Knowledge Base using Keywords to Compose Table Answers | 2014 | VLDB | 5.8887949e-05 |
| 8,029 | Understanding Queries in a Search Database System | 2010 | PODS | 4.6028544e-05 |
| 1,517 | Incremental Updates of Inverted Lists for Text Document Retrieval | 1994 | SIGMOD | 0.00011578859 |
| 5,615 | A Scalable Index for Top-k Subtree Similarity Queries | 2019 | SIGMOD | 5.4101086e-05 |
| 11,595 | Minimization of Classifier Construction Cost for Search Queries | 2020 | SIGMOD | 4.1945683e-05 |
| 8,919 | Efficient Indexes for Diverse Top-k Range Queries | 2020 | PODS | 4.427232e-05 |
| 1,073 | Finding and Approximating Top-k Answers in Keyword Proximity Search | 2006 | PODS | 0.00014264992 |
| 5,682 | Optimal Indexing Using Near-Minimal Space [Extended Abstract] | 2003 | PODS | 5.372736e-05 |
| 9,322 | Indexing for Keyword Search with Structured Constraints | 2023 | PODS | 4.3556432e-05 |
| 12,387 | Relaxation in Text Search using Taxonomies | 2008 | VLDB | 4.1945683e-05 |