Scalable Training of Hierarchical Topic Models
Summary: This paper scales hierarchical latent Dirichlet allocation (hLDA) using partially collapsed Gibbs sampling and a tree initialization strategy that mitigates local optima in hierarchical topic models. Vectorized data layouts and distributed dynamic matrices and trees yield an 87x speedup over prior hLDA and allow the method to scale to many cores. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Jianfei Chen
2. Jun Zhu
3. Jie Lu
4. Shixia Liu
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,014 | WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation | 2016 | VLDB | 5.2415551e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,208 | Scalable Clustering Over High Dimensional Vector Streams | 2026 | SIGMOD | 4.1945683e-05 |
| 2,325 | Building Hierarchical Classifiers Using Class Proximity | 1999 | VLDB | 9.0304462e-05 |
| 5,379 | Scalable Ad-hoc Entity Extraction from Text Collections | 2008 | VLDB | 5.5405989e-05 |
| 3,645 | Large-Scale Collective Entity Matching | 2011 | VLDB | 6.8853274e-05 |
| 5,052 | HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training | 2022 | SIGMOD | 5.7337977e-05 |
| 11,834 | Topic Exploration in Spatio-Temporal Document Collections | 2016 | SIGMOD | 4.1945683e-05 |
| 11,954 | Scalable Topical Phrase Mining from Text Corpora | 2015 | VLDB | 4.1945683e-05 |
| 6,014 | WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation | 2016 | VLDB | 5.2415551e-05 |
| 328 | An Architecture for Parallel Topic Models | 2010 | VLDB | 0.0002728514 |
| 11,795 | LDA*: A Robust and Large-scale Topic Modeling System | 2017 | VLDB | 4.1945683e-05 |