An Architecture for Parallel Topic Models
Summary: Parallel topic-model sampling on a cluster of workstations that scales to hundreds of millions of documents and thousands of topics. State is shared through a distributed key-value store, which removes the need for separate synchronization phases, and disk, CPU, and network work proceed in parallel. The approach can be extended to n-grams and hierarchies.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 10118
- Venue: VLDB
- Year: 2010
- Pagerank: 0.0002728514
- Overall Rank: 328 | 97.73%
- DOI: -
[Figure: Incoming Non-self Citations Over Time]
Incoming Citations (Sorted by Pagerank)
Showing 19 of 19 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 37 | Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud | 2012 | VLDB | 0.0007522744 |
| 1,044 | DimmWitted: A Study of Main-Memory Statistical Analytics | 2014 | VLDB | 0.00014475229 |
| 1,158 | Simulation of Database-Valued Markov Chains Using SimSQL | 2013 | SIGMOD | 0.0001361064 |
| 1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691 |
| 2,033 | NOMAD: Non-locking, stOchastic Multi-machine algorithm for Asynchronous and Decentralized matrix completion | 2014 | VLDB | 9.7172731e-05 |
| 2,440 | FlexPS: Flexible Parallelism Control in Parameter Server Architecture | 2018 | VLDB | 8.8119143e-05 |
| 3,601 | Large-Scale Machine Learning at Twitter | 2012 | SIGMOD | 6.9315087e-05 |
| 4,020 | TopoX: Topology Refactorization for Efficient Graph Partitioning and Processing | 2019 | VLDB | 6.5237459e-05 |
| 4,077 | Towards High-Throughput Gibbs Sampling at Scale: A Study across Storage Managers | 2013 | SIGMOD | 6.4678697e-05 |
| 4,120 | Husky: Towards a More Efficient and Expressive Distributed Computing Framework | 2016 | VLDB | 6.4364588e-05 |
| 4,409 | Declarative Recursive Computation on an RDBMS | 2019 | VLDB | 6.2104034e-05 |
| 5,988 | NuPS: A Parameter Server for Machine Learning with Non-Uniform Parameter Access | 2022 | SIGMOD | 5.2430981e-05 |
| 6,471 | Dynamic Parameter Allocation in Parameter Servers | 2020 | VLDB | 5.0511668e-05 |
| 7,704 | ExDRa: Exploratory Data Science on Federated Raw Data | 2021 | SIGMOD | 4.6733838e-05 |
| 8,025 | Just Move It! Dynamic Parameter Allocation in Action | 2021 | VLDB | 4.6031105e-05 |
| 11,795 | LDA*: A Robust and Large-scale Topic Modeling System | 2017 | VLDB | 4.1945683e-05 |
| 11,797 | Runtime Optimization of Join Location in Parallel Data Management Systems | 2017 | VLDB | 4.1945683e-05 |
| 11,819 | Toward High-Performance Distributed Stream Processing via Approximate Fault Tolerance | 2017 | VLDB | 4.1945683e-05 |
| 11,834 | Topic Exploration in Spatio-Temporal Document Collections | 2016 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,954 | Scalable Topical Phrase Mining from Text Corpora | 2015 | VLDB | 4.1945683e-05 |
| 7,790 | Mining Tree-Structured Data on Multicore Systems | 2009 | VLDB | 4.650649e-05 |
| 13,251 | Scalable Community Detection via Parallel Correlation Clustering | 2021 | VLDB | - |
| 10,975 | Language-Model Based Informed Partition of Databases to Speed Up Pattern Mining | 2024 | SIGMOD | 4.1945683e-05 |
| 11,834 | Topic Exploration in Spatio-Temporal Document Collections | 2016 | SIGMOD | 4.1945683e-05 |
| 428 | Latent Semantic Indexing: A Probabilistic Analysis | 1998 | PODS | 0.00023512226 |
| 8,462 | Topology-aware Parallel Data Processing: Models, Algorithms and Systems at Scale | 2020 | CIDR | 4.5056381e-05 |
| 6,014 | WarpLDA: a Cache Efficient O(1) Algorithm for Latent Dirichlet Allocation | 2016 | VLDB | 5.2415551e-05 |
| 13,328 | Scalable Training of Hierarchical Topic Models | 2018 | VLDB | - |
| 11,795 | LDA*: A Robust and Large-scale Topic Modeling System | 2017 | VLDB | 4.1945683e-05 |