Towards High-Throughput Gibbs Sampling at Scale: A Study across Storage Managers
Summary: Investigates high-throughput Gibbs sampling for large factor graphs, showing materialization, page layout, and buffer choices must be redesigned for beyond-memory workloads. On HBase and Unix-file backends, a simple prototype achieves competitive throughput for graphs larger than memory, outperforming baselines by up to 100x. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ce Zhang
- 2. Christopher RĂ©
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 667 | Incremental Knowledge Base Construction Using DeepDive | 2015 | VLDB | 0.00018440557 |
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 3,081 | Knowledge Expansion over Probabilistic Knowledge Bases | 2014 | SIGMOD | 7.6031501e-05 |
| 4,106 | Extracting Databases from Dark Data with DeepDive | 2016 | SIGMOD | 6.4456184e-05 |
| 4,164 | SlimShot: In-Database Probabilistic Inference for Knowledge Bases | 2016 | VLDB | 6.3923099e-05 |
| 6,169 | Approximate Lifted Inference with Probabilistic Databases | 2015 | VLDB | 5.1716068e-05 |
| 6,722 | GeoDeepDive: Statistical Inference using Familiar Data-Processing Languages | 2013 | SIGMOD | 4.9491521e-05 |
| 7,739 | Symmetric Weighted First-Order Model Counting | 2015 | PODS | 4.663547e-05 |
| 11,718 | A Demonstration of Sya: A Spatial Probabilistic Knowledge Base Construction System | 2018 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,350 | On Biased Reservoir Sampling in the Presence of Stream Evolution | 2006 | VLDB | 6.2645054e-05 |
| 4,100 | A Bi-Level Bernoulli Scheme for Database Sampling | 2004 | SIGMOD | 6.4531387e-05 |
| 4,694 | Scalable Reservoir Sampling on Many-Core CPUs | 2019 | SIGMOD | 5.9944898e-05 |
| 8,959 | Reservoir Sampling over Joins | 2024 | SIGMOD | 4.4206222e-05 |
| 2,186 | Scalable Probabilistic Databases with Factor Graphs and MCMC | 2010 | VLDB | 9.3378109e-05 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |
| 12,166 | Get the Most out of Your Sample: Optimal Unbiased Estimators using Partial Information | 2011 | PODS | 4.1945683e-05 |
| 10,847 | Sampling-based Predictive Database Buffer Management | 2025 | VLDB | 4.1945683e-05 |
| 7,771 | Modeling High-Dimensional Index Structures using Sampling | 2001 | SIGMOD | 4.6560482e-05 |
| 184 | New Sampling-Based Summary Statistics for Improving Approximate Query Answers | 1998 | SIGMOD | 0.00036625711 |