Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction
Summary: Vertica integrates with Distributed R to cut transfers and preserve locality across table segments. In-database R model management and fast deployment enable scalable predictive analytics on large data, beating ODBC and rivaling in-memory engines. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Shreya Prasad
- 2. Jeff LeFevre
- 3. Arash Fard
- 4. Vincent Xu
- 5. Vishrut Gupta
- 6. Meichun Hsu
- 7. Jorge Martinez
- 8. Indrajit Roy
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691 |
| 3,254 | Query Processing on Tensor Computation Runtimes | 2022 | VLDB | 7.3161051e-05 |
| 6,784 | SparkR: Scaling R Programs with Spark | 2016 | SIGMOD | 4.9265155e-05 |
| 7,032 | Building the Enterprise Fabric for Big Data with Vertica and Spark Integration | 2016 | SIGMOD | 4.8559744e-05 |
| 11,859 | dmapply: A functional primitive to express distributed machine learning algorithms in R | 2016 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4 | Pregel: A System for Large-Scale Graph Processing | 2010 | SIGMOD | 0.0019005923 |
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 310 | The Vertica Analytic Database: C-Store 7 Years Later | 2012 | VLDB | 0.00028132402 |
| 413 | HaLoop: Efficient Iterative Data Processing on Large Clusters | 2010 | VLDB | 0.00023904409 |
| 1,495 | Ricardo: Integrating R and Hadoop | 2010 | SIGMOD | 0.00011691049 |
| 1,876 | ArrayStore: A Storage Manager for Complex Parallel Array Processing | 2011 | SIGMOD | 0.00010239284 |
| 5,964 | Bridging Two Worlds with RICE: Integrating R into the SAP In-Memory Computing Engine | 2011 | VLDB | 5.2520617e-05 |
Previous
Page 1 / 1
Next