Vertica-ML: Distributed Machine Learning in Vertica Database
Summary: Vertica-ML is a distributed ML subsystem embedded in the Vertica database, exposing a SQL-based data science workflow and model management. Models are first-class objects (like tables/views) for versioning and governance; the paper details architecture and performance experiments. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Arash Fard
- 2. Anh Le
- 3. George Larionov
- 4. Waqas Dhillon
- 5. Chuck Bear
Incoming Citations (Sorted by Pagerank)
Showing 14 of 14 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 310 | The Vertica Analytic Database: C-Store 7 Years Later | 2012 | VLDB | 0.00028132402 |
| 333 | Neo: A Learned Query Optimizer | 2019 | VLDB | 0.00027206884 |
| 2,093 | Scalable K-Means++ | 2012 | VLDB | 9.5588104e-05 |
| 2,630 | PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce | 2009 | VLDB | 8.4128091e-05 |
| 3,875 | Cloudy with High Chance of DBMS: A 10-year Prediction for Enterprise-Grade ML | 2020 | CIDR | 6.675257e-05 |
| 3,881 | Machine Learning and Databases: The Sound of Things to Come or a Cacophony of Hype? | 2015 | SIGMOD | 6.6691196e-05 |
| 7,032 | Building the Enterprise Fabric for Big Data with Vertica and Spark Integration | 2016 | SIGMOD | 4.8559744e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,775 | A Unified Transferable Model for ML-Enhanced DBMS | 2022 | CIDR | 4.9299192e-05 |
| 3,099 | DB4ML – An In-Memory Database Kernel with Machine Learning Support | 2020 | SIGMOD | 7.5642871e-05 |
| 543 | MLbase: A Distributed Machine-learning System | 2013 | CIDR | 0.00020526854 |
| 12,061 | Designing Query Optimizers for Big Data Problems of The Future | 2013 | VLDB | 4.1945683e-05 |
| 4,557 | Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches | 2021 | VLDB | 6.087611e-05 |
| 6,811 | In-database Distributed Machine Learning: Demonstration using Teradata SQL Engine | 2019 | VLDB | 4.9200998e-05 |
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |
| 4,906 | Machine Learning for Big Data | 2013 | SIGMOD | 5.8389053e-05 |
| 7,032 | Building the Enterprise Fabric for Big Data with Vertica and Spark Integration | 2016 | SIGMOD | 4.8559744e-05 |
| 5,395 | Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction | 2015 | SIGMOD | 5.5318806e-05 |