Database Paper Browser

Back to papers

The MADlib Analytics Library or MAD Skills, the SQL

Summary: MADlib: open in-database analytics library for SQL-based ML, data mining, and statistics; scalable inside a DBMS, no data export. Architecture, patterns; Greenplum performance; CRAN-like, community-driven repository for scalable statistics across DBMS. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10408
Venue
VLDB
Year
2012
Pagerank
0.00042270404
Overall Rank
140 | 99.03%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 103 citing papers.

Rank Citing Paper Year Venue Pagerank
543 MLbase: A Distributed Machine-learning System 2013 CIDR 0.00020526854
557 SystemML: Declarative Machine Learning on Spark 2016 VLDB 0.00020197988
683 Cerebro: A Data System for Optimized Deep Learning Model Selection 2020 VLDB 0.00018195476
761 Materialization Optimizations for Feature Selection Workloads 2014 SIGMOD 0.00017053783
834 Learning Linear Regression Models over Factorized Joins 2016 SIGMOD 0.00016135159
903 To Join or Not to Join? Thinking Twice about Joins before Feature Selection 2016 SIGMOD 0.0001547016
1,044 DimmWitted: A Study of Main-Memory Statistical Analytics 2014 VLDB 0.00014475229
1,167 Learning Generalized Linear Models Over Normalized Data 2015 SIGMOD 0.00013547713
1,279 Towards Linear Algebra over Normalized Data 2017 VLDB 0.00012868394
1,391 Ease.ml: Towards Multi-tenant Resource Sharing for Machine Learning Workloads 2018 VLDB 0.0001223506
1,532 Data Management in Machine Learning: Challenges, Techniques, and Systems 2017 SIGMOD 0.00011472681
1,750 Weld: A Common Runtime for High Performance Data Analytics 2017 CIDR 0.00010683647
1,891 Towards Model-based Pricing for Machine Learning in a Data Marketplace 2019 SIGMOD 0.00010194092
1,942 Heterogeneity-aware Distributed Parameter Servers 2017 SIGMOD 0.00010012691
2,122 SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle 2020 CIDR 9.4989076e-05
2,170 tf.data: A Machine Learning Data Processing Framework 2021 VLDB 9.3821603e-05
2,251 Vizdom: Interactive Analytics through Pen and Touch 2015 VLDB 9.1986441e-05
2,255 LINVIEW: Incremental View Maintenance for Complex Analytical Queries 2014 SIGMOD 9.1884983e-05
2,456 Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities 2021 SIGMOD 8.7733773e-05
2,623 GenBase: A Complex Analytics Genomics Benchmark 2014 SIGMOD 8.4374366e-05
2,642 Vertica-ML: Distributed Machine Learning in Vertica Database 2020 SIGMOD 8.3851878e-05
2,667 Cumulon: Optimizing Statistical Data Analysis in the Cloud 2013 SIGMOD 8.3413995e-05
2,753 Complaint-driven Training Data Debugging for Query 2.0 2020 SIGMOD 8.1724339e-05
2,804 Extending Relational Query Processing with ML Inference 2020 CIDR 8.0935487e-05
3,006 On Functional Aggregate Queries with Additive Inequalities 2019 PODS 7.7299363e-05
3,066 HAWQ: A Massively Parallel Processing SQL Engine in Hadoop 2014 SIGMOD 7.6221974e-05
3,081 Knowledge Expansion over Probabilistic Knowledge Bases 2014 SIGMOD 7.6031501e-05
3,099 DB4ML – An In-Memory Database Kernel with Machine Learning Support 2020 SIGMOD 7.5642871e-05
3,254 Query Processing on Tensor Computation Runtimes 2022 VLDB 7.3161051e-05
3,277 A Layered Aggregate Engine for Analytics Workloads 2019 SIGMOD 7.2871625e-05
3,345 QuickFOIL: Scalable Inductive Logic Programming 2015 VLDB 7.1958815e-05
3,359 Text2SQL is Not Enough: Unifying AI and Databases with TAG 2025 CIDR 7.1744146e-05
3,407 End-to-end Optimization of Machine Learning Prediction Queries 2022 SIGMOD 7.1295646e-05
3,455 A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms 2014 SIGMOD 7.0771839e-05
3,467 Data Profiling – A Tutorial 2017 SIGMOD 7.069081e-05
3,473 AI Meets Database: AI4DB and DB4AI 2021 SIGMOD 7.062864e-05
3,638 Bolt-on Differential Privacy for Scalable Stochastic Gradient Descent-based Analytics 2017 SIGMOD 6.8952488e-05
3,875 Cloudy with High Chance of DBMS: A 10-year Prediction for Enterprise-Grade ML 2020 CIDR 6.675257e-05
3,948 A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics 2018 VLDB 6.5959084e-05
3,958 MLog: Towards Declarative In-Database Machine Learning 2017 VLDB 6.5897636e-05
3,988 All-in-One: Graph Processing in RDBMSs Revisited 2017 SIGMOD 6.5589605e-05
4,014 Exploiting Correlations for Expensive Predicate Evaluation 2015 SIGMOD 6.5273084e-05
4,033 In-RDBMS Hardware Acceleration of Advanced Analytics 2018 VLDB 6.5113267e-05
4,077 Towards High-Throughput Gibbs Sampling at Scale: A Study across Storage Managers 2013 SIGMOD 6.4678697e-05
4,129 Are Key-Foreign Key Joins Safe to Avoid when Learning High-Capacity Classifiers? 2018 VLDB 6.428887e-05
4,159 F: Regression Models over Factorized Views 2016 VLDB 6.3993326e-05
4,197 Incremental View Maintenance with Triple Lock Factorization Benefits 2018 SIGMOD 6.367895e-05
4,257 Combining Databases and Signal Processing in Plato 2015 CIDR 6.3164509e-05
4,395 Scalable Asynchronous Gradient Descent Optimization for Out-of-Core Models 2017 VLDB 6.2244283e-05
4,548 Efficient and Portable Einstein Summation in SQL 2023 SIGMOD 6.0953447e-05
Previous Page 1 / 3 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers