Database Paper Browser

The MADlib Analytics Library or MAD Skills, the SQL

Summary: MADlib is an open-source library for in-database analytics, providing SQL-based machine learning, data mining, and statistics that scale inside a DBMS without data export. The paper covers the library's architecture and design patterns, performance on Greenplum, and the goal of a CRAN-like, community-driven repository for scalable statistics across DBMSs. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10408
Venue: VLDB
Year: 2012
Pagerank: 0.00042270404
Overall Rank: 140 | 99.03%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 50 of 103 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
4,557 | Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches | 2021 | VLDB | 6.087611e-05
4,787 | The Relational Data Borg is Learning | 2020 | VLDB | 5.9224501e-05
4,924 | User-Defined Operators: Efficiently Integrating Custom Algorithms into Modern Databases | 2022 | VLDB | 5.822682e-05
5,084 | In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle | 2022 | SIGMOD | 5.7091191e-05
5,123 | Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning | 2019 | VLDB | 5.6796998e-05
5,395 | Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction | 2015 | SIGMOD | 5.5318806e-05
5,806 | BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees | 2019 | SIGMOD | 5.3200643e-05
6,075 | Opportunistic Physical Design for Big Data Analytics | 2014 | SIGMOD | 5.223901e-05
6,230 | Learned Approximate Query Processing: Make it Light, Accurate and Fast | 2021 | CIDR | 5.145989e-05
6,322 | The BUDS Language for Distributed Bayesian Machine Learning | 2017 | SIGMOD | 5.1124615e-05
6,327 | The Tensor Data Platform: Towards an AI-centric Database System | 2023 | CIDR | 5.1083405e-05
6,330 | Efficient Construction of Approximate Ad-Hoc ML models Through Materialization and Reuse | 2018 | VLDB | 5.1077416e-05
6,373 | DeepBase: Deep Inspection of Neural Networks | 2019 | SIGMOD | 5.0929326e-05
6,378 | Mitigating the Impedance Mismatch between Prediction Query Execution and Database Engine | 2025 | SIGMOD | 5.0909804e-05
6,380 | SmartLite: A DBMS-based Serving System for DNN Inference in Resource-constrained Environments | 2024 | VLDB | 5.0893219e-05
6,404 | ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation | 2019 | VLDB | 5.0786954e-05
6,538 | Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent | 2019 | SIGMOD | 5.023239e-05
6,549 | Demonstration of Nimbus: Model-based Pricing for Machine Learning in a Data Marketplace | 2019 | SIGMOD | 5.0175568e-05
6,644 | A Relational Matrix Algebra and its Implementation in a Column Store | 2020 | SIGMOD | 4.9782839e-05
6,722 | GeoDeepDive: Statistical Inference using Familiar Data-Processing Languages | 2013 | SIGMOD | 4.9491521e-05
6,986 | A Cost-based Optimizer for Gradient Descent Optimization | 2017 | SIGMOD | 4.8727048e-05
7,062 | EAGr: Supporting Continuous Ego-centric Aggregate Queries over Large Dynamic Graphs | 2014 | SIGMOD | 4.8462038e-05
7,179 | Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning | 2023 | VLDB | 4.8078895e-05
7,664 | Schema Independent Relational Learning | 2017 | SIGMOD | 4.6857329e-05
7,920 | JoinBoost: Grow Trees Over Normalized Data Using Only SQL | 2023 | VLDB | 4.6163888e-05
8,121 | Automation of Data Prep, ML, and Data Science: New Cure or Snake Oil? | 2021 | SIGMOD | 4.5809305e-05
8,230 | You Say 'What', I Hear 'Where' and 'Why' - (Mis-)Interpreting SQL to Derive Fine-Grained Provenance | 2018 | VLDB | 4.5541444e-05
8,399 | UDA-GIST: An In-database Framework to Unify Data-Parallel and State-Parallel Analytics | 2015 | VLDB | 4.5257744e-05
8,444 | Not Black-Box Anymore! Enabling Analytics-Aware Optimizations in Teradata Vantage | 2021 | VLDB | 4.5118994e-05
8,469 | Semantic Operators and Their Optimization: Enabling LLM-Based Data Processing with Accuracy Guarantees in LOTUS | 2025 | VLDB | 4.5041113e-05
8,789 | Machine Learning Meets Big Spatial Data | 2019 | VLDB | 4.4509194e-05
8,853 | Complaint-Driven Training Data Debugging at Interactive Speeds | 2022 | SIGMOD | 4.4350727e-05
8,864 | Cerebro: A Layered Data Platform for Scalable Deep Learning | 2021 | CIDR | 4.4326439e-05
8,968 | Ontological Pathfinding: Mining First-Order Knowledge from Large Knowledge Bases | 2016 | SIGMOD | 4.4190464e-05
9,222 | Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning | 2021 | VLDB | 4.3698672e-05
9,237 | Determining Exact Quantiles with Randomized Summaries | 2024 | SIGMOD | 4.3690661e-05
9,320 | Powering In-Database Dynamic Model Slicing for Structured Data Analytics | 2024 | VLDB | 4.3556432e-05
9,391 | Database as Runtime: Compiling LLMs to SQL for In-database Model Serving | 2025 | SIGMOD | 4.3441378e-05
9,436 | Transforming ML Predictive Pipelines into SQL with MASQ | 2021 | SIGMOD | 4.3430376e-05
9,476 | Adda: Towards Efficient in-Database Feature Generation via LLM-based Agents | 2025 | SIGMOD | 4.3341665e-05
9,856 | In-Database Data Imputation | 2024 | SIGMOD | 4.269353e-05
10,095 | NeurStore: Efficient In-database Deep Learning Model Management System | 2026 | SIGMOD | 4.1945683e-05
10,198 | Quantile Estimation with Duplicates | 2026 | SIGMOD | 4.1945683e-05
10,378 | HyperMR: Efficient Hypergraph-enhanced Matrix Storage on Compute-in-Memory Architecture | 2025 | SIGMOD | 4.1945683e-05
10,388 | Randomized Sketches for Quantile in LSM-tree based Store | 2025 | SIGMOD | 4.1945683e-05
10,482 | Fast and Scalable Data Transfer Across Data Systems | 2025 | SIGMOD | 4.1945683e-05
10,499 | Privacy and Accuracy-Aware AI/ML Model Deduplication | 2025 | SIGMOD | 4.1945683e-05
10,998 | Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems | 2024 | VLDB | 4.1945683e-05
11,283 | Demonstration of SPARQLML: An Interfacing Language for Supporting Graph Machine Learning for RDF Graphs | 2023 | VLDB | 4.1945683e-05
11,287 | AQUA: Automatic Collaborative Query Processing in Analytical Database | 2023 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
