Back to papers
ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation
Summary: ColumnML integrates ML in a column-store in-memory DBMS via coordinate-descent on columnar data. It delivers near-linear multi-core scaling (up to 14 cores) and CPU+FPGA hybrid acceleration for on-the-fly data transformation, while preserving compression.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 11970
- Venue
- VLDB
- Year
- 2019
- Pagerank
- 5.0786954e-05
- Overall Rank
- 6,404 | 55.45%
- DOI
-
10.14778/3297753.3297756
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 2,791 |
Towards Demystifying Serverless Machine Learning Training |
2021 |
SIGMOD |
8.1206618e-05 |
| 3,327 |
Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects |
2020 |
SIGMOD |
7.2205738e-05 |
| 3,473 |
AI Meets Database: AI4DB and DB4AI |
2021 |
SIGMOD |
7.062864e-05 |
| 5,084 |
In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle |
2022 |
SIGMOD |
5.7091191e-05 |
| 5,123 |
Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning |
2019 |
VLDB |
5.6796998e-05 |
| 7,803 |
Hardware Acceleration of Compression and Encryption in SAP HANA |
2022 |
VLDB |
4.6468619e-05 |
| 8,048 |
Lowering the Latency of Data Processing Pipelines Through FPGA based Hardware Acceleration |
2020 |
VLDB |
4.5977431e-05 |
| 8,346 |
Deep Learning: Systems and Responsibility |
2021 |
SIGMOD |
4.5420668e-05 |
| 8,441 |
Tackling Hardware/Software co-design from a database perspective |
2020 |
CIDR |
4.5124343e-05 |
| 9,435 |
AMNES: Accelerating the computation of data correlation using FPGAs |
2023 |
VLDB |
4.3430376e-05 |
| 10,998 |
Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems |
2024 |
VLDB |
4.1945683e-05 |
| 11,629 |
Leveraging Organizational Resources to Adapt Models to New Data Modalities |
2020 |
VLDB |
4.1945683e-05 |
| 11,676 |
doppioDB 2.0: Hardware Techniques for Improved Integration of Machine Learning into Databases |
2019 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 20 of 20 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 140 |
The MADlib Analytics Library or MAD Skills, the SQL |
2012 |
VLDB |
0.00042270404 |
| 476 |
Impala: A Modern, Open-Source SQL Engine for Hadoop |
2015 |
CIDR |
0.00022226941 |
| 540 |
Design and Evaluation of Main Memory Hash Join Algorithms for Multi-core CPUs |
2011 |
SIGMOD |
0.0002063443 |
| 543 |
MLbase: A Distributed Machine-learning System |
2013 |
CIDR |
0.00020526854 |
| 613 |
Design and Implementation of the LogicBlox System |
2015 |
SIGMOD |
0.00019181325 |
| 658 |
Towards a Unified Architecture for in-RDBMS Analytics |
2012 |
SIGMOD |
0.00018506577 |
| 973 |
Orthogonal Security With Cipherbase |
2013 |
CIDR |
0.00014921633 |
| 1,044 |
DimmWitted: A Study of Main-Memory Statistical Analytics |
2014 |
VLDB |
0.00014475229 |
| 1,158 |
Simulation of Database-Valued Markov Chains Using SimSQL |
2013 |
SIGMOD |
0.0001361064 |
| 1,167 |
Learning Generalized Linear Models Over Normalized Data |
2015 |
SIGMOD |
0.00013547713 |
| 1,532 |
Data Management in Machine Learning: Challenges, Techniques, and Systems |
2017 |
SIGMOD |
0.00011472681 |
| 1,700 |
Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads |
2016 |
SIGMOD |
0.00010858865 |
| 2,312 |
Real-Time Analytical Processing with SQL Server |
2015 |
VLDB |
9.0530853e-05 |
| 2,330 |
Concurrent Analytical Query Processing with GPUs |
2014 |
VLDB |
9.0192228e-05 |
| 3,151 |
A Memory Bandwidth-Efficient Hybrid Radix Sort on GPUs |
2017 |
SIGMOD |
7.4720668e-05 |
| 3,880 |
Caribou: Intelligent Distributed Storage |
2017 |
VLDB |
6.6700303e-05 |
| 4,033 |
In-RDBMS Hardware Acceleration of Advanced Analytics |
2018 |
VLDB |
6.5113267e-05 |
| 5,178 |
FPGA-based Data Partitioning |
2017 |
SIGMOD |
5.6438393e-05 |
| 8,202 |
Accelerating Pattern Matching Queries in Hybrid CPU-FPGA Architectures |
2017 |
SIGMOD |
4.5598793e-05 |
| 8,423 |
doppioDB: A Hardware Accelerated Database |
2017 |
SIGMOD |
4.5163448e-05 |
Semantically Similar Papers