Database Paper Browser

Back to papers

DimmWitted: A Study of Main-Memory Statistical Analytics

Summary: Study of main-memory statistical analytics on NUMA; compares row- vs column-order access and data/model sharing granularity. Uncovers hardware-statistical efficiency tradeoffs; prototype runs popular tasks (SVM, LR, Gibbs, NN) up to 100x faster across five architectures. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10788
Venue
VLDB
Year
2014
Pagerank
0.00014475229
Overall Rank
1,044 | 92.74%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 25 of 25 citing papers.

Rank Citing Paper Year Venue Pagerank
192 HoloClean: Holistic Data Repairs with Probabilistic Inference 2017 VLDB 0.00035728858
667 Incremental Knowledge Base Construction Using DeepDive 2015 VLDB 0.00018440557
1,532 Data Management in Machine Learning: Challenges, Techniques, and Systems 2017 SIGMOD 0.00011472681
1,873 An Architecture for Compiling UDF-centric Workflows 2015 VLDB 0.00010253002
1,942 Heterogeneity-aware Distributed Parameter Servers 2017 SIGMOD 0.00010012691
2,163 Elastic Machine Learning Algorithms in Amazon SageMaker 2020 SIGMOD 9.3949234e-05
2,418 Tupleware: "Big" Data, Big Analytics, Small Clusters 2015 CIDR 8.8556595e-05
3,363 CROSSBOW: Scaling Deep Learning with Small Batch Sizes on Multi-GPU Servers 2019 VLDB 7.1731921e-05
3,808 SketchML: Accelerating Distributed Machine Learning with Data Sketches 2018 SIGMOD 6.7455428e-05
3,897 SLiMFast: Guaranteed Results for Data Fusion and Source Reliability 2017 SIGMOD 6.6554845e-05
4,033 In-RDBMS Hardware Acceleration of Advanced Analytics 2018 VLDB 6.5113267e-05
4,106 Extracting Databases from Dark Data with DeepDive 2016 SIGMOD 6.4456184e-05
4,802 Resource Elasticity for Large-Scale Machine Learning 2015 SIGMOD 5.9114415e-05
4,975 An Experimental Evaluation of Large Scale GBDT Systems 2019 VLDB 5.79026e-05
5,084 In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle 2022 SIGMOD 5.7091191e-05
5,123 Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning 2019 VLDB 5.6796998e-05
5,333 Heterogeneity-Aware Distributed Machine Learning Training via Partial Reduce 2021 SIGMOD 5.5656575e-05
5,806 BlinkML: Efficient Maximum Likelihood Estimation with Probabilistic Guarantees 2019 SIGMOD 5.3200643e-05
6,404 ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation 2019 VLDB 5.0786954e-05
6,525 Database Technology for the Masses: Sub-Operators as First-Class Entities 2021 VLDB 5.027205e-05
6,986 A Cost-based Optimizer for Gradient Descent Optimization 2017 SIGMOD 4.8727048e-05
8,126 SDPipe: A Semi-Decentralized Framework for Heterogeneity-aware Pipeline-parallel Training 2023 VLDB 4.5796615e-05
8,737 Scheduling Data Processing Pipelines for Incremental Training on MLP-based Recommendation Models 2025 SIGMOD 4.456315e-05
9,075 ParaX: Boosting Deep Learning for Big Data Analytics on Many-Core CPUs 2021 VLDB 4.4020349e-05
9,965 Distributed Learning of Fully Connected Neural Networks using Independent Subnet Training 2022 VLDB 4.2269436e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 11 of 11 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers