| 1,427 | Towards Scalable Dataframe Systems | 2020 | VLDB | 0.0001204248 |
| 2,122 | SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle | 2020 | CIDR | 9.4989076e-05 |
| 2,456 | Production Machine Learning Pipelines: Empirical Analysis and Optimization Opportunities | 2021 | SIGMOD | 8.7733773e-05 |
| 3,023 | Helix: Accelerating Human-in-the-loop Machine Learning | 2018 | VLDB | 7.6929986e-05 |
| 3,393 | Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows | 2022 | VLDB | 7.1483239e-05 |
| 3,625 | Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings | 2020 | SIGMOD | 6.9055212e-05 |
| 4,557 | Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches | 2021 | VLDB | 6.087611e-05 |
| 4,774 | LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems | 2021 | SIGMOD | 5.9316087e-05 |
| 4,935 | OmniFair: A Declarative System for Model-Agnostic Group Fairness in Machine Learning | 2021 | SIGMOD | 5.8198727e-05 |
| 4,957 | Doing More with Less: Characterizing Dataset Downsampling for AutoML | 2021 | VLDB | 5.8035715e-05 |
| 6,000 | DeepEverest: Accelerating Declarative Top-K Queries for Deep Neural Network Interpretation | 2022 | VLDB | 5.2415551e-05 |
| 6,053 | Optimizing Machine Learning Workloads in Collaborative Environments | 2020 | SIGMOD | 5.2326838e-05 |
| 6,469 | Materialization and Reuse Optimizations for Production Data Science Pipelines | 2022 | SIGMOD | 5.0519488e-05 |
| 6,733 | Hindsight Logging for Model Training | 2021 | VLDB | 4.9467666e-05 |
| 7,482 | Provenance-Enabled Explainable AI | 2024 | SIGMOD | 4.7180617e-05 |
| 7,656 | Nautilus: An Optimized System for Deep Transfer Learning over Evolving Training Datasets | 2022 | SIGMOD | 4.6871575e-05 |
| 7,704 | ExDRa: Exploratory Data Science on Federated Raw Data | 2021 | SIGMOD | 4.6733838e-05 |
| 8,092 | Saga: A Scalable Framework for Optimizing Data Cleaning Pipelines for Machine Learning Applications | 2023 | SIGMOD | 4.587921e-05 |
| 8,257 | Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines | 2023 | SIGMOD | 4.5487511e-05 |
| 8,514 | UPLIFT: Parallelization Strategies for Feature Transformations in Machine Learning Workloads | 2022 | VLDB | 4.4944285e-05 |
| 9,223 | Intermittent Human-in-the-Loop Model Selection using Cerebro: A Demonstration | 2021 | VLDB | 4.3698672e-05 |
| 9,344 | Hippo: Sharing Computations in Hyper-Parameter Optimization | 2022 | VLDB | 4.3539442e-05 |
| 9,806 | The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format | 2024 | SIGMOD | 4.2805224e-05 |
| 9,912 | ElasticNotebook: Enabling Live Migration for Computational Notebooks | 2024 | VLDB | 4.2565279e-05 |
| 10,252 | CAPS: Cost-Aware ML Pipeline Selection | 2026 | VLDB | 4.1945683e-05 |
| 10,338 | Flow with FlorDB: Incremental Context Maintenance for the Machine Learning Lifecycle | 2025 | CIDR | 4.1945683e-05 |
| 10,469 | Alsatian: Optimizing Model Search for Deep Transfer Learning | 2025 | SIGMOD | 4.1945683e-05 |
| 11,476 | Enforcing Constraints for Machine Learning Systems via Declarative Feature Selection: An Experimental Study | 2021 | SIGMOD | 4.1945683e-05 |
| 11,691 | Enabling Data Science for the Majority | 2019 | VLDB | 4.1945683e-05 |