Automation of Data Prep, ML, and Data Science: New Cure or Snake Oil?
Summary: The panel critiques automated data prep and ML tools, asking whether the industry hype reflects real gains or snake oil. It emphasizes the lack of benchmarks and calls for rigorous evaluation and academia–industry collaboration to advance data-prep research in the database and data management community. (summarized by gpt-5-nano on Feb 09 2026)
Authors
- Arun Kumar
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,807 | Pollock: A Data Loading Benchmark | 2023 | VLDB | 4.6457732e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 5,242 | Towards Benchmarking Feature Type Inference for AutoML Platforms | 2021 | SIGMOD | 5.6074743e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,637 | Machine Learning for Data Management: Problems and Solutions | 2018 | SIGMOD | 4.479892e-05 |
| 6,526 | Data Collection and Quality Challenges for Deep Learning | 2020 | VLDB | 5.0267429e-05 |
| 9,835 | Is Data Management the Beating Heart of AI Systems? | 2022 | SIGMOD | 4.2747054e-05 |
| 13,148 | The Limitations of Data, Machine Learning and Us | 2024 | SIGMOD | - |
| 13,244 | Deep Data Integration | 2021 | SIGMOD | - |
| 13,268 | From ML Models to Intelligent Applications: The Rise of MLOps | 2021 | VLDB | - |
| 7,259 | Panel: A Debate on Data and Algorithmic Ethics | 2018 | VLDB | 4.7865546e-05 |
| 13,197 | Will LLMs reshape, supercharge, or kill data science? (VLDB 2023 Panel) | 2023 | VLDB | - |
| 9,637 | Opportunities for Data Management Research in the Era of Horizontal AI/ML | 2019 | VLDB | 4.3111161e-05 |
| 10,844 | Panel on Neural Relational Data: Tabular Foundation Models, LLMs... or both? | 2025 | VLDB | 4.1945683e-05 |