Database Paper Browser

Back to papers

Doing More with Less: Characterizing Dataset Downsampling for AutoML

Summary: Downsampling large tabular data reshapes AutoML search under fixed time budgets. Empirical study of a genetic-programming AutoML search reveals tradeoffs between pipeline quality and search efficiency, guiding scalable AutoML for big data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12386
Venue
VLDB
Year
2021
Pagerank
5.8035715e-05
Overall Rank
4,957 | 65.52%
DOI
10.14778/3476249.3476262

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
5,026 AutoCTS: Automated Correlated Time Series Forecasting 2022 VLDB 5.7528419e-05
10,252 CAPS: Cost-Aware ML Pipeline Selection 2026 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers