DiffPrep: Differentiable Data Preprocessing Pipeline Search for Learning over Tabular Data
Summary: DiffPrep formulates the search for a data preprocessing pipeline as a differentiable bi-level optimization problem. Relaxing the discrete choice of preprocessing operators into a continuous space lets gradient descent find a pipeline while training the model only once. It achieves the best accuracy on 15 of 18 datasets, with gains of up to 6.6 percentage points. (summarized by gpt-5-nano on Feb 09 2026)
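The continuous relaxation mentioned in the summary can be illustrated with a minimal sketch. This is not the authors' implementation: the candidate operators, the function names (`relaxed_transform`, `discretize`), and the use of a softmax over per-operator logits are all assumptions chosen for illustration. The idea shown is that a discrete choice among operators is replaced by a weighted mixture whose weights are differentiable in the logits, and the final pipeline is read off by taking the highest-scoring operator.

```python
import math


def softmax(logits):
    """Turn unconstrained logits into positive weights that sum to 1."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical candidate operators for one numeric column.
def identity(xs):
    return list(xs)


def standardize(xs):
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    std = math.sqrt(var) or 1.0  # avoid division by zero on constant columns
    return [(x - mean) / std for x in xs]


def min_max(xs):
    lo, hi = min(xs), max(xs)
    span = (hi - lo) or 1.0
    return [(x - lo) / span for x in xs]


CANDIDATE_OPS = [identity, standardize, min_max]


def relaxed_transform(xs, logits):
    """Weighted mixture of all candidate outputs (the relaxed 'choice')."""
    weights = softmax(logits)
    outputs = [op(xs) for op in CANDIDATE_OPS]
    return [
        sum(w * out[i] for w, out in zip(weights, outputs))
        for i in range(len(xs))
    ]


def discretize(logits):
    """After the search, keep only the operator with the largest logit."""
    best = max(range(len(logits)), key=lambda i: logits[i])
    return CANDIDATE_OPS[best]


column = [1.0, 2.0, 3.0, 4.0]
mixed = relaxed_transform(column, [0.0, 0.0, 0.0])  # equal weights
chosen = discretize([0.1, 2.5, -1.0])               # picks standardize
```

In a bi-level setup of the kind the summary describes, the logits would be updated by gradient descent on a validation loss while the model weights are updated on the training loss; that outer loop is omitted here.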
Authors
1. Peng Li
2. Zhiyi Chen
3. Xu Chu
4. Kexin Rong
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,743 | CtxPipe: Context-aware Data Preparation Pipeline Construction for Machine Learning | 2024 | SIGMOD | 4.456315e-05 |
| 10,316 | LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning | 2026 | VLDB | 4.1945683e-05 |
| 10,758 | Stress-Testing ML Pipelines with Adversarial Data Corruption | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 791 | ActiveClean: Interactive Data Cleaning For Statistical Modeling | 2016 | VLDB | 0.00016629664 |
| 921 | Democratizing Data Science through Interactive Curation of ML Pipelines | 2019 | SIGMOD | 0.00015337438 |
| 1,612 | Detecting Data Errors: Where are we and what needs to be done? | 2016 | VLDB | 0.00011142794 |
| 1,627 | Data Cleaning: Overview and Emerging Challenges | 2016 | SIGMOD | 0.00011086905 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 0.00010109102 |
| 2,302 | Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions | 2021 | VLDB | 9.0668832e-05 |
| 4,967 | Leva: Boosting Machine Learning Performance with Relational Embedding Data Augmentation | 2022 | SIGMOD | 5.7956612e-05 |