Data Integration and Machine Learning: A Natural Synergy
Summary: Tutorial on Data Integration and Machine Learning: ML-powered data integration and human-in-the-loop pipelines; clean, relevant data for ML. Three: ML in data integration; data integration for ML analytics; open challenges at the data-integration–ML frontier. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,526 | Data Collection and Quality Challenges for Deep Learning | 2020 | VLDB | 5.0267429e-05 |
| 7,306 | DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines | 2022 | CIDR | 4.7678574e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 23 of 23 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,420 | Data Management Challenges in Production Machine Learning | 2017 | SIGMOD | 0.00012057956 |
| 7,655 | Machine Learning for Cloud Data Systems: the Progress so far and the Path Forward | 2021 | VLDB | 4.6872456e-05 |
| 4,906 | Machine Learning for Big Data | 2013 | SIGMOD | 5.8389053e-05 |
| 13,244 | Deep Data Integration | 2021 | SIGMOD | - |
| 48 | Data Integration: A Theoretical Perspective | 2002 | PODS | 0.00069720859 |
| 398 | Big Data Integration | 2013 | VLDB | 0.00024372588 |
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 9,777 | Data Augmentation for ML-driven Data Preparation and Integration | 2021 | VLDB | 4.2856106e-05 |
| 5,976 | Responsible Data Integration: Next-generation Challenges | 2022 | SIGMOD | 5.245976e-05 |
| 4,607 | Data Integration and Machine Learning: A Natural Synergy | 2018 | SIGMOD | 6.0538827e-05 |