Amalur: Next-generation Data Integration in Data Lakes
Summary: Amalur rethinks data integration for ML-centric data lakes by unifying classic data integration (DI) with factorized linear-algebra (LA) operators and pushing LA computation toward the data sources. This enables ML-friendly data extraction and preparation with reduced ETL/export overhead, along with efficient factorized execution across joins. (summarized by gpt-5-mini on Feb 09 2026)
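To make "factorized execution across joins" concrete, here is a minimal sketch in the spirit of factorized linear algebra over normalized data. It is not Amalur's implementation. The tables `S` and `R`, the foreign-key list `fk`, the weight vector `w`, and all function names are hypothetical and chosen only for illustration. The sketch compares a matrix-vector product over a materialized key/foreign-key join with the same product computed on the base tables and combined afterwards.

```python
# Hypothetical toy data (assumption, not from the paper):
# S is the fact table (3 rows, 1 feature) with a foreign key into R;
# R is the dimension table (2 rows, 2 features).
S = [[1.0], [2.0], [3.0]]
fk = [0, 1, 0]            # row i of S joins with row fk[i] of R
R = [[10.0, 20.0], [30.0, 40.0]]
w = [0.5, 0.1, 0.2]       # one weight per column of the joined table [S | R]


def matvec(rows, vec):
    """Plain matrix-vector product over a list of rows."""
    return [sum(a * b for a, b in zip(row, vec)) for row in rows]


def materialized(S, fk, R, w):
    """Baseline: build the join result T = [S | R[fk]], then multiply."""
    T = [s_row + R[k] for s_row, k in zip(S, fk)]
    return matvec(T, w)


def factorized(S, fk, R, w):
    """Multiply each base table separately, then combine via the foreign key."""
    d_s = len(S[0])
    s_part = matvec(S, w[:d_s])   # one value per row of S
    r_part = matvec(R, w[d_s:])   # one value per row of R, computed once
    return [sp + r_part[k] for sp, k in zip(s_part, fk)]


result_materialized = materialized(S, fk, R, w)
result_factorized = factorized(S, fk, R, w)
```

Both functions return the same vector, but the factorized version never builds the joined table: each row of `R` is multiplied once, however many rows of `S` reference it. That is where the savings come from when the join introduces redundancy.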
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Rihan Hai
2. Christos Koutras
3. Andra Ionescu
4. Asterios Katsifodimos
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,279 | Towards Linear Algebra over Normalized Data | 2017 | VLDB | 0.00012868394 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,833 | Data Wrangling: The Challenging Journey from the Wild to the Lake | 2015 | CIDR | 0.00010378976 |
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 3,948 | A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics | 2018 | VLDB | 6.5959084e-05 |
| 3,958 | MLog: Towards Declarative In-Database Machine Learning | 2017 | VLDB | 6.5897636e-05 |
| 4,003 | Data Platform for Machine Learning | 2019 | SIGMOD | 6.54347e-05 |
| 13,277 | The Challenge of Building Effective Data Lakes | 2020 | SIGMOD | - |
| 939 | Data Lake Management: Challenges and Opportunities | 2019 | VLDB | 0.00015187344 |
| 2,194 | Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra | 2019 | SIGMOD | 9.3138337e-05 |
| 4,859 | Integrating Data Lake Tables | 2023 | VLDB | 5.8732433e-05 |
| 1,279 | Towards Linear Algebra over Normalized Data | 2017 | VLDB | 0.00012868394 |