DataLoom: Simplifying Data Loading with LLMs
Summary: DataLoom maps chaotic file collections to initial schemas using LLMs for semantic tasks (header inference, column typing, table merging). It orchestrates LLMs for soft decisions and classical algorithms for hard constraints to minimize manual effort and time-to-first-insight. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Alexander van Renen
- 2. Mihail Stoian
- 3. Andreas Kipf
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,207 | SQLStorm: Taking Database Benchmarking into the LLM Era | 2025 | VLDB | 4.5583637e-05 |
| 10,835 | Large Language Models for Spatial Analysis Queries | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 167 | The Snowflake Elastic Data Warehouse | 2016 | SIGMOD | 0.00039180521 |
| 185 | DuckDB: an Embeddable Analytical Database | 2019 | SIGMOD | 0.00036538405 |
| 939 | Data Lake Management: Challenges and Opportunities | 2019 | VLDB | 0.00015187344 |
| 1,187 | JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes | 2019 | SIGMOD | 0.00013443639 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,625 | Data Profiling with Metanome | 2015 | VLDB | 0.00011094926 |
| 5,509 | Can Large Language Models Predict Data Correlations from Column Names? | 2023 | VLDB | 5.4703368e-05 |
Previous
Page 1 / 1
Next