Lingua Manga : A Generic Large Language Model Centric System for Data Curation
Summary: LLM-centric data curation system that uses pre-trained models and automated optimization of prompting, model selection, and labeling strategies to maximize performance and label efficiency across heterogeneous curation tasks. Demonstrated with three applications, enabling rapid development for programmers as well as low-/no-code users. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Zui Chen
- 2. Lei Cao
- 3. Sam Madden
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,835 | Large Language Models for Spatial Analysis Queries | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264 |
| 192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858 |
| 221 | Deep Entity Matching with Pre-Trained Language Models | 2021 | VLDB | 0.00033121824 |
| 489 | Data Curation at Scale: The Data Tamer System | 2013 | CIDR | 0.00022030728 |
| 517 | Can Foundation Models Wrangle Your Data? | 2023 | VLDB | 0.00021169035 |
Previous
Page 1 / 1
Next