Database Paper Browser

Lingua Manga: A Generic Large Language Model Centric System for Data Curation

Summary: Lingua Manga is an LLM-centric data curation system that uses pre-trained models and automatically optimizes prompting, model selection, and labeling strategies to maximize performance and label efficiency across heterogeneous curation tasks. It is demonstrated with three applications and supports rapid development for programmers as well as low-code and no-code users. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 13269
Venue: VLDB
Year: 2023
Pagerank: 4.3341665e-05
Overall Rank: 9,492 | 33.97%
DOI: 10.14778/3611540.3611624

Incoming Citations (Sorted by Pagerank)

Showing 1 of 1 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
10,835 | Large Language Models for Spatial Analysis Queries | 2025 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
94 | CrowdDB: Answering Queries with Crowdsourcing | 2011 | SIGMOD | 0.00051013264
192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858
221 | Deep Entity Matching with Pre-Trained Language Models | 2021 | VLDB | 0.00033121824
489 | Data Curation at Scale: The Data Tamer System | 2013 | CIDR | 0.00022030728
517 | Can Foundation Models Wrangle Your Data? | 2023 | VLDB | 0.00021169035

Semantically Similar Papers