Database Paper Browser

Data Imputation with Limited Data Redundancy Using Data Lakes

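Summary: LakeFill targets data imputation when a table has too little internal redundancy to fill its own missing values. It combines LLMs with a data lake: incomplete tuples are encoded and matched by tuple-level retrieval to find candidate tuples in other tables, the candidates are reordered with checklist-based reranking, and a two-stage confidence-aware reasoner processes the reranked candidates. The summary reports that it outperforms prior methods. (summarized by gpt-5-mini on Feb 09 2026)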

Paper ID: 13966
Venue: VLDB
Year: 2025
Pagerank: 4.3341665e-05
Overall Rank: 9,479 | 34.06%
DOI: 10.14778/3748191.3748200

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
9,984 | Towards Scalable Visual Data Wrangling via Direct Manipulation | 2026 | CIDR | 4.1945683e-05
10,289 | LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning | 2026 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 14 of 14 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers