Baran: Effective Error Correction via a Unified Context Representation and Transfer Learning
Summary: Baran introduces a unified context representation and transfer learning to fuse multiple error-corrector models for data repair. Modeling full context—value, tuple co-occurrences, and attribute type—yields richer candidates and higher precision, with Wikipedia pretraining boosting recall and needing ~20 labeled tuples. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 32 of 32 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,445 | QFix: Diagnosing Errors through Query Histories | 2017 | SIGMOD | 5.5020909e-05 |
| 11,841 | BART in Action: Error Generation and Empirical Evaluations of Data-Cleaning Systems | 2016 | SIGMOD | 4.1945683e-05 |
| 11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05 |
| 8,585 | Robust Entity Resolution using Random Graphs | 2018 | SIGMOD | 4.4905755e-05 |
| 3,192 | Towards Dependable Data Repairing with Fixing Rules | 2014 | SIGMOD | 7.4095761e-05 |
| 2,506 | Auto-Detect: Data-Driven Error Detection in Tables | 2018 | SIGMOD | 8.6335464e-05 |
| 2,968 | Raha: A Configuration-Free Error Detection System | 2019 | SIGMOD | 7.7985097e-05 |
| 1,159 | Towards Certain Fixes with Editing Rules and Master Data | 2010 | VLDB | 0.00013592813 |
| 5,412 | Mining an "Anti-Knowledge Base" from Wikipedia Updates with Applications to Fact Checking and Beyond | 2020 | VLDB | 5.5207515e-05 |
| 6,187 | Semi-Supervised Data Cleaning with Raha and Baran | 2021 | CIDR | 5.1656857e-05 |