Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration
Summary: Unicorn is a unified multi-task model for data-matching tasks in data integration. A single encoder and matcher with a mixture-of-experts enables cross-task knowledge sharing and zero-shot support across 20 datasets and 7 tasks, outperforming ad-hoc baselines. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Jianhong Tu
2. Ju Fan
3. Nan Tang
4. Peng Wang
5. Guoliang Li
6. Xiaoyong Du
7. Xiaofeng Jia
8. Song Gao
Incoming Citations (Sorted by Pagerank)
Showing 14 of 14 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,153 | Deep Transfer Learning for Multi-source Entity Linkage via Domain Adaptation | 2022 | VLDB | 4.574554e-05 |
| 10,800 | Unify: A System For Unstructured Data Analytics | 2025 | VLDB | 4.1945683e-05 |
| 6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05 |
| 7,668 | Human-in-the-loop Data Integration | 2017 | VLDB | 4.6834075e-05 |
| 5,081 | Reducing Uncertainty of Schema Matching via Crowdsourcing | 2013 | VLDB | 5.7132042e-05 |
| 8,824 | Analyzing and Revising Data Integration Schemas to Improve Their Matchability | 2008 | VLDB | 4.4415658e-05 |
| 11,223 | Splitting Tuples of Mismatched Entities | 2023 | SIGMOD | 4.1945683e-05 |
| 9,020 | Entity Matching in the Wild: A Consistent and Versatile Framework to Unify Data in Industrial Applications | 2020 | SIGMOD | 4.4079449e-05 |
| 9,460 | The Battleship Approach to the Low Resource Entity Matching Problem | 2023 | SIGMOD | 4.3366491e-05 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 1.0109102e-04 |