DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language Models
Summary: DTT is an example-driven tabular transformer for joining tables whose values appear in heterogeneous formats. Fine-tuned LLMs learn the mapping from a few examples, supporting accurate joins, filling in missing values, and error detection; the approach outperforms traditional methods and rivals GPT-3. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,587 | Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks | 2024 | SIGMOD | 8.4924618e-05 |
| 9,399 | TabulaX: Leveraging Large Language Models for Multi-Class Table Transformations | 2025 | VLDB | 4.3441378e-05 |
| 10,595 | Optimized Batch Prompting for Cost-effective LLMs | 2025 | VLDB | 4.1945683e-05 |
| 10,598 | Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence | 2025 | VLDB | 4.1945683e-05 |
| 10,610 | Weak-to-Strong Prompts with Lightweight-to-Powerful LLMs for High-Accuracy, Low-Cost, and Explainable Data Transformation | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,892 | Generation of Training Examples for Tabular Natural Language Inference | 2023 | SIGMOD | 4.4275457e-05 |
| 3,735 | Auto-Join: Joining Tables by Leveraging Transformations | 2017 | VLDB | 6.8061318e-05 |
| 2,517 | Annotating Columns with Pre-trained Language Models | 2022 | SIGMOD | 8.6092139e-05 |
| 10,973 | Unstructured Data Fusion for Schema and Data Extraction | 2024 | SIGMOD | 4.1945683e-05 |
| 8,913 | Making Table Understanding Work in Practice | 2022 | CIDR | 4.427232e-05 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 1.0109102e-04 |
| 5,449 | Transformers for Tabular Data Representation: A Tutorial on Models and Applications | 2022 | VLDB | 5.5008652e-05 |
| 2,587 | Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks | 2024 | SIGMOD | 8.4924618e-05 |
| 9,399 | TabulaX: Leveraging Large Language Models for Multi-Class Table Transformations | 2025 | VLDB | 4.3441378e-05 |
| 3,335 | DeepJoin: Joinable Table Discovery with Pre-trained Language Models | 2023 | VLDB | 7.2065006e-05 |