Back to papers
Birdie: Natural Language-Driven Table Discovery Using Differentiable Search Index
Summary: Birdie replaces the representation–index–search pipeline for NL table discovery with a differentiable search index: an encoder–decoder LM trained to map synthetic LLM-generated queries/tables to prefix-aware table identifiers, enabling deeper query–table interaction. Uses parameter-isolation for continual indexing, reducing forgetting >90% and improving accuracy +16.8% vs dense baselines.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 13860
- Venue
- VLDB
- Year
- 2025
- Pagerank
- 4.1945683e-05
- Overall Rank
- 10,589 | 26.34%
- DOI
-
10.14778/3734839.3734845
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 513 |
TURL: Table Understanding through Representation Learning |
2021 |
VLDB |
0.00021288342 |
| 1,178 |
Table Union Search on Open Data |
2018 |
VLDB |
0.00013468118 |
| 1,187 |
JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes |
2019 |
SIGMOD |
0.00013443639 |
| 1,872 |
ReAcTable: Enhancing ReAct for Table Question Answering |
2024 |
VLDB |
0.00010259702 |
| 2,587 |
Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks |
2024 |
SIGMOD |
8.4924618e-05 |
| 2,633 |
Schema Extraction for Tabular Data on the Web |
2013 |
VLDB |
8.4063569e-05 |
| 2,836 |
Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning |
2023 |
VLDB |
8.0443826e-05 |
| 3,000 |
SANTOS: Relationship-based Semantic Table Union Search |
2023 |
SIGMOD |
7.7462128e-05 |
| 3,148 |
ARM-Net: Adaptive Relation Modeling Network for Structured Data |
2021 |
SIGMOD |
7.4751269e-05 |
| 3,335 |
DeepJoin: Joinable Table Discovery with Pre-trained Language Models |
2023 |
VLDB |
7.2065006e-05 |
| 5,033 |
FinSQL: Model-Agnostic LLMs-based Text-to-SQL Framework for Financial Analysis |
2024 |
SIGMOD |
5.7486224e-05 |
| 5,449 |
Transformers for Tabular Data Representation: A Tutorial on Models and Applications |
2022 |
VLDB |
5.5008652e-05 |
| 6,092 |
Observatory: Characterizing Embeddings of Relational Tables |
2024 |
VLDB |
5.2138566e-05 |
| 7,868 |
Solo: Data Discovery Using Natural Language Questions Via A Self-Supervised Approach |
2023 |
SIGMOD |
4.6319504e-05 |
| 8,116 |
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes |
2024 |
VLDB |
4.581507e-05 |
| 9,490 |
Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph |
2023 |
VLDB |
4.3341665e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 3,426 |
Discovering Topical Structures of Databases |
2008 |
SIGMOD |
7.1063105e-05 |
| 7,643 |
Cross Modal Data Discovery over Structured and Unstructured Data Lakes |
2023 |
VLDB |
4.6901105e-05 |
| 10,155 |
DIVER: A Robust Text-to-SQL System with Dynamic Interactive Value Linking and Evidence Reasoning |
2026 |
SIGMOD |
4.1945683e-05 |
| 7,354 |
Reliable Text-to-SQL with Adaptive Abstention |
2025 |
SIGMOD |
4.7529612e-05 |
| 6,800 |
DTT: An Example-Driven Tabular Transformer for Joinability by Leveraging Large Language Models |
2024 |
SIGMOD |
4.9231471e-05 |
| 10,823 |
TableCopilot: A Table Assistant Empowered by Natural Language Conditional Table Discovery |
2025 |
VLDB |
4.1945683e-05 |
| 11,540 |
DBTagger: Multi-Task Learning for Keyword Mapping in NLIDBs Using Bi-Directional Recurrent Neural Networks |
2021 |
VLDB |
4.1945683e-05 |
| 10,221 |
NL2SQLBench: A Modular Benchmarking Framework for LLM-Enabled NL2SQL Solutions |
2026 |
VLDB |
4.1945683e-05 |
| 2,836 |
Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning |
2023 |
VLDB |
8.0443826e-05 |
| 3,335 |
DeepJoin: Joinable Table Discovery with Pre-trained Language Models |
2023 |
VLDB |
7.2065006e-05 |