Observatory: Characterizing Embeddings of Relational Tables
Summary: Introduces Observatory, an extensible framework of eight primitive properties and quantitative measures for systematically characterizing table embeddings. Applied to nine models, it uncovers sensitivity to column order, weak encoding of functional dependencies, and lower sample fidelity in specialized table models. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
1. Tianji Cong
2. Madelon Hulsebos
3. Zhenjie Sun
4. Paul Groth
5. H. V. Jagadish
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,963 | DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing | 2025 | VLDB | 9.929429e-05 |
| 10,589 | Birdie: Natural Language-Driven Table Discovery Using Differentiable Search Index | 2025 | VLDB | 4.1945683e-05 |
| 10,753 | Cents: A Flexible and Cost-Effective Framework for LLM-Based Table Understanding | 2025 | VLDB | 4.1945683e-05 |
| 10,844 | Panel on Neural Relational Data: Tabular Foundation Models, LLMs... or both? | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 17 of 17 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,587 | Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks | 2024 | SIGMOD | 8.4924618e-05 |
| 5,989 | Tables As a Paradigm for Querying and Restructuring | 1996 | PODS | 5.2429037e-05 |
| 8,913 | Making Table Understanding Work in Practice | 2022 | CIDR | 4.427232e-05 |
| 3,520 | GitTables: A Large-Scale Corpus of Relational Tables | 2023 | SIGMOD | 7.0131061e-05 |
| 10,269 | Database Views as Explanations for Relational Deep Learning | 2026 | VLDB | 4.1945683e-05 |
| 513 | TURL: Table Understanding through Representation Learning | 2021 | VLDB | 0.00021288342 |
| 9,886 | Scalable and Usable Relational Learning With Automatic Language Bias | 2021 | SIGMOD | 4.2621158e-05 |
| 3,335 | DeepJoin: Joinable Table Discovery with Pre-trained Language Models | 2023 | VLDB | 7.2065006e-05 |
| 5,449 | Transformers for Tabular Data Representation: A Tutorial on Models and Applications | 2022 | VLDB | 5.5008652e-05 |
| 1,914 | Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks | 2020 | SIGMOD | 0.00010109102 |