Database Paper Browser

Back to papers

Observatory: Characterizing Embeddings of Relational Tables

Summary: Introduces Observatory, an extensible framework of eight primitive properties and quantitative measures for systematically characterizing table embeddings. Applied to nine models, it uncovers column-order sensitivity, weak encoding of functional dependencies, and lower sample-fidelity in specialized table models. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13759
Venue
VLDB
Year
2024
Pagerank
5.2138566e-05
Overall Rank
6,092 | 57.62%
DOI
10.14778/3636218.3636237

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 4 of 4 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 17 of 17 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
8 Extending the Data Base Relational Model to Capture More Meaning 1979 SIGMOD 0.0015385917
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
420 InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables 2012 SIGMOD 0.00023719065
513 TURL: Table Understanding through Representation Learning 2021 VLDB 0.00021288342
517 Can Foundation Models Wrangle Your Data? 2023 VLDB 0.00021169035
894 A Hybrid Approach to Functional Dependency Discovery 2016 SIGMOD 0.00015556428
1,178 Table Union Search on Open Data 2018 VLDB 0.00013468118
1,187 JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes 2019 SIGMOD 0.00013443639
1,664 On Multi-Column Foreign Key Discovery 2010 VLDB 0.00010976887
1,914 Creating Embeddings of Heterogeneous Relational Datasets for Data Integration Tasks 2020 SIGMOD 0.00010109102
2,141 LSH Ensemble: Internet-Scale Domain Search 2016 VLDB 9.4542625e-05
2,349 RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation 2021 VLDB 8.9876423e-05
2,517 Annotating Columns with Pre-trained Language Models 2022 SIGMOD 8.6092139e-05
2,888 Sato: Contextual Semantic Type Detection in Tables 2020 VLDB 7.9594996e-05
3,015 Chorus: Foundation Models for Unified Data Discovery and Exploration 2024 VLDB 7.7092391e-05
5,449 Transformers for Tabular Data Representation: A Tutorial on Models and Applications 2022 VLDB 5.5008652e-05
8,193 WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses 2023 CIDR 4.5618596e-05
Previous Page 1 / 1 Next

Semantically Similar Papers