WarpGate: A Semantic Join Discovery System for Cloud Data Warehouses
Summary: WarpGate: embedding-based semantic join discovery for cloud data warehouses—maps columns into vectors so transformable, cross-database join relationships appear as proximity rather than exact syntactic matches. Prototype is sample-efficient, scales to millions of rows, and integrated into an enterprise analytics product. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Tianji Cong
- 2. James Gale
- 3. Jason Frantz
- 4. H. V. Jagadish
- 5. Çağatay Demiralp
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,836 | Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning | 2023 | VLDB | 8.0443826e-05 |
| 6,092 | Observatory: Characterizing Embeddings of Relational Tables | 2024 | VLDB | 5.2138566e-05 |
| 10,197 | Qualitative Join Discovery in Data Lakes using Examples | 2026 | SIGMOD | 4.1945683e-05 |
| 10,645 | OpenForge: Probabilistic Metadata Integration | 2025 | VLDB | 4.1945683e-05 |
| 10,754 | OmniMatch: Joinability Discovery in Data Products | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636 |
| 513 | TURL: Table Understanding through Representation Learning | 2021 | VLDB | 0.00021288342 |
| 1,178 | Table Union Search on Open Data | 2018 | VLDB | 0.00013468118 |
| 1,187 | JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes | 2019 | SIGMOD | 0.00013443639 |
| 1,644 | Finding Related Tables in Data Lakes for Interactive Data Science | 2020 | SIGMOD | 0.00011041787 |
| 2,888 | Sato: Contextual Semantic Type Detection in Tables | 2020 | VLDB | 7.9594996e-05 |
| 3,358 | Organizing Data Lakes for Navigation | 2020 | SIGMOD | 7.1784949e-05 |
| 5,486 | Fast Foreign-Key Detection in Microsoft SQL Server PowerPivot for Excel | 2014 | VLDB | 5.4811603e-05 |
| 5,794 | Discovering Related Data At Scale | 2021 | VLDB | 5.3245122e-05 |
| 8,173 | Sigma Workbook: A Spreadsheet for Cloud Data Warehouses | 2022 | VLDB | 4.568186e-05 |
Previous
Page 1 / 1
Next