Database Paper Browser

Back to papers

Finding Related Tables

Summary: Framework capturing diverse relatedness among heterogeneous tables, including joinable and unionable candidates. Algorithms detect joinable/unionable related tables; scalable to over a million Wikipedia tables, enabling reuse and improved search. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4606
Venue
SIGMOD
Year
2012
Pagerank
0.00016311524
Overall Rank
818 | 94.32%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 33 of 33 citing papers.

Rank Citing Paper Year Venue Pagerank
513 TURL: Table Understanding through Representation Learning 2021 VLDB 0.00021288342
939 Data Lake Management: Challenges and Opportunities 2019 VLDB 0.00015187344
1,178 Table Union Search on Open Data 2018 VLDB 0.00013468118
1,187 JOSIE: Overlap Set Similarity Search for Finding Joinable Tables in Data Lakes 2019 SIGMOD 0.00013443639
2,104 Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets 2016 SIGMOD 9.536298e-05
2,498 Support the Data Enthusiast: Challenges for Next-Generation Data-Analysis Systems 2014 VLDB 8.6465331e-05
2,633 Schema Extraction for Tabular Data on the Web 2013 VLDB 8.4063569e-05
2,730 Open Data Integration 2018 VLDB 8.2126735e-05
2,836 Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning 2023 VLDB 8.0443826e-05
3,000 SANTOS: Relationship-based Semantic Table Union Search 2023 SIGMOD 7.7462128e-05
3,155 Ten Years of WebTables 2018 VLDB 7.4672742e-05
3,229 InfoGather+: Semantic Matching and Annotation of Numeric and Time-Varying Attributes in Web Tables 2013 SIGMOD 7.3393682e-05
3,358 Organizing Data Lakes for Navigation 2020 SIGMOD 7.1784949e-05
3,690 Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets 2018 SIGMOD 6.8384476e-05
3,797 Stitching Web Tables for Improving Matching Quality 2017 VLDB 6.7597149e-05
4,859 Integrating Data Lake Tables 2023 VLDB 5.8732433e-05
5,179 SilkMoth: An Efficient Method for Finding Related Sets with Maximum Matching Constraints 2017 VLDB 5.6428428e-05
5,691 Putting Things into Context: Rich Explanations for Query Answers using Join Graphs 2021 SIGMOD 5.3684557e-05
5,789 Interactive Navigation of Open Data Linkages 2017 VLDB 5.3269741e-05
5,937 DataXFormer: Leveraging the Web for Semantic Transformations 2015 CIDR 5.2650964e-05
6,270 MATE: Multi-Attribute Table Extraction 2022 VLDB 5.1337451e-05
7,745 Crossing the finish line faster when paddling the Data Lake with KAYAK 2017 VLDB 4.6618625e-05
7,919 DEXTER: Large-Scale Discovery and Extraction of Product Specifications on the Web 2015 VLDB 4.616746e-05
8,116 LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes 2024 VLDB 4.581507e-05
8,678 Progressive Deep Web Crawling Through Keyword Queries For Data Enrichment 2019 SIGMOD 4.4702119e-05
8,685 NLC: Search Correlated Window Pairs on Long Time Series 2022 VLDB 4.4675898e-05
8,849 SourceSight: Enabling Effective Source Selection 2016 SIGMOD 4.4369118e-05
9,992 Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First 2026 CIDR 4.1945683e-05
10,510 Table Overlap Estimation through Graph Embeddings 2025 SIGMOD 4.1945683e-05
10,685 LakeVisage: Towards Scalable, Flexible and Interactive Visualization Recommendation for Data Discovery over Data Lakes 2025 VLDB 4.1945683e-05
10,836 Data Discovery in Data Lakes: Operations, Indexes, Systems 2025 VLDB 4.1945683e-05
10,951 Determining the Largest Overlap between Tables 2024 SIGMOD 4.1945683e-05
11,895 Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration 2015 CIDR 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 12 of 12 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
8,499 Synthesizing Mapping Relationships Using Table Corpus 2017 SIGMOD 4.4975851e-05
1,317 Harvesting Relational Tables from Lists on the Web 2009 VLDB 0.00012625853
1,510 Summarizing Relational Databases 2009 VLDB 0.00011606901
3,824 Correlation Sketches for Approximate Join-Correlation Queries 2021 SIGMOD 6.7260705e-05
1,367 Answering Table Queries on the Web using Column Keywords 2012 VLDB 0.00012349783
5,794 Discovering Related Data At Scale 2021 VLDB 5.3245122e-05
1,001 Recovering Semantics of Tables on the Web 2011 VLDB 0.00014706505
1,178 Table Union Search on Open Data 2018 VLDB 0.00013468118
107 WebTables: Exploring the Power of Tables on the Web 2008 VLDB 0.00048377684
364 Annotating and Searching Web Tables Using Entities, Types and Relationships 2010 VLDB 0.00025637562