Database Paper Browser

Back to papers

Annotating and Searching Web Tables Using Entities, Types and Relationships

Summary: Annotates web tables with entities, types, and relations using a joint graphical model to label cells and columns simultaneously. Demonstrates gains in relational Web search over text-only indexing on 25M+ HTML tables using YAGO/DBpedia/Wikipedia. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10067
Venue
VLDB
Year
2010
Pagerank
0.00025637562
Overall Rank
364 | 97.47%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 39 of 39 citing papers.

Rank Citing Paper Year Venue Pagerank
420 InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables 2012 SIGMOD 0.00023719065
513 TURL: Table Understanding through Representation Learning 2021 VLDB 0.00021288342
818 Finding Related Tables 2012 SIGMOD 0.00016311524
1,001 Recovering Semantics of Tables on the Web 2011 VLDB 0.00014706505
1,178 Table Union Search on Open Data 2018 VLDB 0.00013468118
1,546 KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing 2015 SIGMOD 0.00011446851
2,141 LSH Ensemble: Internet-Scale Domain Search 2016 VLDB 9.4542625e-05
2,517 Annotating Columns with Pre-trained Language Models 2022 SIGMOD 8.6092139e-05
2,633 Schema Extraction for Tabular Data on the Web 2013 VLDB 8.4063569e-05
2,836 Semantics-aware Dataset Discovery from Data Lakes with Contextualized Column-based Representation Learning 2023 VLDB 8.0443826e-05
2,888 Sato: Contextual Semantic Type Detection in Tables 2020 VLDB 7.9594996e-05
3,000 SANTOS: Relationship-based Semantic Table Union Search 2023 SIGMOD 7.7462128e-05
3,155 Ten Years of WebTables 2018 VLDB 7.4672742e-05
3,229 InfoGather+: Semantic Matching and Annotation of Numeric and Time-Varying Attributes in Web Tables 2013 SIGMOD 7.3393682e-05
3,288 Biperpedia: An Ontology for Search Applications 2014 VLDB 7.273034e-05
3,358 Organizing Data Lakes for Navigation 2020 SIGMOD 7.1784949e-05
3,690 Navigating the Data Lake with DATAMARAN: Automatically Extracting Structure from Log Datasets 2018 SIGMOD 6.8384476e-05
3,797 Stitching Web Tables for Improving Matching Quality 2017 VLDB 6.7597149e-05
4,630 Knowledge Graphs 2021: A Data Odyssey 2021 VLDB 6.0348379e-05
4,838 Finding Patterns in a Knowledge Base using Keywords to Compose Table Answers 2014 VLDB 5.8887949e-05
4,859 Integrating Data Lake Tables 2023 VLDB 5.8732433e-05
5,729 KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing 2015 VLDB 5.3506368e-05
6,586 Web Data Management 2011 SIGMOD 5.0023398e-05
7,588 Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases 2013 VLDB 4.7030914e-05
8,787 QuTE: Answering Quantity Queries from Web Tables 2021 SIGMOD 4.4520613e-05
8,849 SourceSight: Enabling Effective Source Selection 2016 SIGMOD 4.4369118e-05
8,852 Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation 2023 SIGMOD 4.4356508e-05
9,014 Knowledge Exploration Using Tables on the Web 2017 VLDB 4.4095176e-05
9,251 Joint Open Knowledge Base Canonicalization and Linking 2021 SIGMOD 4.3690661e-05
10,512 Auto-Test: Learning Semantic-Domain Constraints for Unsupervised Error Detection in Tables 2025 SIGMOD 4.1945683e-05
10,753 Cents: A Flexible and Cost-Effective Framework for LLM-Based Table Understanding 2025 VLDB 4.1945683e-05
10,951 Determining the Largest Overlap between Tables 2024 SIGMOD 4.1945683e-05
11,775 Building Structured Databases of Factual Knowledge from Massive Text Corpora 2017 SIGMOD 4.1945683e-05
11,847 Automatic Entity Recognition and Typing in Massive Text Data 2016 SIGMOD 4.1945683e-05
11,895 Finding Quality in Quantity: The Challenge of Discovering Valuable Sources for Integration 2015 CIDR 4.1945683e-05
11,930 ConfSeer: Leveraging Customer Support Knowledge Bases for Automated Misconfiguration Detection 2015 VLDB 4.1945683e-05
11,971 Mining Latent Entity Structures from Massive Unstructured and Interconnected Data 2014 SIGMOD 4.1945683e-05
12,044 Knowledge Harvesting in the Big-Data Era 2013 SIGMOD 4.1945683e-05
12,201 AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables 2011 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
107 WebTables: Exploring the Power of Tables on the Web 2008 VLDB 0.00048377684
1,140 EntityRank: Searching Entities Directly and Holistically 2007 VLDB 0.00013720706
1,585 Answering Table Augmentation Queries from Unstructured Lists on the Web 2009 VLDB 0.00011255098
Previous Page 1 / 1 Next

Semantically Similar Papers