InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables
Summary: InfoGather uses holistic matching over web tables to support three operations with high precision: entity augmentation by attribute name, entity augmentation by example, and attribute discovery. Topic-sensitive PageRank lets it match tables indirectly and aggregate values across multiple tables; MapReduce preprocessing delivers near-interactive latency and a four-orders-of-magnitude speedup on 573M web tables. (summarized by gpt-5-nano on Feb 09 2026)
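As a rough illustration of the indirect matching mentioned in the summary, the sketch below runs a personalized (topic-sensitive) PageRank by power iteration over a toy graph of tables. The graph, the edge weights, the table names, the `alpha` teleport probability, and the function `personalized_pagerank` are all assumptions made for this example; this is not the paper's implementation.

```python
def personalized_pagerank(edges, seeds, alpha=0.15, iters=50):
    """Power iteration for PageRank with teleportation restricted to `seeds`.

    edges: dict mapping node -> {neighbor: weight} (hypothetical match scores).
    seeds: set of nodes that receive all teleport mass (here, the query table).
    """
    nodes = set(edges) | {v for nbrs in edges.values() for v in nbrs} | set(seeds)
    teleport = {n: (1.0 / len(seeds) if n in seeds else 0.0) for n in nodes}
    score = dict(teleport)
    for _ in range(iters):
        nxt = {n: alpha * teleport[n] for n in nodes}
        for u in nodes:
            out = edges.get(u, {})
            total = sum(out.values())
            if total == 0:
                # Dangling node: hand its mass back to the seed nodes.
                for n in nodes:
                    nxt[n] += (1 - alpha) * score[u] * teleport[n]
                continue
            for v, w in out.items():
                # Spread mass along edges in proportion to match weight.
                nxt[v] += (1 - alpha) * score[u] * w / total
        score = nxt
    return score


# Toy graph (assumed): the query table matches A directly, A matches B,
# and C matches nothing. B is reachable from the query only through A.
edges = {
    "query": {"A": 0.9},
    "A": {"query": 0.9, "B": 0.8},
    "B": {"A": 0.8},
    "C": {},
}
scores = personalized_pagerank(edges, seeds={"query"})
for table in ("A", "B", "C"):
    print(table, round(scores[table], 4))
```

In this toy setup, table B receives a nonzero score even though it has no direct edge to the query table, while the unconnected table C receives none. That is the sense in which a PageRank-style walk can surface indirectly matching tables.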
Incoming Citations (Sorted by Pagerank)
Showing 48 of 48 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 155 | Robust and Efficient Fuzzy Match for Online Data Cleaning | 2003 | SIGMOD | 0.00040637896 |
| 208 | Reconciling Schemas of Disparate Data Sources: A Machine-Learning Approach | 2001 | SIGMOD | 0.0003460594 |
| 303 | Generic Schema Matching with Cupid | 2001 | VLDB | 0.00028301477 |
| 364 | Annotating and Searching Web Tables Using Entities, Types and Relationships | 2010 | VLDB | 0.00025637562 |
| 518 | Data Integration for the Relational Web | 2009 | VLDB | 0.00021158934 |
| 886 | Fast Personalized PageRank on MapReduce | 2011 | SIGMOD | 0.00015597161 |
| 902 | Statistical Schema Matching across Web Query Interfaces | 2003 | SIGMOD | 0.00015486247 |
| 1,001 | Recovering Semantics of Tables on the Web | 2011 | VLDB | 0.00014706505 |
| 1,527 | Generic Schema Matching, Ten Years Later | 2011 | VLDB | 0.00011499442 |
| 1,585 | Answering Table Augmentation Queries from Unstructured Lists on the Web | 2009 | VLDB | 0.00011255098 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 107 | WebTables: Exploring the Power of Tables on the Web | 2008 | VLDB | 0.00048377684 |
| 902 | Statistical Schema Matching across Web Query Interfaces | 2003 | SIGMOD | 0.00015486247 |
| 8,696 | Effective Entity Augmentation By Querying External Data Sources | 2023 | VLDB | 0.000044660032 |
| 1,140 | EntityRank: Searching Entities Directly and Holistically | 2007 | VLDB | 0.00013720706 |
| 7,588 | Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases | 2013 | VLDB | 0.000047030914 |
| 1,001 | Recovering Semantics of Tables on the Web | 2011 | VLDB | 0.00014706505 |
| 364 | Annotating and Searching Web Tables Using Entities, Types and Relationships | 2010 | VLDB | 0.00025637562 |
| 1,585 | Answering Table Augmentation Queries from Unstructured Lists on the Web | 2009 | VLDB | 0.00011255098 |
| 1,367 | Answering Table Queries on the Web using Column Keywords | 2012 | VLDB | 0.00012349783 |
| 3,229 | InfoGather+: Semantic Matching and Annotation of Numeric and Time-Varying Attributes in Web Tables | 2013 | SIGMOD | 0.000073393682 |