Database Paper Browser

Back to papers

Scalable Column Concept Determination for Web Tables Using Large Knowledge Bases

Summary: Scalable MapReduce-based column concept determination for web tables via fuzzy matching to a large knowledge base. Proposes knowledge concept aggregation and knowledge entity partition; NP-hardness of optimal strategies; hierarchy-aware heuristics; shows strong annotation quality and scalability. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10649
Venue
VLDB
Year
2013
Pagerank
4.7030914e-05
Overall Rank
7,588 | 47.22%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 19 of 19 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
62 Freebase: A Collaboratively Created Graph Database For Structuring Human Knowledge 2008 SIGMOD 0.0006429466
107 WebTables: Exploring the Power of Tables on the Web 2008 VLDB 0.00048377684
125 Approximate String Joins in a Database (Almost) for Free 2001 VLDB 0.00044847972
155 Robust and Efficient Fuzzy Match for Online Data Cleaning 2003 SIGMOD 0.00040637896
250 Efficient set joins on similarity predicates 2004 SIGMOD 0.00030661988
266 Efficient Exact Set-Similarity Joins 2006 VLDB 0.00029718727
364 Annotating and Searching Web Tables Using Entities, Types and Relationships 2010 VLDB 0.00025637562
420 InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables 2012 SIGMOD 0.00023719065
447 Efficient Parallel Set-Similarity Joins Using MapReduce 2010 SIGMOD 0.00022900171
1,001 Recovering Semantics of Tables on the Web 2011 VLDB 0.00014706505
1,066 Probase: A Probabilistic Taxonomy for Text Understanding 2012 SIGMOD 0.0001433416
1,234 Ed-Join: An Efficient Algorithm for Similarity Joins With Edit Distance Constraints 2008 VLDB 0.00013122499
1,317 Harvesting Relational Tables from Lists on the Web 2009 VLDB 0.00012625853
1,367 Answering Table Queries on the Web using Column Keywords 2012 VLDB 0.00012349783
1,396 Can We Beat the Prefix Filtering? An Adaptive Framework for Similarity Join and Search 2012 SIGMOD 0.00012204748
1,585 Answering Table Augmentation Queries from Unstructured Lists on the Web 2009 VLDB 0.00011255098
1,715 V-SMART-Join: A Scalable MapReduce Framework for All-Pair Similarity Joins of Multisets and Vectors 2012 VLDB 0.00010803271
2,592 Pass-Join: A Partition-based Method for Similarity Joins 2012 VLDB 8.4795761e-05
4,216 Trie-Join: Efficient Trie-based String Similarity Joins with Edit-Distance Constraints 2010 VLDB 6.3521675e-05
Previous Page 1 / 1 Next

Semantically Similar Papers