Database Paper Browser

Back to papers

Building Structured Databases of Factual Knowledge from Massive Text Corpora

Summary: Minimally-supervised, domain- and language-independent extraction of entities, relations, and attributes to build StructDBs from text corpora. Demonstrates scalable cross-domain StructDB construction across news, social, biomedical, and business data with reduced labeling, enabling exploration and knowledge discovery. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5339
Venue
SIGMOD
Year
2017
Pagerank
4.1945683e-05
Overall Rank
11,775 | 18.09%
DOI
10.1145/3035918.3054781

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 16 of 16 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
62 Freebase: A Collaboratively Created Graph Database For Structuring Human Knowledge 2008 SIGMOD 0.0006429466
107 WebTables: Exploring the Power of Tables on the Web 2008 VLDB 0.00048377684
364 Annotating and Searching Web Tables Using Entities, Types and Relationships 2010 VLDB 0.00025637562
420 InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables 2012 SIGMOD 0.00023719065
667 Incremental Knowledge Base Construction Using DeepDive 2015 VLDB 0.00018440557
1,066 Probase: A Probabilistic Taxonomy for Text Understanding 2012 SIGMOD 0.0001433416
3,211 Natural Language Question Answering over RDF — A Graph Data Driven Approach 2014 SIGMOD 7.3743561e-05
3,288 Biperpedia: An Ontology for Search Applications 2014 VLDB 7.273034e-05
3,823 Automatic Discovery of Attributes in Relational Databases 2011 SIGMOD 6.7261168e-05
4,106 Extracting Databases from Dark Data with DeepDive 2016 SIGMOD 6.4456184e-05
5,520 Towards the Web of Concepts: Extracting Concepts from Large Datasets 2010 VLDB 5.4614656e-05
7,614 Mining Attribute-structure Correlated Patterns in Large Attributed Graphs 2012 VLDB 4.6947636e-05
7,912 Mining Quality Phrases from Massive Text Corpora 2015 SIGMOD 4.6183486e-05
7,919 DEXTER: Large-Scale Discovery and Extraction of Product Specifications on the Web 2015 VLDB 4.616746e-05
9,423 Database Principles in Information Extraction 2014 PODS 4.3441378e-05
11,954 Scalable Topical Phrase Mining from Text Corpora 2015 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Semantically Similar Papers