Building Structured Databases of Factual Knowledge from Massive Text Corpora
Summary: Minimally-supervised, domain- and language-independent extraction of entities, relations, and attributes to build StructDBs from text corpora. Demonstrates scalable cross-domain StructDB construction across news, social, biomedical, and business data with reduced labeling, enabling exploration and knowledge discovery. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Xiang Ren
- 2. Meng Jiang
- 3. Jingbo Shang
- 4. Jiawei Han
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,626 | Managing Information Extraction [Tutorial Outline] | 2006 | SIGMOD | - |
| 7,912 | Mining Quality Phrases from Massive Text Corpora | 2015 | SIGMOD | 4.6183486e-05 |
| 10,973 | Unstructured Data Fusion for Schema and Data Extraction | 2024 | SIGMOD | 4.1945683e-05 |
| 12,044 | Knowledge Harvesting in the Big-Data Era | 2013 | SIGMOD | 4.1945683e-05 |
| 5,379 | Scalable Ad-hoc Entity Extraction from Text Collections | 2008 | VLDB | 5.5405989e-05 |
| 5,529 | Data-Driven Domain Discovery for Structured Datasets | 2020 | VLDB | 5.4566641e-05 |
| 11,844 | Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale | 2016 | SIGMOD | 4.1945683e-05 |
| 11,971 | Mining Latent Entity Structures from Massive Unstructured and Interconnected Data | 2014 | SIGMOD | 4.1945683e-05 |
| 11,847 | Automatic Entity Recognition and Typing in Massive Text Data | 2016 | SIGMOD | 4.1945683e-05 |
| 9,136 | TextCube: Automated Construction and Multidimensional Exploration | 2019 | VLDB | 4.3881065e-05 |