Knowledge Harvesting in the Big-Data Era
Summary: A survey of scalable knowledge harvesting from Web and text sources for large knowledge bases (DBpedia, Freebase, YAGO), stressing distributed extraction and entity-centric analytics. It highlights the expansion of entities and facts, multilingual and temporal coverage, commonsense knowledge, disambiguation, and cross-source sameAs linking, and it outlines the resulting data-management challenges. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,508 | Knowledge Bases in the Age of Big Data Analytics | 2014 | VLDB | 4.7180617e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,971 | Mining Latent Entity Structures from Massive Unstructured and Interconnected Data | 2014 | SIGMOD | 4.1945683e-05 |
| 11,844 | Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale | 2016 | SIGMOD | 4.1945683e-05 |
| 9,423 | Database Principles in Information Extraction | 2014 | PODS | 4.3441378e-05 |
| 2,420 | From Data Fusion to Knowledge Fusion | 2014 | VLDB | 8.8530994e-05 |
| 5,538 | Growing and Serving Large Open-domain Knowledge Graphs | 2023 | SIGMOD | 5.4509524e-05 |
| 11,775 | Building Structured Databases of Factual Knowledge from Massive Text Corpora | 2017 | SIGMOD | 4.1945683e-05 |
| 4,630 | Knowledge Graphs 2021: A Data Odyssey | 2021 | VLDB | 6.0348379e-05 |
| 11,906 | Knowledge Curation and Knowledge Fusion: Challenges, Models, and Applications | 2015 | SIGMOD | 4.1945683e-05 |
| 7,508 | Knowledge Bases in the Age of Big Data Analytics | 2014 | VLDB | 4.7180617e-05 |
| 5,652 | From Information to Knowledge: Harvesting Entities and Relationships from Web Sources | 2010 | PODS | 5.3903671e-05 |