Automatic Entity Recognition and Typing in Massive Text Data
Summary: Data-driven, domain-agnostic entity recognition and typing in massive text, automatically identifying spans and assigning fine-grained types. No annotated data or handcrafted features; rapid cross-domain, multilingual adaptation across news, forums, tweets, and biomedical text. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Xiang Ren
- 2. Ahmed El-Kishky
- 3. Heng Ji
- 4. Jiawei Han
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 62 | Freebase: A Collaboratively Created Graph Database For Structuring Human Knowledge | 2008 | SIGMOD | 0.0006429466 |
| 364 | Annotating and Searching Web Tables Using Entities, Types and Relationships | 2010 | VLDB | 0.00025637562 |
| 420 | InfoGather: Entity Augmentation and Attribute Discovery By Holistic Matching with Web Tables | 2012 | SIGMOD | 0.00023719065 |
| 1,066 | Probase: A Probabilistic Taxonomy for Text Understanding | 2012 | SIGMOD | 0.0001433416 |
| 5,431 | Entity Extraction, Linking, Classification, and Tagging for Social Media: A Wikipedia-Based Approach | 2013 | VLDB | 5.5076946e-05 |
| 7,912 | Mining Quality Phrases from Massive Text Corpora | 2015 | SIGMOD | 4.6183486e-05 |
| 11,954 | Scalable Topical Phrase Mining from Text Corpora | 2015 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211 |
| 5,479 | Microblog Entity Linking with Social Temporal Context | 2015 | SIGMOD | 5.4850984e-05 |
| 300 | Deep Learning for Entity Matching: A Design Space Exploration | 2018 | SIGMOD | 0.00028441466 |
| 9,896 | Towards Interpretable and Learnable Risk Analysis for Entity Resolution | 2020 | SIGMOD | 4.2600049e-05 |
| 6,569 | Domain Adaptation for Deep Entity Resolution | 2022 | SIGMOD | 5.0065379e-05 |
| 11,971 | Mining Latent Entity Structures from Massive Unstructured and Interconnected Data | 2014 | SIGMOD | 4.1945683e-05 |
| 9,136 | TextCube: Automated Construction and Multidimensional Exploration | 2019 | VLDB | 4.3881065e-05 |
| 3,578 | Efficient Approximate Entity Extraction with Edit Distance Constraints | 2009 | SIGMOD | 6.9503858e-05 |
| 5,379 | Scalable Ad-hoc Entity Extraction from Text Collections | 2008 | VLDB | 5.5405989e-05 |
| 11,775 | Building Structured Databases of Factual Knowledge from Massive Text Corpora | 2017 | SIGMOD | 4.1945683e-05 |