Scalable Ad-hoc Entity Extraction from Text Collections
Summary: Introduces ad-hoc entity extraction, in which the target entities come from a task-specific list, so that a task need not process every document in the collection. Proposes an approach that uses the inverted index over the documents to identify the task-relevant subset and processes only those documents, with efficiency gains demonstrated on real datasets. (summarized by gpt-5-nano on Feb 09 2026)
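The pruning idea in the summary can be illustrated with a small sketch. This is not the paper's algorithm, only a minimal illustration under simplifying assumptions: whitespace tokenization, exact (not approximate) matching, and an in-memory index. The function names `build_inverted_index`, `candidate_docs`, and `extract_mentions` are hypothetical and chosen for this example.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the set of ids of documents containing it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def candidate_docs(index, entity):
    """Return ids of documents that contain every token of the entity.

    Containing all tokens is necessary, but not sufficient, for an exact
    mention, so these documents still need to be verified.
    """
    tokens = entity.lower().split()
    if not tokens:
        return set()
    postings = [index.get(token, set()) for token in tokens]
    return set.intersection(*postings)


def extract_mentions(docs, entities):
    """Scan only the candidate documents and keep verified exact mentions."""
    index = build_inverted_index(docs)
    mentions = []
    for entity in entities:
        tokens = entity.lower().split()
        n = len(tokens)
        for doc_id in sorted(candidate_docs(index, entity)):
            doc_tokens = docs[doc_id].lower().split()
            # Verify that the entity tokens occur contiguously and in order.
            if any(doc_tokens[i:i + n] == tokens
                   for i in range(len(doc_tokens) - n + 1)):
                mentions.append((entity, doc_id))
    return mentions


# Toy data, invented for illustration only.
docs = {
    1: "canon eos 350d review",
    2: "the eos of canon",
    3: "nikon d80 sample photos",
}
entities = ["canon eos", "nikon d80"]
mentions = extract_mentions(docs, entities)
```

In this toy run, document 2 is a candidate for "canon eos" because it contains both tokens, but it is rejected during verification; document 3 is never scanned for that entity at all. The saving comes from the index lookup replacing a scan of the whole collection for each entity list.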
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,592 | Pass-Join: A Partition-based Method for Similarity Joins | 2012 | VLDB | 8.4795761e-05 |
| 3,578 | Efficient Approximate Entity Extraction with Edit Distance Constraints | 2009 | SIGMOD | 6.9503858e-05 |
| 4,250 | Local Similarity Search for Unstructured Text | 2016 | SIGMOD | 6.3241139e-05 |
| 4,951 | Mining Document Collections to Facilitate Accurate Approximate Entity Matching | 2009 | VLDB | 5.8100413e-05 |
| 5,073 | Faerie: Efficient Filtering Algorithms for Approximate Dictionary-based Entity Extraction | 2011 | SIGMOD | 5.7177424e-05 |
| 5,455 | Natural Language Data Management and Interfaces: Recent Development and Open Challenges | 2017 | SIGMOD | 5.4977219e-05 |
| 6,580 | Query Portals: Dynamically Generating Portals for Entity-Oriented Web Queries | 2010 | SIGMOD | 5.0034092e-05 |
| 9,049 | JENNER: Just-in-time Enrichment in Query Processing | 2022 | VLDB | 4.4039656e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 250 | Efficient set joins on similarity predicates | 2004 | SIGMOD | 0.00030661988 |
| 266 | Efficient Exact Set-Similarity Joins | 2006 | VLDB | 0.00029718727 |
| 759 | To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks | 2006 | SIGMOD | 0.00017064615 |
| 3,868 | An Efficient Filter for Approximate Membership Checking | 2008 | SIGMOD | 6.6822543e-05 |
| 6,072 | Factorizing Complex Predicates in Queries to Exploit Indexes | 2003 | SIGMOD | 5.2257599e-05 |