Database Paper Browser

Scalable Ad-hoc Entity Extraction from Text Collections

Summary: Introduces ad-hoc entity extraction where target entities come from a task-specific list, avoiding full-document processing. Proposes an inverted-index-driven pruning approach that identifies and processes only task-relevant documents, with empirical gains on real data. (summarized by gpt-5-nano on Feb 09 2026)
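As a rough illustration of the pruning idea in the summary, the sketch below uses an inverted index to select only documents that could mention a listed entity, then verifies mentions in those documents alone. It is not the paper's algorithm: whitespace tokenization, exact phrase matching, and the names build_inverted_index, candidate_docs, and extract are all assumptions made for this example.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each lowercase token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.lower().split():
            index[token].add(doc_id)
    return index


def candidate_docs(index, entity):
    """Return ids of documents containing every token of the entity.

    This is a necessary (not sufficient) condition for an exact mention,
    so every other document can be skipped without being read.
    """
    tokens = entity.lower().split()
    if not tokens:
        return set()
    postings = [index.get(token, set()) for token in tokens]
    return set.intersection(*postings)


def extract(docs, entities):
    """Find (entity, doc_id) pairs, scanning only the candidate documents."""
    index = build_inverted_index(docs)
    mentions = []
    for entity in entities:
        tokens = entity.lower().split()
        n = len(tokens)
        for doc_id in sorted(candidate_docs(index, entity)):
            doc_tokens = docs[doc_id].lower().split()
            # Verification step: the entity tokens must appear contiguously.
            if any(doc_tokens[i:i + n] == tokens
                   for i in range(len(doc_tokens) - n + 1)):
                mentions.append((entity, doc_id))
    return mentions


# Hypothetical toy collection, for illustration only.
docs = {
    1: "Apple released the iPhone",
    2: "the phone rang",
    3: "iPhone the sales",
}
mentions = extract(docs, ["the iPhone"])
```

Here document 2 is never scanned for the entity because it lacks one of its tokens, and document 3 is a candidate that the verification step rejects. A realistic system would also need approximate matching and a cost-aware choice of which index lookups to perform, which this sketch leaves out.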

Paper ID: 9727
Venue: VLDB
Year: 2008
Pagerank: 5.5405989e-05
Overall Rank: 5,379 | 62.59%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 8 of 8 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
250 | Efficient set joins on similarity predicates | 2004 | SIGMOD | 0.00030661988
266 | Efficient Exact Set-Similarity Joins | 2006 | VLDB | 0.00029718727
759 | To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks | 2006 | SIGMOD | 0.00017064615
3,868 | An Efficient Filter for Approximate Membership Checking | 2008 | SIGMOD | 6.6822543e-05
6,072 | Factorizing Complex Predicates in Queries to Exploit Indexes | 2003 | SIGMOD | 5.2257599e-05

Semantically Similar Papers