Database Paper Browser

Back to papers

Which Concepts Are Worth Extracting?

Summary: Introduces cost-effective conceptual design: selecting a budget-limited subset of concepts to annotate to boost query effectiveness. Proposes APM and AAM with provable guarantees (APM PTAS without overlap; constant-factor when concepts are exclusive; AAM PTAS for exclusive concepts); experiments on Wikipedia and query logs validate performance and guide choice. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4867
Venue
SIGMOD
Year
2014
Pagerank
4.1945683e-05
Overall Rank
11,975 | 16.70%
DOI
10.1145/2588555.2610496

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
2,224 The SphereSearch Engine for Unified Ranked Retrieval of Heterogeneous XML and Web Documents 2005 VLDB 9.251962e-05
2,915 Brainwash: A Data System for Feature Engineering 2013 CIDR 7.9078385e-05
3,477 Toward Best-Effort Information Extraction 2008 SIGMOD 7.0583481e-05
3,820 Enterprise Information Extraction: Recent Developments and Open Challenges 2010 SIGMOD 6.7299199e-05
6,586 Web Data Management 2011 SIGMOD 5.0023398e-05
Previous Page 1 / 1 Next

Semantically Similar Papers