Querying Probabilistic Information Extraction
Summary: The paper integrates in-database CRF-based information extraction (Viterbi inference) with query processing over probabilistic data. It presents two strategies: a deterministic maximum-likelihood approach with pushdown into Viterbi, and a probabilistic-worlds approach for top-k answers. Both are evaluated on two datasets. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 658 | Towards a Unified Architecture for in-RDBMS Analytics | 2012 | SIGMOD | 0.00018506577 |
| 4,033 | In-RDBMS Hardware Acceleration of Advanced Analytics | 2018 | VLDB | 0.000065113267 |
| 4,387 | Hybrid In-Database Inference for Declarative Information Extraction | 2011 | SIGMOD | 0.000062320072 |
| 8,765 | Efficient Query Answering in Probabilistic RDF Graphs | 2011 | SIGMOD | 0.00004456315 |
| 10,645 | OpenForge: Probabilistic Metadata Integration | 2025 | VLDB | 0.000041945683 |
| 11,747 | Holistic Query Evaluation over Information Extraction Pipelines | 2018 | VLDB | 0.000041945683 |
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 74 | Efficient Query Evaluation on Probabilistic Databases | 2004 | VLDB | 0.00057857292 |
| 101 | ULDBs: Databases with Uncertainty and Lineage | 2006 | VLDB | 0.0004955674 |
| 287 | Declarative Information Extraction Using Datalog with Embedded Extraction Predicates | 2007 | VLDB | 0.00028971272 |
| 469 | MauveDB: Supporting Model-based User Views in Database Systems | 2006 | SIGMOD | 0.00022406923 |
| 760 | Creating Probabilistic Databases from Information Extraction Models | 2006 | VLDB | 0.00017053935 |
| 980 | BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models | 2008 | VLDB | 0.00014879747 |
| 2,393 | Rank-aware Query Optimization | 2004 | SIGMOD | 0.000089016542 |
| 3,477 | Toward Best-Effort Information Extraction | 2008 | SIGMOD | 0.000070583481 |
| 4,156 | Uncertainty Management in Rule-Based Information Extraction Systems | 2009 | SIGMOD | 0.000063999205 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,631 | On-the-Fly Entity-Aware Query Processing in the Presence of Linkage | 2010 | VLDB | 0.000069014378 |
| 74 | Efficient Query Evaluation on Probabilistic Databases | 2004 | VLDB | 0.00057857292 |
| 11,747 | Holistic Query Evaluation over Information Extraction Pipelines | 2018 | VLDB | 0.000041945683 |
| 1,992 | Probabilistic Ranking of Database Query Results | 2004 | VLDB | 0.000098462684 |
| 7,872 | Probabilistic Database Summarization for Interactive Data Exploration | 2017 | VLDB | 0.000046307184 |
| 4,156 | Uncertainty Management in Rule-Based Information Extraction Systems | 2009 | SIGMOD | 0.000063999205 |
| 3,081 | Knowledge Expansion over Probabilistic Knowledge Bases | 2014 | SIGMOD | 0.000076031501 |
| 4,521 | A Temporal-Probabilistic Database Model for Information Extraction | 2013 | VLDB | 0.000061168322 |
| 760 | Creating Probabilistic Databases from Information Extraction Models | 2006 | VLDB | 0.00017053935 |
| 4,387 | Hybrid In-Database Inference for Declarative Information Extraction | 2011 | SIGMOD | 0.000062320072 |