Hybrid In-Database Inference for Declarative Information Extraction

Summary: Hybrid in-database inference for declarative information extraction. Per-record selection among MCMC variants, Viterbi, and sum-product in a PDB-based IE engine optimizes accuracy versus runtime; reports up to 10× speedups over non-hybrid baselines. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4410
Venue: SIGMOD
Year: 2011
Pagerank: 6.2251088e-05
Overall Rank: 4,384 | 69.54%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 8 of 8 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
139	The MADlib Analytics Library or MAD Skills, the SQL	2012	VLDB	0.00042320525
543	MLbase: A Distributed Machine-learning System	2013	CIDR	0.0002050918
1,165	Simulation of Database-Valued Markov Chains Using SimSQL	2013	SIGMOD	0.00013567206
4,079	Towards High-Throughput Gibbs Sampling at Scale: A Study across Storage Managers	2013	SIGMOD	6.4621024e-05
5,270	Probabilistic Databases with MarkoViews	2012	VLDB	5.5925747e-05
6,197	WADaR: Joint Wrapper and Data Repair	2015	VLDB	5.1570343e-05
8,974	Ontological Pathfinding: Mining First-Order Knowledge from Large Knowledge Bases	2016	SIGMOD	4.4148109e-05
9,429	Database Principles in Information Extraction	2014	PODS	4.3399748e-05

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
74	Efficient Query Evaluation on Probabilistic Databases	2004	VLDB	0.00057797415
103	ULDBs: Databases with Uncertainty and Lineage	2006	VLDB	0.00049520051
289	Declarative Information Extraction Using Datalog with Embedded Extraction Predicates	2007	VLDB	0.00028865317
322	MCDB: A Monte Carlo Approach to Managing Uncertain Data	2008	SIGMOD	0.00027523667
468	MauveDB: Supporting Model-based User Views in Database Systems	2006	SIGMOD	0.00022407392
974	BayesStore: Managing Large, Uncertain Data Repositories with Probabilistic Graphical Models	2008	VLDB	0.00014882804
2,186	Scalable Probabilistic Databases with Factor Graphs and MCMC	2010	VLDB	9.3460817e-05
3,479	Toward Best-Effort Information Extraction	2008	SIGMOD	7.053581e-05
4,156	Uncertainty Management in Rule-Based Information Extraction Systems	2009	SIGMOD	6.3947765e-05
4,960	Querying Probabilistic Information Extraction	2010	VLDB	5.7992639e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
6,790	InferDB: In-Database Machine Learning Inference Using Indexes	2024	VLDB	4.9205968e-05
3,626	On-the-Fly Entity-Aware Query Processing in the Presence of Linkage	2010	VLDB	6.9003552e-05
3,547	Optimizing MPF Queries: Decision Support and Probabilistic Inference	2007	SIGMOD	6.9810773e-05
8,578	Anytime Approximation in Probabilistic Databases via Scaled Dissociations	2019	SIGMOD	4.4879347e-05
2,186	Scalable Probabilistic Databases with Factor Graphs and MCMC	2010	VLDB	9.3460817e-05
74	Efficient Query Evaluation on Probabilistic Databases	2004	VLDB	0.00057797415
3,084	Knowledge Expansion over Probabilistic Knowledge Bases	2014	SIGMOD	7.5967738e-05
757	Creating Probabilistic Databases from Information Extraction Models	2006	VLDB	0.00017062337
4,522	A Temporal-Probabilistic Database Model for Information Extraction	2013	VLDB	6.1109228e-05
4,960	Querying Probabilistic Information Extraction	2010	VLDB	5.7992639e-05