Database Paper Browser

Declarative Information Extraction Using Datalog with Embedded Extraction Predicates

Summary: Datalog with embedded extraction predicates enables modular, declarative information extraction (IE) and outperforms ad hoc Perl/C++ programs. It also enables statistics-driven optimization of IE programs, addressing text-specific challenges and showing empirical gains. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 9513
Venue: VLDB
Year: 2007
Pagerank: 0.00028971272
Overall Rank: 287 | 98.01%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 36 of 36 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
652 | On the Provenance of Non-Answers to Queries over Extracted Data | 2008 | VLDB | 0.00018634477
667 | Incremental Knowledge Base Construction Using DeepDive | 2015 | VLDB | 0.00018440557
1,722 | Building Structured Web Community Portals: A Top-Down, Compositional, and Incremental Approach | 2007 | VLDB | 0.00010757784
1,938 | Split-Correctness in Information Extraction | 2019 | PODS | 0.00010028895
2,450 | Functional Dependencies for Graphs | 2016 | SIGMOD | 8.7882979e-05
2,929 | Complexity Bounds for Relational Algebra over Document Spanners | 2019 | PODS | 7.8800307e-05
2,984 | Efficiently Incorporating User Feedback into Information Extraction and Integration Programs | 2009 | SIGMOD | 7.7796344e-05
3,069 | Evita Raced: Metacompilation for Declarative Networks | 2008 | VLDB | 7.6151182e-05
3,477 | Toward Best-Effort Information Extraction | 2008 | SIGMOD | 7.0583481e-05
4,106 | Extracting Databases from Dark Data with DeepDive | 2016 | SIGMOD | 6.4456184e-05
4,156 | Uncertainty Management in Rule-Based Information Extraction Systems | 2009 | SIGMOD | 6.3999205e-05
4,164 | SlimShot: In-Database Probabilistic Inference for Knowledge Bases | 2016 | VLDB | 6.3923099e-05
4,387 | Hybrid In-Database Inference for Declarative Information Extraction | 2011 | SIGMOD | 6.2320072e-05
4,983 | Querying Probabilistic Information Extraction | 2010 | VLDB | 5.7870787e-05
5,398 | Cleaning Inconsistencies in Information Extraction via Prioritized Repairs | 2014 | PODS | 5.5295577e-05
5,412 | Mining an "Anti-Knowledge Base" from Wikipedia Updates with Applications to Fact Checking and Beyond | 2020 | VLDB | 5.5207515e-05
5,620 | Datalog and Emerging Applications: An Interactive Tutorial | 2011 | SIGMOD | 5.407079e-05
5,652 | From Information to Knowledge: Harvesting Entities and Relationships from Web Sources | 2010 | PODS | 5.3903671e-05
6,111 | Why Big Data Industrial Systems Need Rules and What We Can Do About It | 2015 | SIGMOD | 5.2049579e-05
6,254 | More Efficient Datalog Queries: Subsumptive Tabling Beats Magic Sets | 2011 | SIGMOD | 5.1368042e-05
6,534 | Automatic Rule Refinement for Information Extraction | 2010 | VLDB | 5.0244622e-05
6,846 | A framework for annotating CSV-like data | 2016 | VLDB | 4.9092462e-05
8,148 | When Speed Has a Price: Fast Information Extraction Using Approximate Algorithms | 2013 | VLDB | 4.5754467e-05
8,204 | ELEET: Efficient Learned Query Execution over Text and Tables | 2024 | VLDB | 4.5594273e-05
8,603 | OXPath: A Language for Scalable, Memory-efficient Data Extraction from Web Applications | 2011 | VLDB | 4.4866461e-05
8,613 | Synthesizing Extraction Rules from User Examples with SEER | 2017 | SIGMOD | 4.4849545e-05
9,376 | Versatile Optimization of UDF-heavy Data Flows with Sofa | 2014 | SIGMOD | 4.347376e-05
9,423 | Database Principles in Information Extraction | 2014 | PODS | 4.3441378e-05
9,593 | Expressiveness within Sequence Datalog | 2021 | PODS | 4.3202988e-05
9,635 | Optimizing Complex Extraction Programs over Evolving Text Data | 2009 | SIGMOD | 4.3118125e-05
11,240 | Autonomously Computable Information Extraction | 2023 | VLDB | 4.1945683e-05
11,391 | Blueprint: A Constraint-solving Approach For Document Extraction | 2022 | VLDB | 4.1945683e-05
11,755 | Scalable Semantic Querying of Text | 2018 | VLDB | 4.1945683e-05
11,844 | Potential and Pitfalls of Domain-Specific Information Extraction at Web Scale | 2016 | SIGMOD | 4.1945683e-05
12,052 | Provenance-based Dictionary Refinement in Information Extraction | 2013 | SIGMOD | 4.1945683e-05
12,115 | Just-in-Time Information Extraction using Extraction Views | 2012 | SIGMOD | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
