Querying Data Provenance
Summary: The paper adopts semiring provenance as a general tuple-level model for provenance queries, so that a broad range of analyses can be expressed over it. It proposes a universal provenance query language together with storage, processing, and indexing schemes, and validates them across diverse tasks.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 4307
- Venue: SIGMOD
- Year: 2010
- Pagerank: 9.3676609e-05
- Overall Rank: 2,173 | 84.89%
- DOI: -
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 24 of 24 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 1,106 | Provenance for Aggregate Queries | 2011 | PODS | 0.0001398766 |
| 1,281 | DataHub: Collaborative Data Science & Dataset Version Management at Scale | 2015 | CIDR | 0.00012854744 |
| 2,027 | Titian: Data Provenance Support in Spark | 2016 | VLDB | 9.7437067e-05 |
| 2,028 | Putting Lipstick on Pig: Enabling Database-style Workflow Provenance | 2012 | VLDB | 9.7433981e-05 |
| 2,280 | SMOKE: Fine-grained Lineage at Interactive Speed | 2018 | VLDB | 9.1111033e-05 |
| 2,764 | The Semiring Framework for Database Provenance | 2017 | PODS | 8.1574444e-05 |
| 4,851 | Provenance for Natural Language Queries | 2017 | VLDB | 5.8768322e-05 |
| 5,209 | Explaining Outputs in Modern Data Analytics | 2016 | VLDB | 5.629362e-05 |
| 5,691 | Putting Things into Context: Rich Explanations for Query Answers using Join Graphs | 2021 | SIGMOD | 5.3684557e-05 |
| 5,708 | Lineage-driven Fault Injection | 2015 | SIGMOD | 5.3603939e-05 |
| 5,733 | Explaining Wrong Queries Using Small Examples | 2019 | SIGMOD | 5.3483446e-05 |
| 6,084 | Distributed Provenance Compression | 2017 | SIGMOD | 5.2196728e-05 |
| 6,429 | ShapGraph: An Holistic View of Explanations through Provenance Graphs and Shapley Values | 2022 | SIGMOD | 5.0666822e-05 |
| 6,662 | Selective Provenance for Datalog Programs Using Top-K Queries | 2015 | VLDB | 4.9704872e-05 |
| 6,981 | Dataset Relationship Management | 2019 | CIDR | 4.8743957e-05 |
| 8,230 | You Say 'What', I Hear 'Where' and 'Why' - (Mis-)Interpreting SQL to Derive Fine-Grained Provenance | 2018 | VLDB | 4.5541444e-05 |
| 8,504 | Distributed Time-aware Provenance | 2013 | VLDB | 4.496125e-05 |
| 8,955 | Shedding Light on Opaque Application Queries | 2021 | SIGMOD | 4.4215357e-05 |
| 9,043 | Query-Guided Resolution in Uncertain Databases | 2023 | SIGMOD | 4.4039656e-05 |
| 9,202 | Compact, Tamper-Resistant Archival of Fine-Grained Provenance | 2021 | VLDB | 4.3742967e-05 |
| 10,546 | Evaluating Continuous Queries with Inconsistency Annotations | 2025 | VLDB | 4.1945683e-05 |
| 11,647 | Ariadne: Online Provenance for Big Graph Analytics | 2019 | SIGMOD | 4.1945683e-05 |
| 11,892 | Looking at Everything in Context | 2015 | CIDR | 4.1945683e-05 |
| 13,492 | NetTrails: A Declarative Platform for Maintaining and Querying Provenance in Distributed Systems | 2011 | SIGMOD | - |
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 31 | Provenance Semirings | 2007 | PODS | 0.0007857786 |
| 61 | DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases | 1997 | VLDB | 0.00064329285 |
| 74 | Efficient Query Evaluation on Probabilistic Databases | 2004 | VLDB | 0.00057857292 |
| 101 | ULDBs: Databases with Uncertainty and Lineage | 2006 | VLDB | 0.0004955674 |
| 158 | Automated Selection of Materialized Views and Indexes for SQL Databases | 2000 | VLDB | 0.00040071492 |
| 320 | ObjectRank: Authority-Based Keyword Search in Databases | 2004 | VLDB | 0.00027577867 |
| 415 | A Fast Index for Semistructured Data | 2001 | VLDB | 0.00023814619 |
| 480 | Translating Web Data | 2002 | VLDB | 0.00022191997 |
| 561 | An Annotation Management System for Relational Databases | 2004 | VLDB | 0.00020115419 |
| 689 | Debugging Schema Mappings with Routes | 2006 | VLDB | 0.00018111991 |
| 855 | Integrating Conflicting Data: The Role of Source Dependence | 2009 | VLDB | 0.00015906735 |
| 1,081 | Catching the Boat with Strudel: Experiences with a Web-Site Management System | 1998 | SIGMOD | 0.00014216794 |
| 1,368 | Querying Business Processes | 2006 | VLDB | 0.00012347323 |
| 1,824 | DBNotes: A Post-It System for Relational Databases based on Provenance | 2005 | SIGMOD | 0.00010405194 |
| 1,861 | Efficient Provenance Storage | 2008 | SIGMOD | 0.00010287053 |
| 1,866 | Update Exchange with Mappings and Provenance | 2007 | VLDB | 0.00010272139 |
| 1,923 | Reconciling while Tolerating Disagreement in Collaborative Data Sharing | 2006 | SIGMOD | 0.00010080761 |
| 2,008 | Access Support in Object Bases | 1990 | SIGMOD | 9.8029112e-05 |
| 2,524 | Provenance Management in Curated Databases | 2006 | SIGMOD | 8.6017899e-05 |
| 3,110 | Learning to Create Data-Integrating Queries | 2008 | VLDB | 7.5475982e-05 |
| 5,244 | Cooperative Update Exchange in the Youtopia System | 2009 | VLDB | 5.6069303e-05 |
| 5,270 | Annotated XML: Queries and Provenance | 2008 | PODS | 5.5963545e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 31 | Provenance Semirings | 2007 | PODS | 0.0007857786 |
| 11,471 | On Optimizing the Trade-off between Privacy and Utility in Data Provenance | 2021 | SIGMOD | 4.1945683e-05 |
| 2,892 | Data Provenance at Internet Scale: Architecture, Experiences, and the Road Ahead | 2017 | CIDR | 7.9480559e-05 |
| 6,662 | Selective Provenance for Datalog Programs Using Top-K Queries | 2015 | VLDB | 4.9704872e-05 |
| 8,960 | Computing How-Provenance for SPARQL Queries via Query Rewriting | 2021 | VLDB | 4.4206222e-05 |
| 4,851 | Provenance for Natural Language Queries | 2017 | VLDB | 5.8768322e-05 |
| 9,179 | Equivalence-Invariant Algebraic Provenance for Hyperplane Update Queries | 2020 | SIGMOD | 4.3820222e-05 |
| 652 | On the Provenance of Non-Answers to Queries over Extracted Data | 2008 | VLDB | 0.00018634477 |
| 1,106 | Provenance for Aggregate Queries | 2011 | PODS | 0.0001398766 |
| 6,186 | On Provenance Minimization | 2011 | PODS | 5.166082e-05 |