On Provenance Minimization
Summary: Defines the "core provenance"—the component of N[X]-provenance present in every query equivalent to a given query—and proves it is compact and captures the inherent computational structure. Provides algorithms to rewrite queries to realize the core and to compute tuple-level core provenance directly from arbitrary evaluations without rewriting. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Yael Amsterdamer
- 2. Daniel Deutch
- 3. Tova Milo
- 4. Val Tannen
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,764 | The Semiring Framework for Database Provenance | 2017 | PODS | 8.1574444e-05 |
| 6,084 | Distributed Provenance Compression | 2017 | SIGMOD | 5.2196728e-05 |
| 8,508 | Minimally Factorizing the Provenance of Self-join Free Conjunctive Queries | 2024 | PODS | 4.4952414e-05 |
| 9,921 | ProvCite: Provenance-based Data Citation | 2019 | VLDB | 4.2549509e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 31 | Provenance Semirings | 2007 | PODS | 0.0007857786 |
| 494 | Data Exchange: Getting to the Core | 2003 | PODS | 0.00021805832 |
| 561 | An Annotation Management System for Relational Databases | 2004 | VLDB | 0.00020115419 |
| 1,119 | The Complexity of Causality and Responsibility for Query Answers and non-Answers | 2011 | VLDB | 0.0001386199 |
| 1,490 | On the Decidability of Query Containment under Constraints | 1998 | PODS | 0.00011699154 |
| 1,861 | Efficient Provenance Storage | 2008 | SIGMOD | 0.00010287053 |
| 1,866 | Update Exchange with Mappings and Provenance | 2007 | VLDB | 0.00010272139 |
| 2,479 | Efficient Query Reformulation in Peer Data Management Systems | 2004 | SIGMOD | 8.6909119e-05 |
| 3,584 | Efficient Querying and Maintenance of Network Provenance at Internet-Scale | 2010 | SIGMOD | 6.9460423e-05 |
| 3,937 | On Reconciling Data Exchange, Data Integration, and Peer Data Management | 2007 | PODS | 6.6159574e-05 |
| 5,195 | Equivalence of Queries Combining Set and Bag-Set Semantics | 2006 | PODS | 5.6366303e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,729 | OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs | 2023 | VLDB | 4.4582221e-05 |
| 10,910 | Postulates for Provenance: Instance-based provenance for first-order logic | 2024 | PODS | 4.1945683e-05 |
| 652 | On the Provenance of Non-Answers to Queries over Extracted Data | 2008 | VLDB | 0.00018634477 |
| 8,960 | Computing How-Provenance for SPARQL Queries via Query Rewriting | 2021 | VLDB | 4.4206222e-05 |
| 655 | On Propagation of Deletions and Annotations Through Views | 2002 | PODS | 0.00018608845 |
| 8,394 | Hypothetical Reasoning via Provenance Abstraction | 2019 | SIGMOD | 4.527807e-05 |
| 9,179 | Equivalence-Invariant Algebraic Provenance for Hyperplane Update Queries | 2020 | SIGMOD | 4.3820222e-05 |
| 11,471 | On Optimizing the Trade-off between Privacy and Utility in Data Provenance | 2021 | SIGMOD | 4.1945683e-05 |
| 1,106 | Provenance for Aggregate Queries | 2011 | PODS | 0.0001398766 |
| 2,173 | Querying Data Provenance | 2010 | SIGMOD | 9.3676609e-05 |