Compact, Tamper-Resistant Archival of Fine-Grained Provenance

Summary: Tamper-resistant archival of fine-grained provenance in relational storage. Proposes compression of evolving provenance by exploiting repetition, secured by cryptographic hashes, with empirical evaluation on scientific and OLAP workloads. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 12569
Venue: VLDB
Year: 2021
Pagerank: 4.3701044e-05
Overall Rank: 9,204 | 36.04%
DOI: 10.14778/3436905.3436909

Incoming Non-self Citations Over Time

Authors

1. Nan Zheng
2. Zachary G. Ives

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
9,043	Query-Guided Resolution in Uncertain Databases	2023	SIGMOD	4.3997447e-05
10,429	Unified Lineage System: Tracking Data Provenance at Scale	2025	SIGMOD	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
31	Provenance Semirings	2007	PODS	0.00078516827
132	Integrating Compression and Execution in Column-Oriented Database Systems	2006	SIGMOD	0.00043697853
688	Debugging Schema Mappings with Routes	2006	VLDB	0.00018095639
867	Query Optimization in the Presence of Limited Access Patterns	1999	SIGMOD	0.00015757737
1,106	Provenance for Aggregate Queries	2011	PODS	0.00013976386
1,864	Efficient Provenance Storage	2008	SIGMOD	0.00010277171
1,868	Update Exchange with Mappings and Provenance	2007	VLDB	0.0001026365
2,031	Putting Lipstick on Pig: Enabling Database-style Workflow Provenance	2012	VLDB	9.7341007e-05
2,182	Querying Data Provenance	2010	SIGMOD	9.3596252e-05
2,286	SMOKE: Fine-grained Lineage at Interactive Speed	2018	VLDB	9.102574e-05
3,158	Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems	2019	VLDB	7.466958e-05
4,462	Magellan: Toward Building Entity Matching Management Systems over Data Science Stacks	2016	VLDB	6.1566477e-05
6,088	Distributed Provenance Compression	2017	SIGMOD	5.2146574e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
8,166	Capturing and Querying Fine-grained Provenance of Preprocessing Pipelines in Data Science	2021	VLDB	4.567959e-05
8,392	Hypothetical Reasoning via Provenance Abstraction	2019	SIGMOD	4.5234647e-05
11,474	On Optimizing the Trade-off between Privacy and Utility in Data Provenance	2021	SIGMOD	4.1905499e-05
11,670	Ursprung: Provenance for Large-Scale Analytics Environments	2019	SIGMOD	4.1905499e-05
8,725	OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs	2023	VLDB	4.453957e-05
3,158	Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems	2019	VLDB	7.466958e-05
1,864	Efficient Provenance Storage	2008	SIGMOD	0.00010277171
1,763	Efficient Lineage Tracking For Scientific Workflows	2008	SIGMOD	0.00010626896
6,187	On Provenance Minimization	2011	PODS	5.1611195e-05
2,182	Querying Data Provenance	2010	SIGMOD	9.3596252e-05