Database Paper Browser

Provenance for Generalized Map and Reduce Workflows

Summary: Proposes a formal provenance model for generalized map and reduce workflows (DAGs of map and reduce functions) that supports recursive composition and both backward and forward tracing between inputs and outputs. Implements transparent, wrapper-based provenance capture in Hadoop that preserves parallelism and fault tolerance, and reports the prototype's performance. (summarized by gpt-5-mini on Feb 09 2026)
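
To make the idea of wrapper-based capture with backward and forward tracing concrete, here is a minimal in-memory sketch in Python. It is not the paper's Hadoop implementation: the function names (`run_map`, `run_reduce`, `trace_backward`, `trace_forward`), the ID scheme, and the word-count workflow are all assumptions chosen for illustration. Each wrapper runs an unmodified user function and records, for every output element, the IDs of the input elements that produced it.

```python
from collections import defaultdict

def run_map(map_fn, inputs):
    """Wrap a user map function: run it per record and record one-to-one lineage."""
    outputs, lineage = [], {}
    for in_id, record in inputs:
        for k, (key, value) in enumerate(map_fn(record)):
            out_id = f"{in_id}/m{k}"          # hypothetical ID scheme
            outputs.append((out_id, (key, value)))
            lineage[out_id] = {in_id}         # a map output depends on one input
    return outputs, lineage

def run_reduce(reduce_fn, inputs):
    """Wrap a user reduce function: group by key and record many-to-one lineage."""
    groups = defaultdict(list)
    for in_id, (key, value) in inputs:
        groups[key].append((in_id, value))
    outputs, lineage = [], {}
    for key in sorted(groups):
        members = groups[key]
        out_id = f"r:{key}"                   # hypothetical ID scheme
        result = reduce_fn(key, [v for _, v in members])
        outputs.append((out_id, (key, result)))
        lineage[out_id] = {i for i, _ in members}  # depends on the whole group
    return outputs, lineage

def trace_backward(out_id, lineages_last_to_first):
    """Follow lineage from one output element back to the workflow inputs."""
    frontier = {out_id}
    for lineage in lineages_last_to_first:
        frontier = set().union(*(lineage[i] for i in frontier))
    return frontier

def trace_forward(in_id, lineages_first_to_last):
    """Follow lineage from one input element forward to the workflow outputs."""
    frontier = {in_id}
    for lineage in lineages_first_to_last:
        frontier = {out for out, sources in lineage.items() if sources & frontier}
    return frontier

# Toy two-stage workflow (word count); the data is made up for illustration.
docs = [("d1", "a b"), ("d2", "b c")]
map_out, map_lin = run_map(lambda line: [(w, 1) for w in line.split()], docs)
reduce_out, reduce_lin = run_reduce(lambda key, values: sum(values), map_out)

print(reduce_out)
print(sorted(trace_backward("r:b", [reduce_lin, map_lin])))
print(sorted(trace_forward("d1", [map_lin, reduce_lin])))
```

In this sketch the user functions never see the provenance IDs, which is the sense in which capture is transparent here; a distributed system would also need to persist the lineage tables, which this toy version keeps in memory.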

Paper ID: 170
Venue: CIDR
Year: 2011
Pagerank: 0.00011961469
Overall Rank: 1,440 | 89.99%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 18 of 18 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
1,308 | Upper and Lower Bounds on the Cost of a Map-Reduce Computation | 2013 | VLDB | 0.00012661651
2,027 | Titian: Data Provenance Support in Spark | 2016 | VLDB | 9.7437067e-05
2,028 | Putting Lipstick on Pig: Enabling Database-style Workflow Provenance | 2012 | VLDB | 9.7433981e-05
2,280 | SMOKE: Fine-grained Lineage at Interactive Speed | 2018 | VLDB | 9.1111033e-05
3,149 | Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems | 2019 | VLDB | 7.4741595e-05
3,700 | RAMP: A System for Capturing and Tracing Provenance in MapReduce Workflows | 2011 | VLDB | 6.8307955e-05
4,774 | LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems | 2021 | SIGMOD | 5.9316087e-05
5,209 | Explaining Outputs in Modern Data Analytics | 2016 | VLDB | 5.629362e-05
5,364 | A Quest for Beauty and Wealth (or, Business Processes for Database Researchers) | 2011 | PODS | 5.5461492e-05
6,981 | Dataset Relationship Management | 2019 | CIDR | 4.8743957e-05
7,678 | To Not Miss the Forest for the Trees - A Holistic Approach for Explaining Missing Answers over Nested Data | 2021 | SIGMOD | 4.6813062e-05
7,857 | Fixed It For You: Protocol Repair Using Lineage Graphs | 2019 | CIDR | 4.6345517e-05
8,394 | Hypothetical Reasoning via Provenance Abstraction | 2019 | SIGMOD | 4.527807e-05
8,504 | Distributed Time-aware Provenance | 2013 | VLDB | 4.496125e-05
9,059 | Tracking Personal Data Use: Provenance And Trust | 2015 | CIDR | 4.4039656e-05
11,662 | Capturing and Querying Structural Provenance in Spark with Pebble | 2019 | SIGMOD | 4.1945683e-05
11,710 | Demonstration of Smoke: A Deep Breath of Data-Intensive Lineage Applications | 2018 | SIGMOD | 4.1945683e-05
11,798 | Privacy-Preserving Network Provenance | 2017 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers