RAMP: A System for Capturing and Tracing Provenance in MapReduce Workflows
Summary: RAMP extends Hadoop to automatically capture and trace provenance for MapReduce workflows via a wrapper-based approach, preserving parallelism and fault tolerance with minimal user intervention. Demonstrated on a Pig-script Twitter sentiment workflow, it enables drill-down provenance verification of outputs. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Hyunjung Park
- 2. Robert Ikeda
- 3. Jennifer Widom
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,051 | Partial Results in Database Systems | 2014 | SIGMOD | 7.6512591e-05 |
| 3,149 | Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems | 2019 | VLDB | 7.4741595e-05 |
| 5,086 | Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture | 2020 | VLDB | 5.7078462e-05 |
| 8,394 | Hypothetical Reasoning via Provenance Abstraction | 2019 | SIGMOD | 4.527807e-05 |
| 8,504 | Distributed Time-aware Provenance | 2013 | VLDB | 4.496125e-05 |
| 11,619 | Demonstration of Interactive Runtime Debugging of Distributed Dataflows in Texera | 2020 | VLDB | 4.1945683e-05 |
| 11,647 | Ariadne: Online Provenance for Big Graph Analytics | 2019 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 1,440 | Provenance for Generalized Map and Reduce Workflows | 2011 | CIDR | 0.00011961469 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,677 | Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications | 2018 | SIGMOD | 6.0047822e-05 |
| 12,125 | ReStore: Reusing Results of MapReduce Jobs in Pig | 2012 | SIGMOD | 4.1945683e-05 |
| 2,205 | ReStore: Reusing Results of MapReduce Jobs | 2012 | VLDB | 9.2920002e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 2,028 | Putting Lipstick on Pig: Enabling Database-style Workflow Provenance | 2012 | VLDB | 9.7433981e-05 |
| 2,747 | Stubby: A Transformation-based Optimizer for MapReduce Workflows | 2012 | VLDB | 8.1828918e-05 |
| 13,487 | RAFT at Work: Speeding-Up MapReduce Applications under Task and Node Failures | 2011 | SIGMOD | - |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
| 6,943 | TRAMP: Understanding the Behavior of Schema Mappings through Provenance | 2010 | VLDB | 4.8916728e-05 |
| 1,440 | Provenance for Generalized Map and Reduce Workflows | 2011 | CIDR | 0.00011961469 |