Efficient Lineage Tracking For Scientific Workflows
Summary: Introduces an interval-based, space- and query-efficient representation for data lineage graphs from scientific workflows, avoiding recursive storage. Transforms any workflow processes into compact dependency graphs and offers analysis plus evaluation. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Thomas Heinis
- 2. Gustavo Alonso
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 697 | Human-Assisted Graph Search: It’s Okay to Ask Questions | 2011 | VLDB | 0.00018043655 |
| 1,440 | Provenance for Generalized Map and Reduce Workflows | 2011 | CIDR | 0.00011961469 |
| 2,027 | Titian: Data Provenance Support in Spark | 2016 | VLDB | 9.7437067e-05 |
| 5,802 | An Optimal Labeling Scheme for Workflow Provenance Using Skeleton Labels | 2010 | SIGMOD | 5.3209459e-05 |
| 6,525 | Database Technology for the Masses: Sub-Operators as First-Class Entities | 2021 | VLDB | 5.027205e-05 |
| 6,533 | Labeling Workflow Views with Fine-Grained Dependencies | 2012 | VLDB | 5.0245193e-05 |
| 7,370 | Detecting and Resolving Unsound Workflow Views for Correct Provenance Analysis | 2009 | SIGMOD | 4.7500735e-05 |
| 7,561 | Efficient Recovery of Missing Events | 2013 | VLDB | 4.7102455e-05 |
| 8,054 | Labeling Recursive Workflow Executions On-the-Fly | 2011 | SIGMOD | 4.5947587e-05 |
| 10,419 | Unified Lineage System: Tracking Data Provenance at Scale | 2025 | SIGMOD | 4.1945683e-05 |
| 12,282 | Searching Workflows with Hierarchical Views | 2010 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 299 | Trio: A System for Data, Uncertainty, and Lineage | 2006 | VLDB | 0.00028525071 |
| 561 | An Annotation Management System for Relational Databases | 2004 | VLDB | 0.00020115419 |
| 1,149 | A Comprehensive XQuery to SQL Translation using Dynamic Interval Encoding | 2003 | SIGMOD | 0.0001365931 |
| 2,604 | GridDB: A Data-Centric Overlay for Scientific Grids | 2004 | VLDB | 8.4647212e-05 |
| 5,203 | Efficient Exploration of Large Scientific Databases | 2002 | VLDB | 5.6316997e-05 |
| 12,620 | Scientific Data Repositories - Designing for a Moving Target | 2003 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,419 | Unified Lineage System: Tracking Data Provenance at Scale | 2025 | SIGMOD | 4.1945683e-05 |
| 7,132 | Enabling Privacy in Provenance-Aware Workflow Systems | 2011 | CIDR | 4.8227603e-05 |
| 3,149 | Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems | 2019 | VLDB | 7.4741595e-05 |
| 1,440 | Provenance for Generalized Map and Reduce Workflows | 2011 | CIDR | 0.00011961469 |
| 1,861 | Efficient Provenance Storage | 2008 | SIGMOD | 0.00010287053 |
| 611 | Lineage Tracing for General Data Warehouse Transformations | 2001 | VLDB | 0.00019231115 |
| 923 | Provenance and Scientific Workflows: Challenges and Opportunities | 2008 | SIGMOD | 0.0001527609 |
| 6,533 | Labeling Workflow Views with Fine-Grained Dependencies | 2012 | VLDB | 5.0245193e-05 |
| 5,843 | Tracing Lineage Beyond Relational Operators | 2007 | VLDB | 5.3032967e-05 |
| 5,802 | An Optimal Labeling Scheme for Workflow Provenance Using Skeleton Labels | 2010 | SIGMOD | 5.3209459e-05 |