An Algebraic Approach for Data-Centric Scientific Workflows
Summary: Algebraic approach inspired by relational algebra, plus a parallel execution model to automatically optimize data-centric scientific workflows. Validated on oil-exploitation and synthetic data with the Chiron engine, achieving up to 226% speedup vs ad-hoc implementations. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Eduardo Ogasawara
- 2. Jonas Dias
- 3. Daniel de Oliveira
- 4. Fábio Porto
- 5. Patrick Valduriez
- 6. Marta Mattoso
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,965 | SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment | 2016 | SIGMOD | 7.8059273e-05 |
| 8,078 | Meta-Dataflows: Efficient Exploratory Dataflow Jobs | 2018 | SIGMOD | 4.5914967e-05 |
| 11,743 | DfAnalyzer: Runtime Dataflow Analysis of Scientific Applications using Provenance | 2018 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 2,417 | Dynamic Load Balancing in Hierarchical Parallel Database Systems | 1996 | VLDB | 8.8604775e-05 |
| 3,674 | An Approach to Optimize Data Processing in Business Processes | 2007 | VLDB | 6.8558403e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,710 | Optimizing Analytic Data Flows for Multiple Execution Engines | 2012 | SIGMOD | 6.8238962e-05 |
| 2,172 | Spinning Fast Iterative Data Flows | 2012 | VLDB | 9.3706587e-05 |
| 4,051 | Workflow, Transactions and Datalog | 1999 | PODS | 6.4940502e-05 |
| 5,850 | Active and Accelerated Learning of Cost Models for Optimizing Scientific Applications | 2006 | VLDB | 5.3009887e-05 |
| 8,498 | Customizable Parallel Execution of Scientific Stream Queries | 2005 | VLDB | 4.4981372e-05 |
| 4,700 | Schedule Optimization for Data Processing Flows on the Cloud | 2011 | SIGMOD | 5.9882572e-05 |
| 7,362 | Algebraic Optimization of Computations over Scientific Databases | 1993 | VLDB | 4.752436e-05 |
| 3,674 | An Approach to Optimize Data Processing in Business Processes | 2007 | VLDB | 6.8558403e-05 |
| 3,196 | Algebraic Manipulation of Scientific Datasets | 2004 | VLDB | 7.4017366e-05 |
| 1,765 | Efficient Lineage Tracking For Scientific Workflows | 2008 | SIGMOD | 0.00010630348 |