Spinning Fast Iterative Data Flows
Summary: Integrates incremental (workset) iterations into parallel dataflows, enabling mutable state and exploitation of sparse dependencies in iterative algorithms. A prototype shows speedups of up to 100× when these properties are exploited and is competitive with specialized systems, while preserving a unified dataflow abstraction. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Stephan Ewen
2. Kostas Tzoumas
3. Volker Markl
4. Moritz Kaufmann
Incoming Citations (Sorted by Pagerank)
Showing 16 of 16 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103 |
| 4 | Pregel: A System for Large-Scale Graph Processing | 2010 | SIGMOD | 0.0019005923 |
| 20 | GAMMA - A High Performance Dataflow Database Machine | 1986 | VLDB | 0.00086459551 |
| 22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613 |
| 77 | An Amateur's Introduction to Recursive Query Processing Strategies | 1986 | SIGMOD | 0.00057043861 |
| 232 | A Performance Evaluation of Four Parallel Join Algorithms in a Shared-Nothing Multiprocessor Environment | 1989 | SIGMOD | 0.00032122485 |
| 365 | On the Power of Magic | 1987 | PODS | 0.00025585898 |
| 413 | HaLoop: Efficient Iterative Data Processing on Large Clusters | 2010 | VLDB | 0.00023904409 |
| 520 | An Overview of The System Software of A Parallel Relational Database Machine GRACE | 1986 | VLDB | 0.00021152636 |
| 2,611 | Opening the Black Boxes in Data Flow Optimization | 2012 | VLDB | 0.000084536967 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 538 | The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing | 2015 | VLDB | 0.00020678804 |
| 8,534 | Translation of Array-Based Loops to Distributed Data-Parallel Programs | 2020 | VLDB | 0.000044937074 |
| 9,547 | Optimistic Recovery for Iterative Dataflows in Action | 2015 | SIGMOD | 0.000043259935 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 0.000043341665 |
| 3,710 | Optimizing Analytic Data Flows for Multiple Execution Engines | 2012 | SIGMOD | 0.000068238962 |
| 8,078 | Meta-Dataflows: Efficient Exploratory Dataflow Jobs | 2018 | SIGMOD | 0.000045914967 |
| 1,685 | Fast Iterative Graph Computation with Block Updates | 2013 | VLDB | 0.0001091808 |
| 522 | Differential dataflow | 2013 | CIDR | 0.00021099241 |
| 12,039 | Iterative Parallel Data Processing with Stratosphere: An Inside Look | 2013 | SIGMOD | 0.000041945683 |
| 5,209 | Explaining Outputs in Modern Data Analytics | 2016 | VLDB | 0.00005629362 |