Iterative Parallel Data Processing with Stratosphere: An Inside Look
Summary: Iterative analytics on a shared-nothing data-flow engine (Stratosphere) with incremental state updates. Demonstrates end-to-end support: code, optimization plans, runtime monitoring, and a lightweight Pregel API for graph/ML workloads. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Stephan Ewen
- 2. Sebastian Schelter
- 3. Kostas Tzoumas
- 4. Daniel Warneke
- 5. Volker Markl
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4 | Pregel: A System for Large-Scale Graph Processing | 2010 | SIGMOD | 0.0019005923 |
| 1,355 | SQL/MapReduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions | 2009 | VLDB | 0.00012404572 |
| 2,172 | Spinning Fast Iterative Data Flows | 2012 | VLDB | 9.3706587e-05 |
| 2,611 | Opening the Black Boxes in Data Flow Optimization | 2012 | VLDB | 8.4536967e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,813 | GraphScope: A One-Stop Large Graph Processing System | 2021 | VLDB | 4.6441779e-05 |
| 11,188 | ST4ML: Machine Learning Oriented Spatio-Temporal Data Processing at Scale | 2023 | SIGMOD | 4.1945683e-05 |
| 13,330 | Optimizing Data-Intensive Applications Automatically By Leveraging Parallel Data Processing Frameworks | 2017 | SIGMOD | - |
| 7,882 | Massively Parallel Data Analysis with PACTs on Nephele | 2010 | VLDB | 4.6285796e-05 |
| 8,534 | Translation of Array-Based Loops to Distributed Data-Parallel Programs | 2020 | VLDB | 4.4937074e-05 |
| 9,001 | The Power of Nested Parallelism in Big Data Processing – Hitting Three Flies with One Slap – | 2021 | SIGMOD | 4.4107627e-05 |
| 522 | Differential dataflow | 2013 | CIDR | 0.00021099241 |
| 1,685 | Fast Iterative Graph Computation with Block Updates | 2013 | VLDB | 0.0001091808 |
| 5,209 | Explaining Outputs in Modern Data Analytics | 2016 | VLDB | 5.629362e-05 |
| 2,172 | Spinning Fast Iterative Data Flows | 2012 | VLDB | 9.3706587e-05 |