Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management
Summary: Externalising operator state and managing it through explicit primitives enables both dynamic scale-out and fault-tolerant recovery for stateful stream processing. To scale out, an operator's checkpointed state is partitioned across new VMs; to recover, the state is restored from a checkpoint and tuples are replayed. The approach is demonstrated on EC2 with up to 50 VMs at a Linear Road Benchmark load factor of L=350. (summarized by gpt-5-nano on Feb 09 2026)
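The mechanism in the summary can be illustrated with a minimal Python sketch. It is not the paper's implementation: the class and function names (`Checkpoint`, `CountOperator`, `route`, `partition`, `recover`), the per-key counting operator, and the CRC-based key routing are all assumptions chosen for illustration. It shows only that one externalised checkpoint can serve both purposes: restoring it on a single new operator and replaying buffered tuples gives recovery, while splitting it by key before restoring gives scale-out.

```python
import zlib
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class Checkpoint:
    """Externalised snapshot of operator state (hypothetical layout)."""
    state: Dict[str, int]  # per-key running count
    last_ts: int           # timestamp of the last tuple reflected in `state`


@dataclass
class CountOperator:
    """Toy stateful operator that counts tuples per key."""
    state: Dict[str, int] = field(default_factory=dict)
    last_ts: int = -1

    def process(self, ts: int, key: str) -> None:
        if ts <= self.last_ts:
            return  # already reflected in the state, so skip it on replay
        self.state[key] = self.state.get(key, 0) + 1
        self.last_ts = ts

    def checkpoint(self) -> Checkpoint:
        return Checkpoint(dict(self.state), self.last_ts)

    @classmethod
    def restore(cls, cp: Checkpoint) -> "CountOperator":
        return cls(dict(cp.state), cp.last_ts)


def route(key: str, n: int) -> int:
    """Deterministically map a key to one of n operator instances."""
    return zlib.crc32(key.encode()) % n


def partition(cp: Checkpoint, n: int) -> List[Checkpoint]:
    """Split one checkpoint into n disjoint checkpoints by key."""
    parts = [Checkpoint({}, cp.last_ts) for _ in range(n)]
    for key, value in cp.state.items():
        parts[route(key, n)].state[key] = value
    return parts


def recover(cp: Checkpoint, buffered: List[Tuple[int, str]],
            n: int = 1) -> List[CountOperator]:
    """Restore n operators from a checkpoint, then replay buffered tuples.

    n == 1 models plain recovery; n > 1 models scale-out onto n new VMs.
    """
    ops = [CountOperator.restore(p) for p in partition(cp, n)]
    for ts, key in buffered:
        ops[route(key, n)].process(ts, key)
    return ops
```

Calling `recover(cp, buffered)` models recovery of a failed operator, and `recover(cp, buffered, n=2)` models scaling the same operator out to two instances. In both cases the tuples buffered upstream are replayed, and any tuple already covered by the checkpoint is ignored because of the timestamp check in `process`.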
Incoming Citations (Sorted by Pagerank)
Showing 40 of 40 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 191 | The Design of the Borealis Stream Processing Engine | 2005 | CIDR | 0.00035738595 |
| 194 | Query Processing, Resource Management, and Approximation in a Data Stream Management System | 2003 | CIDR | 0.00035426067 |
| 586 | DBToaster: Higher-order Delta Processing for Dynamic, Frequently Fresh Views | 2012 | VLDB | 0.00019685374 |
| 600 | Linear Road: A Stream Data Management Benchmark | 2004 | VLDB | 0.0001938744 |
| 960 | A Comparison of Join Algorithms for Log Processing in MapReduce | 2010 | SIGMOD | 0.00015012242 |
| 2,706 | Design, Implementation, and Evaluation of the Linear Road Benchmark on the Stream Processing Core | 2006 | SIGMOD | 8.2673299e-05 |
| 5,045 | Massive Scale-out of Expensive Continuous Queries | 2011 | VLDB | 5.740793e-05 |
| 5,049 | Run-Time Operator State Spilling for Memory Intensive Long-Running Queries | 2006 | SIGMOD | 5.7372423e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,488 | Analyzing Efficient Stream Processing on Modern Hardware | 2019 | VLDB | 6.145117e-05 |
| 9,125 | On-Demand State Separation for Cloud Data Warehousing | 2022 | VLDB | 4.3917246e-05 |
| 1,357 | Highly Available, Fault-Tolerant, Parallel Dataflows | 2004 | SIGMOD | 0.00012392275 |
| 6,629 | A Holistic View of Stream Partitioning Costs | 2017 | VLDB | 4.9880986e-05 |
| 11,819 | Toward High-Performance Distributed Stream Processing via Approximate Fault Tolerance | 2017 | VLDB | 4.1945683e-05 |
| 9,313 | Providing Resiliency to Load Variations in Distributed Stream Processing | 2006 | VLDB | 4.3565355e-05 |
| 3,886 | Fault-tolerant Stream Processing using a Distributed, Replicated File System | 2008 | VLDB | 6.6661649e-05 |
| 10,862 | How Reliable Are Streams? End-to-End Processing-Guarantee Validation and Performance Benchmarking of Stream Processing Systems | 2025 | VLDB | 4.1945683e-05 |
| 5,045 | Massive Scale-out of Expensive Continuous Queries | 2011 | VLDB | 5.740793e-05 |
| 11,804 | State Management in Apache Flink | 2017 | VLDB | 4.1945683e-05 |