Back to papers
Accelerating Stream Processing Engines via Hardware Offloading
Summary: FlexStream offloads re-partitioning to hardware and adopts a coupled network-executor model to saturate NIC bandwidth and rethink SPE parallelization. With a lock-free state backend and fast migration it achieves 1.95–3.35× throughput and much lower latency spikes and migration time versus prior SPEs.
(summarized by gpt-5-mini on Feb 11 2026)
- Paper ID
- 7350
- Venue
- SIGMOD
- Year
- 2026
- Pagerank
- 4.1945683e-05
- Overall Rank
- 10,043 | 30.14%
- DOI
-
10.1145/3769754
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 15 of 15 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 288 |
Storm @Twitter |
2014 |
SIGMOD |
0.00028939871 |
| 418 |
Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age |
2014 |
SIGMOD |
0.00023729211 |
| 824 |
Twitter Heron: Stream Processing at Scale |
2015 |
SIGMOD |
0.0001623129 |
| 1,084 |
Dhalion: Self-Regulating Stream Processing in Heron |
2017 |
VLDB |
0.00014209714 |
| 1,226 |
Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management |
2013 |
SIGMOD |
0.00013180799 |
| 4,044 |
Megaphone: Latency-conscious state migration for distributed streaming dataflows |
2019 |
VLDB |
6.4995312e-05 |
| 4,488 |
Analyzing Efficient Stream Processing on Modern Hardware |
2019 |
VLDB |
6.145117e-05 |
| 4,795 |
Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines |
2020 |
SIGMOD |
5.9158043e-05 |
| 5,193 |
LightSaber: Efficient Window Aggregation on Multi-core Processors |
2020 |
SIGMOD |
5.6371049e-05 |
| 5,657 |
BriskStream: Scaling Data Stream Processing on Shared-Memory Multicore Architectures |
2019 |
SIGMOD |
5.3864606e-05 |
| 6,648 |
Grizzly: Efficient Stream Processing Through Adaptive Query Compilation |
2020 |
SIGMOD |
4.9771723e-05 |
| 6,767 |
Watermarks in Stream Processing Systems: Semantics and Comparative Analysis of Apache Flink and Google Cloud Dataflow |
2021 |
VLDB |
4.9322174e-05 |
| 8,001 |
Rethinking Stateful Stream Processing with RDMA |
2022 |
SIGMOD |
4.6092573e-05 |
| 9,217 |
Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing |
2019 |
SIGMOD |
4.3712054e-05 |
| 9,496 |
Scabbard: Single-Node Fault-Tolerant Stream Processing |
2022 |
VLDB |
4.3341665e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 10,967 |
Low-Latency Adaptive Distributed Stream Join System Based on a Flexible Join Model |
2024 |
SIGMOD |
4.1945683e-05 |
| 5,193 |
LightSaber: Efficient Window Aggregation on Multi-core Processors |
2020 |
SIGMOD |
5.6371049e-05 |
| 7,736 |
From a Stream of Relational Queries to Distributed Stream Processing |
2010 |
VLDB |
4.664248e-05 |
| 9,313 |
Providing Resiliency to Load Variations in Distributed Stream Processing |
2006 |
VLDB |
4.3565355e-05 |
| 7,930 |
Demonstrating PDSP-Bench: A Benchmarking System for Parallel and Distributed Stream Processing |
2025 |
SIGMOD |
4.613363e-05 |
| 8,001 |
Rethinking Stateful Stream Processing with RDMA |
2022 |
SIGMOD |
4.6092573e-05 |
| 2,706 |
Design, Implementation, and Evaluation of the Linear Road Benchmark on the Stream Processing Core |
2006 |
SIGMOD |
8.2673299e-05 |
| 9,217 |
Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing |
2019 |
SIGMOD |
4.3712054e-05 |
| 6,629 |
A Holistic View of Stream Partitioning Costs |
2017 |
VLDB |
4.9880986e-05 |
| 4,488 |
Analyzing Efficient Stream Processing on Modern Hardware |
2019 |
VLDB |
6.145117e-05 |