StreamOps: Cloud-Native Runtime Management for Streaming Services in ByteDance
Summary: StreamOps is a standalone, cloud-native control plane for scalable management of tens of thousands of streaming jobs at ByteDance. It exposes an extensible detect–diagnose–resolve policy API (auto-scaler, straggler detector, job doctor) to mitigate lag and failures, and it has been validated in production.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID: 13182
- Venue: VLDB
- Year: 2023
- Pagerank: 5.5838392e-05
- Overall Rank: 5,286 | 63.23%
- DOI: 10.14778/3611540.3611543
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 191 | The Design of the Borealis Stream Processing Engine | 2005 | CIDR | 0.00035738595 |
| 314 | MillWheel: Fault-Tolerant Stream Processing at Internet Scale | 2013 | VLDB | 0.00028084774 |
| 538 | The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing | 2015 | VLDB | 0.00020678804 |
| 1,084 | Dhalion: Self-Regulating Stream Processing in Heron | 2017 | VLDB | 0.00014209714 |
| 1,226 | Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management | 2013 | SIGMOD | 0.00013180799 |
| 1,613 | Realtime Data Processing at Facebook | 2016 | SIGMOD | 0.00011140777 |
| 3,550 | Chi: A Scalable and Programmable Control Plane for Distributed Stream Processing Systems | 2018 | VLDB | 6.9843512e-05 |
| 4,044 | Megaphone: Latency-conscious state migration for distributed streaming dataflows | 2019 | VLDB | 6.4995312e-05 |
| 4,795 | Rhino: Efficient Management of Very Large Distributed State for Stream Processing Engines | 2020 | SIGMOD | 5.9158043e-05 |
| 4,822 | Consistency and Completeness: Rethinking Distributed Stream Processing in Apache Kafka | 2021 | SIGMOD | 5.8959131e-05 |
| 5,657 | BriskStream: Scaling Data Stream Processing on Shared-Memory Multicore Architectures | 2019 | SIGMOD | 5.3864606e-05 |
| 5,939 | Clonos: Consistent Causal Recovery for Highly-Available Streaming Dataflows | 2021 | SIGMOD | 5.2641681e-05 |
| 9,381 | MorphStream: Adaptive Scheduling for Scalable Transactional Stream Processing on Multicores | 2023 | SIGMOD | 4.3459591e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,217 | Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing | 2019 | SIGMOD | 4.3712054e-05 |
| 7,492 | Krypton: Real-time Serving and Analytical SQL Engine at ByteDance | 2023 | VLDB | 4.7180617e-05 |
| 9,733 | ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems | 2023 | VLDB | 4.2942813e-05 |
| 11,078 | ResLake: Towards Minimum Job Latency and Balanced Resource Utilization in Geo-distributed Job Scheduling | 2024 | VLDB | 4.1945683e-05 |
| 9,155 | Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance | 2024 | VLDB | 4.3849295e-05 |
| 4,167 | Scalable Distributed Stream Join Processing | 2015 | SIGMOD | 6.3919506e-05 |
| 1,226 | Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management | 2013 | SIGMOD | 0.00013180799 |
| 10,410 | Oceanus: Enable SLO-Aware Vertical Autoscaling for Cloud-Native Streaming Services in Tencent | 2025 | SIGMOD | 4.1945683e-05 |
| 3,550 | Chi: A Scalable and Programmable Control Plane for Distributed Stream Processing Systems | 2018 | VLDB | 6.9843512e-05 |
| 12,137 | Building User-defined Runtime Adaptation Routines for Stream Processing Applications | 2012 | VLDB | 4.1945683e-05 |