Database Paper Browser

Back to papers

Storm @Twitter

Summary: Describes Storm, a real-time, fault-tolerant distributed stream processing system deployed at Twitter at scale. Covers architecture, topology execution, scale-out, and fault tolerance, with empirical resilience data and production-time lessons. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4858
Venue
SIGMOD
Year
2014
Pagerank
0.00028939871
Overall Rank
288 | 98.00%
DOI
10.1145/2588555.2595641

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 73 citing papers.

Rank Citing Paper Year Venue Pagerank
314 MillWheel: Fault-Tolerant Stream Processing at Internet Scale 2013 VLDB 0.00028084774
824 Twitter Heron: Stream Processing at Scale 2015 SIGMOD 0.0001623129
1,084 Dhalion: Self-Regulating Stream Processing in Heron 2017 VLDB 0.00014209714
1,613 Realtime Data Processing at Facebook 2016 SIGMOD 0.00011140777
1,794 Summingbird: A Framework for Integrating Batch and Online MapReduce Computations 2014 VLDB 0.00010532024
1,953 Distributed Evaluation of Subgraph Queries Using Worst-case Optimal Low-Memory Dataflows 2018 VLDB 9.9665955e-05
2,264 S-Store: Streaming Meets Transaction Processing 2015 VLDB 9.1575142e-05
2,338 Samza: Stateful Scalable Stream Processing at LinkedIn 2017 VLDB 9.00711e-05
2,819 Mison: A Fast JSON Parser for Data Analytics 2017 VLDB 8.0651326e-05
2,826 Regular Path Query Evaluation on Streaming Graphs 2020 SIGMOD 8.056119e-05
3,333 SnappyData: A Unified Cluster for Streaming, Transactions, and Interactive Analytics 2017 CIDR 7.2093479e-05
3,378 General Incremental Sliding-Window Aggregation 2015 VLDB 7.1622572e-05
3,550 Chi: A Scalable and Programmable Control Plane for Distributed Stream Processing Systems 2018 VLDB 6.9843512e-05
3,569 S-Store: A Streaming NewSQL System for Big Velocity Applications 2014 VLDB 6.9608969e-05
3,762 SABER: Window-Based Hybrid Stream Processing for Heterogeneous Architectures 2016 SIGMOD 6.7804471e-05
4,120 Husky: Towards a More Efficient and Expressive Distributed Computing Framework 2016 VLDB 6.4364588e-05
4,488 Analyzing Efficient Stream Processing on Modern Hardware 2019 VLDB 6.145117e-05
4,577 Accelerating Dynamic Graph Analytics on GPUs 2018 VLDB 6.0709631e-05
4,885 GraphJet: Real-Time Content Recommendations at Twitter 2016 VLDB 5.8534354e-05
5,193 LightSaber: Efficient Window Aggregation on Multi-core Processors 2020 SIGMOD 5.6371049e-05
5,211 Tornado: A System For Real-Time Iterative Analysis Over Evolving Data 2016 SIGMOD 5.6284829e-05
5,262 SnappyData: A Hybrid Transactional Analytical Store Built On Spark 2016 SIGMOD 5.5977349e-05
5,939 Clonos: Consistent Causal Recovery for Highly-Available Streaming Dataflows 2021 SIGMOD 5.2641681e-05
6,242 Helios: Hyperscale Indexing for the Cloud & Edge 2020 VLDB 5.1408379e-05
6,476 Parallel Index-based Stream Join on a Multicore CPU 2020 SIGMOD 5.0496617e-05
6,629 A Holistic View of Stream Partitioning Costs 2017 VLDB 4.9880986e-05
6,648 Grizzly: Efficient Stream Processing Through Adaptive Query Compilation 2020 SIGMOD 4.9771723e-05
6,759 AStream: Ad-hoc Shared Stream Processing 2019 SIGMOD 4.9352213e-05
6,856 Liquid: Unifying Nearline and Offline Big Data Integration 2015 CIDR 4.9060615e-05
6,871 Towards General and Efficient Online Tuning for Spark 2023 VLDB 4.8997004e-05
7,373 Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile 2021 VLDB 4.7494183e-05
7,627 Incremental Sliding Window Connectivity over Streaming Graphs 2024 VLDB 4.6928167e-05
8,001 Rethinking Stateful Stream Processing with RDMA 2022 SIGMOD 4.6092573e-05
8,018 Parallelizing Intra-Window Join on Multicores: An Experimental Study 2021 SIGMOD 4.6046381e-05
8,075 AJoin: Ad-hoc Stream Joins at Scale 2020 VLDB 4.5917655e-05
8,217 Spur: Mitigating Slow Instances in Large-Scale Streaming Pipelines 2020 SIGMOD 4.5568298e-05
8,713 Stateful Entities: Object-oriented Cloud Applications as Distributed Dataflows 2023 CIDR 4.4625215e-05
8,746 Texera: A System for Collaborative and Interactive Data Analytics Using Workflows 2024 VLDB 4.456315e-05
8,909 What's the Difference? Incremental Processing with Change Queries in Snowflake 2023 SIGMOD 4.427232e-05
8,922 Enabling Signal Processing over Data Streams 2017 SIGMOD 4.427232e-05
9,217 Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing 2019 SIGMOD 4.3712054e-05
9,318 Disaggregated State Management in Apache FlinkĀ® 2.0 2025 VLDB 4.3556432e-05
9,381 MorphStream: Adaptive Scheduling for Scalable Transactional Stream Processing on Multicores 2023 SIGMOD 4.3459591e-05
9,496 Scabbard: Single-Node Fault-Tolerant Stream Processing 2022 VLDB 4.3341665e-05
9,501 Dhalion in Action: Automatic Management of Streaming Applications 2018 VLDB 4.3341665e-05
9,504 Supporting Scalable Analytics with Latency Constraints 2015 VLDB 4.3341665e-05
9,604 GeaFlow: A Graph Extended and Accelerated Dataflow System 2023 SIGMOD 4.3177432e-05
9,733 ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems 2023 VLDB 4.2942813e-05
9,797 Dalton: Learned Partitioning for Distributed Data Streams 2023 VLDB 4.2818172e-05
9,803 Railgun: managing large streaming windows under MAD requirements 2021 VLDB 4.2807806e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers