Database Paper Browser

Back to papers

Twitter Heron: Stream Processing at Scale

Summary: Introduces Heron, a real-time stream engine to replace Storm at Twitter, addressing scale, debuggability, performance, and manageability. Design insights and empirical evidence of efficiency and scalability as Twitter’s de facto streaming backend. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5013
Venue
SIGMOD
Year
2015
Pagerank
0.0001623129
Overall Rank
824 | 94.27%
DOI
10.1145/2723372.2723374

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 45 of 45 citing papers.

Rank Citing Paper Year Venue Pagerank
1,084 Dhalion: Self-Regulating Stream Processing in Heron 2017 VLDB 0.00014209714
1,613 Realtime Data Processing at Facebook 2016 SIGMOD 0.00011140777
2,338 Samza: Stateful Scalable Stream Processing at LinkedIn 2017 VLDB 9.00711e-05
2,826 Regular Path Query Evaluation on Streaming Graphs 2020 SIGMOD 8.056119e-05
3,210 Frontier: Resilient Edge Processing for the Internet of Things 2018 VLDB 7.3746627e-05
3,386 Lethe: A Tunable Delete-Aware LSM Engine 2020 SIGMOD 7.1577103e-05
3,550 Chi: A Scalable and Programmable Control Plane for Distributed Stream Processing Systems 2018 VLDB 6.9843512e-05
3,659 Autoscaling Tiered Cloud Storage in Anna 2019 VLDB 6.8696023e-05
3,704 How to Win a Hot Dog Eating Contest: Distributed Incremental View Maintenance with Batch Updates 2016 SIGMOD 6.827494e-05
4,390 LogStore: A Cloud-Native and Multi-Tenant Log Database 2021 SIGMOD 6.2279149e-05
5,263 Consistent Regions: Guaranteed Tuple Processing in IBM Streams 2016 VLDB 5.5976361e-05
5,657 BriskStream: Scaling Data Stream Processing on Shared-Memory Multicore Architectures 2019 SIGMOD 5.3864606e-05
5,732 TcpRT: Instrument and Diagnostic Analysis System for Service Quality of Cloud Databases at Massive Scale in Real-time 2018 SIGMOD 5.3501728e-05
5,939 Clonos: Consistent Causal Recovery for Highly-Available Streaming Dataflows 2021 SIGMOD 5.2641681e-05
6,123 Data Ingestion for the Connected World 2017 CIDR 5.1991194e-05
6,629 A Holistic View of Stream Partitioning Costs 2017 VLDB 4.9880986e-05
6,871 Towards General and Efficient Online Tuning for Spark 2023 VLDB 4.8997004e-05
7,373 Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile 2021 VLDB 4.7494183e-05
7,627 Incremental Sliding Window Connectivity over Streaming Graphs 2024 VLDB 4.6928167e-05
7,747 TSCache: An Efficient Flash-based Caching Scheme for Time-series Data Workloads 2021 VLDB 4.6616405e-05
7,866 Operational Analytics Data Management Systems 2016 VLDB 4.6321795e-05
7,998 Data Management for Social Networking 2016 PODS 4.6101889e-05
8,217 Spur: Mitigating Slow Instances in Large-Scale Streaming Pipelines 2020 SIGMOD 4.5568298e-05
8,922 Enabling Signal Processing over Data Streams 2017 SIGMOD 4.427232e-05
9,187 POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance 2024 VLDB 4.3780059e-05
9,217 Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing 2019 SIGMOD 4.3712054e-05
9,496 Scabbard: Single-Node Fault-Tolerant Stream Processing 2022 VLDB 4.3341665e-05
9,501 Dhalion in Action: Automatic Management of Streaming Applications 2018 VLDB 4.3341665e-05
9,733 ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems 2023 VLDB 4.2942813e-05
9,803 Railgun: managing large streaming windows under MAD requirements 2021 VLDB 4.2807806e-05
10,043 Accelerating Stream Processing Engines via Hardware Offloading 2026 SIGMOD 4.1945683e-05
10,280 Meerkat: Scalable, Network-Aware Failure Recovery for the Internet of Things 2026 VLDB 4.1945683e-05
10,962 Fault Tolerance Placement in the Internet of Things 2024 SIGMOD 4.1945683e-05
11,435 Synchronization Schemas 2021 PODS 4.1945683e-05
11,468 Klink: Progress-Aware Scheduling for Streaming Data Systems 2021 SIGMOD 4.1945683e-05
11,485 Real-time Data Infrastructure at Uber 2021 SIGMOD 4.1945683e-05
11,625 InvaliDB: Scalable Push-Based Real-Time Queries on Top of Pull-Based Databases (Extended) 2020 VLDB 4.1945683e-05
11,673 Online Template Induction for Machine-Generated Emails 2019 VLDB 4.1945683e-05
11,709 Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter 2018 SIGMOD 4.1945683e-05
11,728 Challenges and Experiences in Building an Efficient Apache Beam Runner For IBM Streams 2018 VLDB 4.1945683e-05
11,774 Query Processing Techniques for Big Spatial-Keyword Data 2017 SIGMOD 4.1945683e-05
11,805 CarStream: An Industrial System of Big Data Processing for Internet-of-Vehicles 2017 VLDB 4.1945683e-05
11,807 Upsortable: Programming Top-K Queries Over Data Streams 2017 VLDB 4.1945683e-05
11,819 Toward High-Performance Distributed Stream Processing via Approximate Fault Tolerance 2017 VLDB 4.1945683e-05
11,849 The Challenges of Global-scale Data Management 2016 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
288 Storm @Twitter 2014 SIGMOD 0.00028939871
314 MillWheel: Fault-Tolerant Stream Processing at Internet Scale 2013 VLDB 0.00028084774
1,222 Querying and Mining Data Streams: You Only Get One Look 2002 SIGMOD 0.00013213129
1,286 Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams 2013 SIGMOD 0.0001282373
1,794 Summingbird: A Framework for Integrating Batch and Online MapReduce Computations 2014 VLDB 0.00010532024
2,597 Continuous Queries in Oracle 2007 VLDB 8.4713998e-05
Previous Page 1 / 1 Next

Semantically Similar Papers