Summingbird: A Framework for Integrating Batch and Online MapReduce Computations
Summary: Summingbird is a Scala DSL that unifies batch and online MapReduce in a single framework, using dataflow abstractions (sources, sinks, stores) and executing on Hadoop (Scalding/Cascading) or Storm without code changes. Hybrid processing mode transparently fuses batch and online results via algebraic structures, imposing aggregations constraints that guide applicability while reducing duplication for Twitter-scale analytics. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Oscar Boykin
- 2. Sam Ritchie
- 3. Ian O'Connell
- 4. Jimmy Lin
Incoming Citations (Sorted by Pagerank)
Showing 14 of 14 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613 |
| 8,108 | Execution Primitives for Scalable Joins and Aggregations in Map Reduce | 2014 | VLDB | 4.5846987e-05 |
| 1,326 | Starling: A Scalable Query Engine on Cloud Functions | 2020 | SIGMOD | 0.00012576952 |
| 2,736 | Online Aggregation and Continuous Query support in MapReduce | 2010 | SIGMOD | 8.2043187e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 4,677 | Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications | 2018 | SIGMOD | 6.0047822e-05 |
| 824 | Twitter Heron: Stream Processing at Scale | 2015 | SIGMOD | 0.0001623129 |
| 2,818 | Implicit Parallelism through Deep Language Embedding | 2015 | SIGMOD | 8.0665558e-05 |
| 11,502 | In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All | 2021 | VLDB | 4.1945683e-05 |
| 11,709 | Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter | 2018 | SIGMOD | 4.1945683e-05 |