Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark
Summary: Structured Streaming is a declarative API for Apache Spark that automatically incrementalizes SQL/DataFrame queries instead of requiring a user-built DAG of operators. It targets end-to-end real-time applications that unify streaming with batch analytics, uses code generation for high performance, and supports rollbacks and mixed streaming/batch execution in production. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Michael Armbrust
2. Tathagata Das
3. Joseph Torres
4. Burak Yavuz
5. Shixiong Zhu
6. Reynold Xin
7. Ali Ghodsi
8. Ion Stoica
9. Matei Zaharia
Incoming Citations (Sorted by Pagerank)
Showing 35 of 35 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 55 | Efficiently Updating Materialized Views | 1986 | SIGMOD | 0.00065762967 |
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 191 | The Design of the Borealis Stream Processing Engine | 2005 | CIDR | 0.00035738595 |
| 401 | View Maintenance in a Warehousing Environment | 1995 | SIGMOD | 0.00024214488 |
| 522 | Differential dataflow | 2013 | CIDR | 0.00021099241 |
| 538 | The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing | 2015 | VLDB | 0.00020678804 |
| 591 | TelegraphCQ: Continuous Dataflow Processing | 2003 | SIGMOD | 0.00019569071 |
| 1,098 | Trill: A High-Performance Incremental Query Processor for Diverse Analytics | 2015 | VLDB | 0.00014114442 |
| 1,310 | Consistency Analysis in Bloom: a CALM and Collected Approach | 2011 | CIDR | 0.00012658593 |
| 2,198 | Continuous Analytics Over Discontinuous Streams | 2010 | SIGMOD | 0.00009308495 |