Database Paper Browser

Back to papers

Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark

Summary: Declarative Spark Structured Streaming; incrementalizes SQL/DataFrame queries, not user-built DAG. End-to-end real-time apps unifying streaming with batch analytics; code generation yields high performance; rollbacks and mixed execution in production. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5497
Venue
SIGMOD
Year
2018
Pagerank
0.00011431383
Overall Rank
1,548 | 89.24%
DOI
10.1145/3183713.3190664

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 35 of 35 citing papers.

Rank Citing Paper Year Venue Pagerank
746 Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores 2020 VLDB 0.00017326979
4,044 Megaphone: Latency-conscious state migration for distributed streaming dataflows 2019 VLDB 6.4995312e-05
5,130 One SQL to Rule Them All – an Efficient and Syntactically Idiomatic Approach to Management of Streams and Tables 2019 SIGMOD 5.6755067e-05
5,193 LightSaber: Efficient Window Aggregation on Multi-core Processors 2020 SIGMOD 5.6371049e-05
5,731 Babelfish: Efficient Execution of Polyglot Queries 2022 VLDB 5.3502065e-05
5,939 Clonos: Consistent Causal Recovery for Highly-Available Streaming Dataflows 2021 SIGMOD 5.2641681e-05
6,242 Helios: Hyperscale Indexing for the Cloud & Edge 2020 VLDB 5.1408379e-05
6,436 Providing Streaming Joins as a Service at Facebook 2018 VLDB 5.0636254e-05
6,721 Beyond Analytics: The Evolution of Stream Processing Systems 2020 SIGMOD 4.9492015e-05
6,759 AStream: Ad-hoc Shared Stream Processing 2019 SIGMOD 4.9352213e-05
7,373 Hazelcast Jet: Low-latency Stream Processing at the 99.99th Percentile 2021 VLDB 4.7494183e-05
7,938 Correctness in Stream Processing: Challenges and Opportunities 2022 CIDR 4.613363e-05
8,075 AJoin: Ad-hoc Stream Joins at Scale 2020 VLDB 4.5917655e-05
8,217 Spur: Mitigating Slow Instances in Large-Scale Streaming Pipelines 2020 SIGMOD 4.5568298e-05
8,480 Optimization of Threshold Functions over Streams 2021 VLDB 4.5011552e-05
8,596 Prompt: Dynamic Data-Partitioning for Distributed Micro-batch Stream Processing Systems 2020 SIGMOD 4.4887993e-05
8,788 FishStore: Faster Ingestion with Subset Hashing 2019 SIGMOD 4.451039e-05
8,909 What's the Difference? Incremental Processing with Change Queries in Snowflake 2023 SIGMOD 4.427232e-05
9,496 Scabbard: Single-Node Fault-Tolerant Stream Processing 2022 VLDB 4.3341665e-05
9,516 [Demo] Low-latency Spark Queries on Updatable Data 2019 SIGMOD 4.3335877e-05
9,604 GeaFlow: A Graph Extended and Accelerated Dataflow System 2023 SIGMOD 4.3177432e-05
9,803 Railgun: managing large streaming windows under MAD requirements 2021 VLDB 4.2807806e-05
9,883 Towards Resource-adaptive Query Execution in Cloud Native Databases 2024 CIDR 4.2635782e-05
9,913 Chukonu: A Fully-Featured High-Performance Big Data Framework that Integrates a Native Compute Engine into Spark 2022 VLDB 4.2565279e-05
10,077 Enjima: A Resource-Adaptive Stream Processing System 2026 SIGMOD 4.1945683e-05
10,259 Scarf: Self-Adaptive Tuning via Multi-Objective Reinforcement Learning for Apache Flink 2026 VLDB 4.1945683e-05
10,263 APEROL: Adaptive Parallel Edge-to-cloud Runtime Optimization for Layered Workflow Execution 2026 VLDB 4.1945683e-05
10,280 Meerkat: Scalable, Network-Aware Failure Recovery for the Internet of Things 2026 VLDB 4.1945683e-05
10,417 Streaming Democratized: Ease Across the Latency Spectrum with Delayed View Semantics and Snowflake Dynamic Tables 2025 SIGMOD 4.1945683e-05
10,509 Styx: Transactional Stateful Functions on Streaming Dataflows 2025 SIGMOD 4.1945683e-05
10,577 Agamotto: Scheduling of Deadline-Oriented Incremental Query Execution under Uncertain Resource Price 2025 VLDB 4.1945683e-05
10,736 TreeCat: Standalone Catalog Engine for Large Data Systems 2025 VLDB 4.1945683e-05
10,962 Fault Tolerance Placement in the Internet of Things 2024 SIGMOD 4.1945683e-05
11,243 Fries: Fast and Consistent Runtime Reconfiguration in Dataflow Systems with Transactional Guarantees 2023 VLDB 4.1945683e-05
11,505 Approximating Median Absolute Deviation with Bounded Error 2021 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers