Database Paper Browser

The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing

Summary: Proposes the Dataflow Model, an abstraction for processing unbounded, out-of-order data streams in event time, with tunable trade-offs among correctness, latency, and cost. Argues that systems should assume new data will keep arriving and old data may be retracted, rather than waiting for inputs to become complete. Presents formal semantics, core principles, and validation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 11057
Venue: VLDB
Year: 2015
Pagerank: 0.00020678804
Overall Rank: 538 | 96.26%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 16 of 66 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
10,770 | cedar: Optimized and Unified Machine Learning Input Data Pipelines | 2025 | VLDB | 4.1945683e-05
10,856 | Analyzing Near-Network Hardware Acceleration with Co-Processing on DPUs | 2025 | VLDB | 4.1945683e-05
10,862 | How Reliable Are Streams? End-to-End Processing-Guarantee Validation and Performance Benchmarking of Stream Processing Systems | 2025 | VLDB | 4.1945683e-05
10,941 | PECJ: Stream Window Join on Disorder Data Streams with Proactive Error Compensation | 2024 | SIGMOD | 4.1945683e-05
11,146 | Raising the Level of Abstraction for Time-State Analytics With the Timeline Framework | 2023 | CIDR | 4.1945683e-05
11,243 | Fries: Fast and Consistent Runtime Reconfiguration in Dataflow Systems with Transactional Guarantees | 2023 | VLDB | 4.1945683e-05
11,292 | Portals: A Showcase of Multi-Dataflow Stateful Serverless | 2023 | VLDB | 4.1945683e-05
11,435 | Synchronization Schemas | 2021 | PODS | 4.1945683e-05
11,468 | Klink: Progress-Aware Scheduling for Streaming Data Systems | 2021 | SIGMOD | 4.1945683e-05
11,485 | Real-time Data Infrastructure at Uber | 2021 | SIGMOD | 4.1945683e-05
11,502 | In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All | 2021 | VLDB | 4.1945683e-05
11,625 | InvaliDB: Scalable Push-Based Real-Time Queries on Top of Pull-Based Databases (Extended) | 2020 | VLDB | 4.1945683e-05
11,709 | Robust, Scalable, Real-Time Event Time Series Aggregation at Twitter | 2018 | SIGMOD | 4.1945683e-05
11,728 | Challenges and Experiences in Building an Efficient Apache Beam Runner For IBM Streams | 2018 | VLDB | 4.1945683e-05
11,804 | State Management in Apache Flink | 2017 | VLDB | 4.1945683e-05
11,805 | CarStream: An Industrial System of Big Data Processing for Internet-of-Vehicles | 2017 | VLDB | 4.1945683e-05
Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers