Prompt: Dynamic Data-Partitioning for Distributed Micro-batch Stream Processing Systems
Summary: Prompt introduces dynamic data partitioning for micro-batch streams, with buffering and key-sorting to handle skew. Greedy workload-aware partitioning with load-aware distribution and elastic resources yields 2x throughput with maintained latency. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,041 | TreeSensing: Linearly Compressing Sketches with Flexibility | 2023 | SIGMOD | 4.4039656e-05 |
| 9,797 | Dalton: Learned Partitioning for Distributed Data Streams | 2023 | VLDB | 4.2818172e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 538 | The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing | 2015 | VLDB | 0.00020678804 |
| 1,477 | Fine-grained Partitioning for Aggressive Data Skipping | 2014 | SIGMOD | 0.00011770865 |
| 1,548 | Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark | 2018 | SIGMOD | 0.00011431383 |
| 2,706 | Design, Implementation, and Evaluation of the Linear Road Benchmark on the Stream Processing Core | 2006 | SIGMOD | 8.2673299e-05 |
| 5,045 | Massive Scale-out of Expensive Continuous Queries | 2011 | VLDB | 5.740793e-05 |
| 6,629 | A Holistic View of Stream Partitioning Costs | 2017 | VLDB | 4.9880986e-05 |
| 11,943 | A Demonstration of AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data | 2015 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,797 | Dalton: Learned Partitioning for Distributed Data Streams | 2023 | VLDB | 4.2818172e-05 |
| 2,814 | Tuple Routing Strategies for Distributed Eddies | 2003 | VLDB | 8.0749691e-05 |
| 11,993 | A Partitioning Framework for Aggressive Data Skipping | 2014 | VLDB | 4.1945683e-05 |
| 9,217 | Elasticutor: Rapid Elasticity for Realtime Stateful Stream Processing | 2019 | SIGMOD | 4.3712054e-05 |
| 1,003 | Adaptive Filters for Continuous Queries over Distributed Data Streams | 2003 | SIGMOD | 0.00014698435 |
| 9,313 | Providing Resiliency to Load Variations in Distributed Stream Processing | 2006 | VLDB | 4.3565355e-05 |
| 10,967 | Low-Latency Adaptive Distributed Stream Join System Based on a Flexible Join Model | 2024 | SIGMOD | 4.1945683e-05 |
| 2,080 | Optimal Sampling From Distributed Streams | 2010 | PODS | 9.5899129e-05 |
| 6,629 | A Holistic View of Stream Partitioning Costs | 2017 | VLDB | 4.9880986e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |