Cost-based Fault-tolerance for Parallel Data Processing
Summary: Cost-based fault-tolerance for PDEs picks a subset of intermediates to materialize, reducing runtime under mid-query failures. Outperforms coarse restarts and lineage schemes across workloads, delivering a trade-off for short and long queries. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Abdallah Salama
- 2. Carsten Binnig
- 3. Tim Kraska
- 4. Erfan Zamanian
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,194 | Phoebe: A Learning-based Checkpoint Optimizer | 2021 | VLDB | 4.3761777e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 2,575 | A Latency and Fault-Tolerance Optimizer for Online Parallel Query Plans | 2011 | SIGMOD | 8.5133576e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 438 | Query Optimization for Parallel Execution | 1992 | SIGMOD | 0.00023199245 |
| 6,061 | Towards Energy-Efficient Database Cluster Design | 2012 | VLDB | 5.2304505e-05 |
| 1,957 | On the Design and Scalability of Distributed Shared-Data Databases | 2015 | SIGMOD | 9.9598319e-05 |
| 12,668 | Fault-tolerant, Load-balancing Queries in Telegraph | 2001 | SIGMOD | 4.1945683e-05 |
| 12,203 | Resiliency-Aware Data Management | 2011 | VLDB | 4.1945683e-05 |
| 7,125 | Fast Failure Recovery in Distributed Graph Processing Systems | 2015 | VLDB | 4.8246382e-05 |
| 2,618 | Distributing A Database For Parallelism | 1983 | SIGMOD | 8.4447319e-05 |
| 3,821 | Locality-aware Partitioning in Parallel Database Systems | 2015 | SIGMOD | 6.7281515e-05 |
| 1,357 | Highly Available, Fault-Tolerant, Parallel Dataflows | 2004 | SIGMOD | 0.00012392275 |
| 2,575 | A Latency and Fault-Tolerance Optimizer for Online Parallel Query Plans | 2011 | SIGMOD | 8.5133576e-05 |