Runtime Variation in Big Data Analytics
Summary: Two-step predictor for runtime distribution: shape features plus a classifier with >96% accuracy. First large-scale study predicting enterprise analytics runtime categories; enables what-if analyses on allocation and scheduling. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Yiwen Zhu
2. Rathijit Sen
3. Robert Horton
4. John Mark Agosta
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,844 | InTime: Towards Performance Predictability In Byzantine Fault Tolerant Proof-of-Stake Consensus | 2025 | SIGMOD | 4.6366294e-05 |
| 9,871 | From Logs to Causal Inference: Diagnosing Large Systems | 2025 | VLDB | 4.2667743e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,757 | KEA: Tuning an Exabyte-Scale Data Infrastructure | 2021 | SIGMOD | 4.9372134e-05 |
| 11,635 | Automated Performance Management for the Big Data Stack | 2019 | CIDR | 4.1945683e-05 |
| 6,268 | Speedup Your Analytics: Automatic Parameter Tuning for Databases and Big Data Systems | 2019 | VLDB | 5.133857e-05 |
| 718 | Performance Prediction for Concurrent Database Workloads | 2011 | SIGMOD | 0.0001763106 |
| 4,174 | Computation Reuse in Analytics Job Service at Microsoft | 2018 | SIGMOD | 6.3856219e-05 |
| 5,688 | PREDIcT: Towards Predicting the Runtime of Large Scale Iterative Analytics | 2013 | VLDB | 5.3702808e-05 |
| 3,625 | Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings | 2020 | SIGMOD | 6.9055212e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
| 953 | Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance | 2010 | VLDB | 0.00015095431 |
| 5,297 | Continuous Cloud-Scale Query Optimization and Processing | 2013 | VLDB | 5.5801669e-05 |