Recurring Job Optimization in Scope
Summary: Recurring jobs are instrumented to collect statistics during execution, piggybacking on normal runs to capture distributions with minimal overhead. These statistics feed the Scope optimizer to improve future runs of similar jobs, enabling scalable, data-aware optimization. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Nicolas Bruno
2. Sameer Agarwal
3. Srikanth Kandula
4. Bing Shi
5. Ming-Chuan Wu
6. Jingren Zhou
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,152 | Blink and It's Done: Interactive Queries on Very Large Data | 2012 | VLDB | 0.00013645792 |
| 1,874 | Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems | 2014 | SIGMOD | 0.00010244443 |
| 1,922 | Selecting Subexpressions to Materialize at Datacenter Scale | 2018 | VLDB | 0.00010082599 |
| 2,237 | Procedural Extensions of SQL: Understanding their usage in the wild | 2021 | VLDB | 9.2212748e-05 |
| 3,625 | Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings | 2020 | SIGMOD | 6.9055212e-05 |
| 5,297 | Continuous Cloud-Scale Query Optimization and Processing | 2013 | VLDB | 5.5801669e-05 |
| 6,109 | Pixida: Optimizing Data Parallel Jobs in Wide-Area Data Analytics | 2016 | VLDB | 5.2059441e-05 |
| 6,757 | KEA: Tuning an Exabyte-Scale Data Infrastructure | 2021 | SIGMOD | 4.9372134e-05 |
| 7,778 | Runtime Variation in Big Data Analytics | 2023 | SIGMOD | 4.653651e-05 |
| 11,958 | Shared Execution of Recurring Workloads in MapReduce | 2015 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Overall Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,673 | Incorporating Super-Operators in Big-Data Query Optimizers | 2020 | VLDB | 4.966799e-05 |
| 4,132 | Advanced Join Strategies for Large-Scale Distributed Computation | 2014 | VLDB | 6.4241067e-05 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 11,753 | Effective Temporal Dependence Discovery in Time Series Data | 2018 | VLDB | 4.1945683e-05 |
| 11,958 | Shared Execution of Recurring Workloads in MapReduce | 2015 | VLDB | 4.1945683e-05 |
| 7,778 | Runtime Variation in Big Data Analytics | 2023 | SIGMOD | 4.653651e-05 |
| 4,061 | Advanced Partitioning Techniques for Massively Distributed Computation | 2012 | SIGMOD | 6.483587e-05 |
| 4,174 | Computation Reuse in Analytics Job Service at Microsoft | 2018 | SIGMOD | 6.3856219e-05 |
| 6,040 | Steering Query Optimizers: A Practical Take on Big Data Workloads | 2021 | SIGMOD | 5.2412035e-05 |
| 5,297 | Continuous Cloud-Scale Query Optimization and Processing | 2013 | VLDB | 5.5801669e-05 |