Database Paper Browser

Back to papers

SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft

Summary: SparkCruise injects a workload-driven feedback loop into the Spark SQL optimizer to optimize large workloads without accessing user data. Analysis of production Spark SQL workloads vs. TPC-DS demonstrates online learning and a computation-reuse optimization. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12517
Venue
VLDB
Year
2021
Pagerank
4.5607121e-05
Overall Rank
8,197 | 42.98%
DOI
10.14778/3476311.3476388

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 13 of 13 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers