Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing
Summary: Fine-grained instance-level modeling with a MaxCompute-based architecture decomposes resource optimization into simpler, multi-objective decisions (partition count, placement, per-instance resources). Novel predictive models and optimization methods enable sub-second RO and yield 37–72% latency and 43–78% cost reductions on production workloads. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Chenghao Lyu
- 2. Qi Fan
- 3. Fei Song
- 4. Arnab Sinha
- 5. Yanlei Diao
- 6. Wei Chen
- 7. Li Ma
- 8. Yihui Feng
- 9. Yaliang Li
- 10. Kai Zeng
- 11. Jingren Zhou
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,701 | Resource Management in Aurora Serverless | 2024 | VLDB | 5.3647775e-05 |
| 5,832 | Stage: Query Execution Time Prediction in Amazon Redshift | 2024 | SIGMOD | 5.3111109e-05 |
| 7,889 | Cost-Intelligent Data Analytics in the Cloud | 2024 | CIDR | 4.6253386e-05 |
| 8,020 | The Holon Approach for Simultaneously Tuning Multiple Components in a Self-Driving Database Management System with Machine Learning via Synthesized Proto-Actions | 2024 | VLDB | 4.6040862e-05 |
| 8,617 | A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning | 2024 | VLDB | 4.4846425e-05 |
| 8,956 | T3: Accurate and Fast Performance Prediction for Relational Database Systems With Compiled Decision Trees | 2025 | SIGMOD | 4.4214154e-05 |
| 10,726 | Improving DBMS Scheduling Decisions with Accurate Performance Prediction on Concurrent Queries | 2025 | VLDB | 4.1945683e-05 |
| 10,859 | Graph Transformers for Query Plan Representation: Potentials and Challenges | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 41 of 41 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,459 | Multi-dimensional Resource Scheduling for Parallel Queries | 1996 | SIGMOD | 8.7676516e-05 |
| 4,061 | Advanced Partitioning Techniques for Massively Distributed Computation | 2012 | SIGMOD | 6.483587e-05 |
| 4,802 | Resource Elasticity for Large-Scale Machine Learning | 2015 | SIGMOD | 5.9114415e-05 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 4,700 | Schedule Optimization for Data Processing Flows on the Cloud | 2011 | SIGMOD | 5.9882572e-05 |
| 5,297 | Continuous Cloud-Scale Query Optimization and Processing | 2013 | VLDB | 5.5801669e-05 |
| 3,625 | Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings | 2020 | SIGMOD | 6.9055212e-05 |
| 8,617 | A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning | 2024 | VLDB | 4.4846425e-05 |
| 11,415 | Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications | 2022 | VLDB | 4.1945683e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |