Starfish: A Self-tuning System for Big Data Analytics
Summary: Starfish automatically tunes Hadoop MapReduce workflows to improve runtime, resource utilization, and cloud cost without manual knob fiddling. It adapts self-tuning DB techniques—cost models, profiling and reconfiguration—to MapReduce’s workload variability and pay‑as‑you‑go environments. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Herodotos Herodotou
- 2. Harold Lim
- 3. Gang Luo
- 4. Nedyalko Borisov
- 5. Liang Dong
- 6. Fatma Bilgen Cetin
- 7. Shivnath Babu
Incoming Citations (Sorted by Pagerank)
Showing 31 of 31 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305 |
| 794 | Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) | 2010 | VLDB | 0.00016605103 |
| 947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,958 | CARTILAGE: Adding Flexibility to the Hadoop Skeleton | 2013 | SIGMOD | 4.613363e-05 |
| 8,464 | Piranha: Optimizing Short Jobs in Hadoop | 2013 | VLDB | 4.5052127e-05 |
| 3,973 | Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing | 2019 | SIGMOD | 6.5758017e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
| 13,356 | Big Data Science Needs Big Data Middleware | 2015 | CIDR | - |
| 2,488 | Shark: Fast Data Analysis Using Coarse-grained Distributed Memory | 2012 | SIGMOD | 8.6683713e-05 |
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 6,268 | Speedup Your Analytics: Automatic Parameter Tuning for Databases and Big Data Systems | 2019 | VLDB | 5.133857e-05 |
| 8,358 | MapReduce Programming and Cost-based Optimization? Crossing this Chasm with Starfish | 2011 | VLDB | 4.5372998e-05 |