PerfXplain: Debugging MapReduce Job Performance
Summary: PerfXplain lets users query the runtimes of MapReduce jobs against a log of past job executions and explains the performance differences between them. It characterizes explanations by their relevance, precision, and generality, and generates them with a decision-tree-based method. The system is evaluated on EC2 against baselines.
(summarized by gpt-5-nano on Feb 09 2026)
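To make the precision and generality criteria in the summary concrete, here is a minimal Python sketch. It is not PerfXplain's algorithm or query language. It assumes simplified definitions (precision as the share of matching job pairs that show the observed behavior, generality as the share of all pairs that match), and the `JobPair` type, the feature names, and the toy log are all hypothetical. Picking the single best feature loosely mirrors choosing one split in a decision tree.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class JobPair:
    """One pair of past jobs from a (hypothetical) execution log."""
    features: Dict[str, bool]  # hypothetical pairwise features
    observed: bool             # True if the pair shows the behavior to explain


Explanation = Callable[[JobPair], bool]


def has(name: str) -> Explanation:
    """Build a predicate that holds when the named feature is true."""
    return lambda pair: pair.features.get(name, False)


def precision(pairs: List[JobPair], explanation: Explanation) -> float:
    """Assumed definition: share of matching pairs that show the behavior."""
    matching = [p for p in pairs if explanation(p)]
    if not matching:
        return 0.0
    return sum(1 for p in matching if p.observed) / len(matching)


def generality(pairs: List[JobPair], explanation: Explanation) -> float:
    """Assumed definition: share of all pairs that the explanation matches."""
    if not pairs:
        return 0.0
    return sum(1 for p in pairs if explanation(p)) / len(pairs)


def best_single_feature(pairs: List[JobPair]) -> str:
    """Greedy choice of one feature, ranked by precision, then generality."""
    names = sorted({name for p in pairs for name in p.features})
    return max(
        names,
        key=lambda n: (precision(pairs, has(n)), generality(pairs, has(n))),
    )


# Toy log: "observed" means the first job of the pair ran slower.
log = [
    JobPair({"input_larger": True, "fewer_nodes": False}, True),
    JobPair({"input_larger": True, "fewer_nodes": True}, True),
    JobPair({"input_larger": False, "fewer_nodes": True}, False),
    JobPair({"input_larger": False, "fewer_nodes": False}, False),
]

best = best_single_feature(log)
```

In this toy log, `input_larger` matches half of the pairs and every matching pair is slower, so it ranks above `fewer_nodes`, which matches the same number of pairs but predicts the slowdown only half the time.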
- Paper ID: 10504
- Venue: VLDB
- Year: 2012
- Pagerank: 0.00011468393
- Overall Rank: 1,534 | 89.33%
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 22 of 22 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 214 | Scorpion: Explaining Away Outliers in Aggregate Queries | 2013 | VLDB | 0.0003363692 |
| 942 | A Formal Approach to Finding Explanations for Database Queries | 2014 | SIGMOD | 0.00015155714 |
| 1,022 | DBSherlock: A Performance Diagnostic Tool for Transactional Databases | 2016 | SIGMOD | 0.00014614917 |
| 1,840 | dbTouch: Analytics at your Fingertips | 2013 | CIDR | 0.0001034905 |
| 2,139 | Diagnosing Root Causes of Intermittent Slow Queries in Cloud Databases | 2020 | VLDB | 9.4640037e-05 |
| 2,154 | DIFF: A Relational Interface for Large-Scale Data Explanation | 2019 | VLDB | 9.4208667e-05 |
| 2,402 | Causality and Explanations in Databases | 2014 | VLDB | 8.8928361e-05 |
| 2,649 | Explaining Query Answers with Explanation-Ready Databases | 2016 | VLDB | 8.3719123e-05 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 3,105 | Data X-Ray: A Diagnostic Tool for Data Errors | 2015 | SIGMOD | 7.5568954e-05 |
| 4,361 | The Complexity of Resilience and Responsibility for Self-Join-Free Conjunctive Queries | 2016 | VLDB | 6.2559141e-05 |
| 4,802 | Resource Elasticity for Large-Scale Machine Learning | 2015 | SIGMOD | 5.9114415e-05 |
| 5,191 | Going Beyond Provenance: Explaining Query Answers with Pattern-based Counterbalances | 2019 | SIGMOD | 5.6378768e-05 |
| 5,445 | QFix: Diagnosing Errors through Query Histories | 2017 | SIGMOD | 5.5020909e-05 |
| 6,124 | iQCAR: inter-Query Contention Analyzer for Data Analytics Frameworks | 2019 | SIGMOD | 5.1988046e-05 |
| 6,160 | A Demonstration of Interactive Analysis of Performance Measurements with Viska | 2017 | SIGMOD | 5.1758344e-05 |
| 6,475 | Explain3D: Explaining Disagreements in Disjoint Datasets | 2019 | VLDB | 5.0497183e-05 |
| 6,779 | Explaining Inference Queries with Bayesian Optimization | 2021 | VLDB | 4.9280116e-05 |
| 6,821 | Hadoop's Adolescence: An analysis of Hadoop usage in scientific workloads | 2013 | VLDB | 4.9156923e-05 |
| 7,296 | Multi-Tenant Cloud Data Services: State-of-the-Art, Challenges and Opportunities | 2022 | SIGMOD | 4.7723197e-05 |
| 9,779 | iQCAR: A Demonstration of an Inter-Query Contention Analyzer for Cluster Computing Frameworks | 2018 | SIGMOD | 4.2856106e-05 |
| 11,949 | Big Data Research: Will Industry Solve all the Problems? | 2015 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 18 of 18 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613 |
| 158 | Automated Selection of Materialized Views and Indexes for SQL Databases | 2000 | VLDB | 0.00040071492 |
| 237 | An Efficient, Cost-Driven Index Selection Tool for Microsoft SQL Server | 1997 | VLDB | 0.00031726304 |
| 424 | Tuning Database Configuration Parameters with iTuned | 2009 | VLDB | 0.00023616398 |
| 496 | Automatic SQL Tuning in Oracle 10g | 2004 | VLDB | 0.00021728655 |
| 516 | AutoAdmin "What-if" Index Analysis Utility | 1998 | SIGMOD | 0.00021196031 |
| 661 | Database Tuning Advisor for Microsoft SQL Server 2005 | 2004 | VLDB | 0.00018481174 |
| 794 | Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) | 2010 | VLDB | 0.00016605103 |
| 953 | Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance | 2010 | VLDB | 0.00015095431 |
| 1,071 | Starfish: A Self-tuning System for Big Data Analytics | 2011 | CIDR | 0.00014312777 |
| 1,280 | Automatic Optimization for MapReduce Programs | 2011 | VLDB | 0.0001285503 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 1,770 | ParaTimer: A Progress Indicator for MapReduce DAGs | 2010 | SIGMOD | 0.00010618229 |
| 3,653 | Database Tuning Advisor for Microsoft SQL Server 2005: Demo | 2005 | SIGMOD | 6.8743355e-05 |
| 4,436 | Xplus: A SQL-Tuning-Aware Query Optimizer | 2010 | VLDB | 6.1909336e-05 |
| 4,936 | Why Did My Query Slow Down? | 2009 | CIDR | 5.8193534e-05 |
| 5,010 | iTuned: A Tool for Configuring and Visualizing Database Parameters | 2010 | SIGMOD | 5.7611118e-05 |
Semantically Similar Papers