Early Accurate Results for Advanced Analytics on MapReduce
Summary: EARL extends Hadoop with a non-parametric, incremental early-result library for arbitrary workflows; bootstrapped online accuracy estimates apply to any function. Minimal MapReduce changes; experiments show major speed-ups on Hadoop for common analytics workloads. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Nikolay Laptev
- 2. Kai Zeng
- 3. Carlo Zaniolo
Incoming Citations (Sorted by Pagerank)
Showing 12 of 12 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 14 | Online Aggregation | 1997 | SIGMOD | 0.0010801504 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 413 | HaLoop: Efficient Iterative Data Processing on Large Clusters | 2010 | VLDB | 0.00023904409 |
| 1,071 | Starfish: A Self-tuning System for Big Data Analytics | 2011 | CIDR | 0.00014312777 |
| 1,464 | Online Aggregation for Large MapReduce Jobs | 2011 | VLDB | 0.00011865546 |
| 1,797 | Effective Use of Block-Level Sampling in Statistics Estimation | 2004 | SIGMOD | 0.00010523169 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 2,736 | Online Aggregation and Continuous Query support in MapReduce | 2010 | SIGMOD | 8.2043187e-05 |
| 3,167 | Relational Confidence Bounds Are Easy With The Bootstrap* | 2005 | SIGMOD | 7.4523397e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,736 | Online Aggregation and Continuous Query support in MapReduce | 2010 | SIGMOD | 8.2043187e-05 |
| 1,464 | Online Aggregation for Large MapReduce Jobs | 2011 | VLDB | 0.00011865546 |
| 9,375 | Efficient Big Data Processing in Hadoop MapReduce | 2012 | VLDB | 4.347384e-05 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 11,933 | FP-Hadoop: Efficient Execution of Parallel Jobs Over Skewed Data | 2015 | VLDB | 4.1945683e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
| 794 | Hadoop++: Making a Yellow Elephant Run Like a Cheetah (Without It Even Noticing) | 2010 | VLDB | 0.00016605103 |
| 5,903 | Building Wavelet Histograms on Large Data in MapReduce | 2012 | VLDB | 5.2791351e-05 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |