Database Paper Browser

ReStore: Reusing Results of MapReduce Jobs

Summary: ReStore stores and reuses the intermediate results of MapReduce jobs to accelerate future workflows. It reuses the outputs of whole jobs and also materializes operator-level results inside jobs. It is implemented as an extension to Pig on Hadoop and shows speedups on the PigMix benchmark. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10503
Venue: VLDB
Year: 2012
Pagerank: 9.2920002e-05
Overall Rank: 2,205 | 84.67%
DOI: -

[Chart: Incoming Non-self Citations Over Time]

Authors

Incoming Citations (Sorted by Pagerank)

Showing 20 of 20 citing papers.

| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,666 | HELIX: Holistic Optimization for Accelerating Iterative Machine Learning | 2019 | VLDB | 0.0001096361 |
| 1,922 | Selecting Subexpressions to Materialize at Datacenter Scale | 2018 | VLDB | 0.00010082599 |
| 2,576 | S4: Top-k Spreadsheet-Style Search for Query Discovery | 2015 | SIGMOD | 8.5112408e-05 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 3,562 | MISO: Souping Up Big Data Query Processing with a Multistore System | 2014 | SIGMOD | 6.9694564e-05 |
| 3,606 | EVA: A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized Views | 2022 | SIGMOD | 6.9260354e-05 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 4,174 | Computation Reuse in Analytics Job Service at Microsoft | 2018 | SIGMOD | 6.3856219e-05 |
| 6,053 | Optimizing Machine Learning Workloads in Collaborative Environments | 2020 | SIGMOD | 5.2326838e-05 |
| 6,075 | Opportunistic Physical Design for Big Data Analytics | 2014 | SIGMOD | 5.223901e-05 |
| 6,173 | Exploiting Soft and Hard Correlations in Big Data Query Optimization | 2016 | VLDB | 5.1699414e-05 |
| 6,469 | Materialization and Reuse Optimizations for Production Data Science Pipelines | 2022 | SIGMOD | 5.0519488e-05 |
| 7,689 | ROBUS: Fair Cache Allocation for Data-parallel Workloads | 2017 | SIGMOD | 4.6765769e-05 |
| 9,344 | Hippo: Sharing Computations in Hyper-Parameter Optimization | 2022 | VLDB | 4.3539442e-05 |
| 9,781 | D2WORM: A Management Infrastructure for Distributed Data-centric Workflows | 2015 | SIGMOD | 4.2856106e-05 |
| 10,503 | Self-Enhancing Video Data Management System for Compositional Events with Large Language Models | 2025 | SIGMOD | 4.1945683e-05 |
| 11,835 | An Efficient MapReduce Cube Algorithm for Varied Data Distributions | 2016 | SIGMOD | 4.1945683e-05 |
| 11,958 | Shared Execution of Recurring Workloads in MapReduce | 2015 | VLDB | 4.1945683e-05 |
| 11,976 | Anti-Combining for MapReduce | 2014 | SIGMOD | 4.1945683e-05 |
| 12,125 | ReStore: Reusing Results of MapReduce Jobs in Pig | 2012 | SIGMOD | 4.1945683e-05 |

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers