RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -
Summary: Rheem enables cross-platform data processing by decoupling applications from the underlying execution platforms and splitting each task into subtasks. A cost-based optimizer maps each subtask to a platform, and an executor orchestrates the resulting multi-platform workflow to lower overall cost.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 11630
- Venue: VLDB
- Year: 2018
- Pagerank: 7.3083672e-05
- Overall Rank: 3,265 | 77.29%
- DOI: 10.14778/3236187.3236195
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 21 of 21 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| 4,293 | ESTOCADA: Towards Scalable Polystore Systems | 2020 | VLDB | 6.2885419e-05 |
| 5,402 | Towards Scalable Hybrid Stores: Constraint-Based Rewriting to the Rescue | 2019 | SIGMOD | 5.5278023e-05 |
| 5,731 | Babelfish: Efficient Execution of Polyglot Queries | 2022 | VLDB | 5.3502065e-05 |
| 5,840 | Logical and Physical Optimizations for SQL Query Execution over Large Language Models | 2025 | SIGMOD | 5.3042561e-05 |
| 6,519 | Expand your Training Limits! Generating Training Data for ML-based Data Management | 2021 | SIGMOD | 5.0316686e-05 |
| 6,981 | Dataset Relationship Management | 2019 | CIDR | 4.8743957e-05 |
| 7,077 | Skeena: Efficient and Consistent Cross-Engine Transactions | 2022 | SIGMOD | 4.8425226e-05 |
| 7,990 | Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD | 2024 | VLDB | 4.6117441e-05 |
| 8,758 | Hyperspace: The Indexing Subsystem of Azure Synapse | 2021 | VLDB | 4.456315e-05 |
| 9,001 | The Power of Nested Parallelism in Big Data Processing – Hitting Three Flies with One Slap – | 2021 | SIGMOD | 4.4107627e-05 |
| 9,125 | On-Demand State Separation for Cloud Data Warehousing | 2022 | VLDB | 4.3917246e-05 |
| 9,292 | Farm Your ML-based Query Optimizer's Food! - Human-Guided Training Data Generation - | 2022 | CIDR | 4.3619543e-05 |
| 9,473 | Apache Wayang in Action: Enabling Data Systems Integration via a Unified Data Analytics Framework | 2025 | SIGMOD | 4.3341665e-05 |
| 9,607 | Polyglot Data Management: State of the Art & Open Challenges | 2022 | VLDB | 4.3177432e-05 |
| 9,608 | Unified Data Analytics: State-of-the-art and Open Problems | 2022 | VLDB | 4.3177432e-05 |
| 9,917 | Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes | 2023 | VLDB | 4.2561557e-05 |
| 10,263 | APEROL: Adaptive Parallel Edge-to-cloud Runtime Optimization for Layered Workflow Execution | 2026 | VLDB | 4.1945683e-05 |
| 10,482 | Fast and Scalable Data Transfer Across Data Systems | 2025 | SIGMOD | 4.1945683e-05 |
| 10,591 | Accio: Bolt-on Query Federation | 2025 | VLDB | 4.1945683e-05 |
| 11,197 | QaaD (Query-as-a-Data): Scalable Execution of Massive Number of Small Queries in Spark | 2023 | SIGMOD | 4.1945683e-05 |
| 11,502 | In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All | 2021 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298 |
| 71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975 |
| 165 | Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases | 1991 | VLDB | 0.00039502525 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 650 | Robust Query Processing through Progressive Optimization | 2004 | SIGMOD | 0.00018659177 |
| 1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 1,750 | Weld: A Common Runtime for High Performance Data Analytics | 2017 | CIDR | 0.00010683647 |
| 1,820 | A Demonstration of the BigDAWG Polystore System | 2015 | VLDB | 0.00010428281 |
| 1,977 | Split Query Processing in Polybase | 2013 | SIGMOD | 9.8824589e-05 |
| 2,611 | Opening the Black Boxes in Data Flow Optimization | 2012 | VLDB | 8.4536967e-05 |
| 2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05 |
| 3,034 | How to Fit when No One Size Fits | 2013 | CIDR | 7.6752083e-05 |
| 3,562 | MISO: Souping Up Big Data Query Processing with a Multistore System | 2014 | SIGMOD | 6.9694564e-05 |
| 3,571 | Lightning Fast and Space Efficient Inequality Joins | 2015 | VLDB | 6.9580858e-05 |
| 3,710 | Optimizing Analytic Data Flows for Multiple Execution Engines | 2012 | SIGMOD | 6.8238962e-05 |
| 3,982 | The Myria Big Data Management and Analytics System and Cloud Service | 2017 | CIDR | 6.5651188e-05 |
| 4,120 | Husky: Towards a More Efficient and Expressive Distributed Computing Framework | 2016 | VLDB | 6.4364588e-05 |
| 5,058 | A Demo of the Data Civilizer System | 2017 | SIGMOD | 5.7280139e-05 |
| 6,986 | A Cost-based Optimizer for Gradient Descent Optimization | 2017 | SIGMOD | 4.8727048e-05 |
| 9,810 | Rheem: Enabling Multi-Platform Task Execution | 2016 | SIGMOD | 4.278405e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
| 9,203 | Intelligent Automated Workload Analysis for Database Replatforming | 2022 | SIGMOD | 4.3740313e-05 |
| 9,608 | Unified Data Analytics: State-of-the-art and Open Problems | 2022 | VLDB | 4.3177432e-05 |
| 13,356 | Big Data Science Needs Big Data Middleware | 2015 | CIDR | - |
| 11,972 | Palette: Enabling Scalable Analytics for Big-Memory, Multicore Machines | 2014 | SIGMOD | 4.1945683e-05 |
| 13,425 | Data Mining Algorithms as a Service in the Cloud: Exploiting Relational Database Systems | 2013 | SIGMOD | - |
| 9,503 | IReS: Intelligent, Multi-Engine Resource Scheduler for Big Data Analytics Workflows | 2015 | SIGMOD | 4.3341665e-05 |
| 2,458 | REX: Recursive, Delta-Based Data-Centric Computation | 2012 | VLDB | 8.7683462e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 7,050 | REEF: Retainable Evaluator Execution Framework | 2013 | VLDB | 4.85001e-05 |
| 9,810 | Rheem: Enabling Multi-Platform Task Execution | 2016 | SIGMOD | 4.278405e-05 |