Database Paper Browser

RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -

Summary: Rheem enables cross-platform data processing by decoupling applications from execution platforms and partitioning tasks into subtasks. A cost-based optimizer selects platforms and maps subtasks to them, and an executor orchestrates the resulting multi-platform workflow to lower cost. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 11630
Venue: VLDB
Year: 2018
Pagerank: 7.3083672e-05
Overall Rank: 3,265 | 77.29%
DOI: 10.14778/3236187.3236195

Incoming Citations (Sorted by Pagerank)

Showing 21 of 21 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
4,293 | ESTOCADA: Towards Scalable Polystore Systems | 2020 | VLDB | 6.2885419e-05
5,402 | Towards Scalable Hybrid Stores: Constraint-Based Rewriting to the Rescue | 2019 | SIGMOD | 5.5278023e-05
5,731 | Babelfish: Efficient Execution of Polyglot Queries | 2022 | VLDB | 5.3502065e-05
5,840 | Logical and Physical Optimizations for SQL Query Execution over Large Language Models | 2025 | SIGMOD | 5.3042561e-05
6,519 | Expand your Training Limits! Generating Training Data for ML-based Data Management | 2021 | SIGMOD | 5.0316686e-05
6,981 | Dataset Relationship Management | 2019 | CIDR | 4.8743957e-05
7,077 | Skeena: Efficient and Consistent Cross-Engine Transactions | 2022 | SIGMOD | 4.8425226e-05
7,990 | Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD | 2024 | VLDB | 4.6117441e-05
8,758 | Hyperspace: The Indexing Subsystem of Azure Synapse | 2021 | VLDB | 4.456315e-05
9,001 | The Power of Nested Parallelism in Big Data Processing – Hitting Three Flies with One Slap – | 2021 | SIGMOD | 4.4107627e-05
9,125 | On-Demand State Separation for Cloud Data Warehousing | 2022 | VLDB | 4.3917246e-05
9,292 | Farm Your ML-based Query Optimizer's Food! - Human-Guided Training Data Generation - | 2022 | CIDR | 4.3619543e-05
9,473 | Apache Wayang in Action: Enabling Data Systems Integration via a Unified Data Analytics Framework | 2025 | SIGMOD | 4.3341665e-05
9,607 | Polyglot Data Management: State of the Art & Open Challenges | 2022 | VLDB | 4.3177432e-05
9,608 | Unified Data Analytics: State-of-the-art and Open Problems | 2022 | VLDB | 4.3177432e-05
9,917 | Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes | 2023 | VLDB | 4.2561557e-05
10,263 | APEROL: Adaptive Parallel Edge-to-cloud Runtime Optimization for Layered Workflow Execution | 2026 | VLDB | 4.1945683e-05
10,482 | Fast and Scalable Data Transfer Across Data Systems | 2025 | SIGMOD | 4.1945683e-05
10,591 | Accio: Bolt-on Query Federation | 2025 | VLDB | 4.1945683e-05
11,197 | QaaD (Query-as-a-Data): Scalable Execution of Massive Number of Small Queries in Spark | 2023 | SIGMOD | 4.1945683e-05
11,502 | In the Land of Data Streams where Synopses are Missing, One Framework to Bring Them All | 2021 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 22 of 22 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614
42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298
71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975
165 | Federated Database Systems for Managing Distributed, Heterogeneous, and Autonomous Databases | 1991 | VLDB | 0.00039502525
557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988
650 | Robust Query Processing through Progressive Optimization | 2004 | SIGMOD | 0.00018659177
1,012 | NADEEF: A Commodity Data Cleaning System | 2013 | SIGMOD | 0.0001464733
1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695
1,750 | Weld: A Common Runtime for High Performance Data Analytics | 2017 | CIDR | 0.00010683647
1,820 | A Demonstration of the BigDAWG Polystore System | 2015 | VLDB | 0.00010428281
1,977 | Split Query Processing in Polybase | 2013 | SIGMOD | 9.8824589e-05
2,611 | Opening the Black Boxes in Data Flow Optimization | 2012 | VLDB | 8.4536967e-05
2,946 | BigDansing: A System for Big Data Cleansing | 2015 | SIGMOD | 7.8372441e-05
3,034 | How to Fit when No One Size Fits | 2013 | CIDR | 7.6752083e-05
3,562 | MISO: Souping Up Big Data Query Processing with a Multistore System | 2014 | SIGMOD | 6.9694564e-05
3,571 | Lightning Fast and Space Efficient Inequality Joins | 2015 | VLDB | 6.9580858e-05
3,710 | Optimizing Analytic Data Flows for Multiple Execution Engines | 2012 | SIGMOD | 6.8238962e-05
3,982 | The Myria Big Data Management and Analytics System and Cloud Service | 2017 | CIDR | 6.5651188e-05
4,120 | Husky: Towards a More Efficient and Expressive Distributed Computing Framework | 2016 | VLDB | 6.4364588e-05
5,058 | A Demo of the Data Civilizer System | 2017 | SIGMOD | 5.7280139e-05
6,986 | A Cost-based Optimizer for Gradient Descent Optimization | 2017 | SIGMOD | 4.8727048e-05
9,810 | Rheem: Enabling Multi-Platform Task Execution | 2016 | SIGMOD | 4.278405e-05

Semantically Similar Papers