The Myria Big Data Management and Analytics System and Cloud Service
Summary: Myria is an end-to-end big-data management and analytics stack and cloud service from the University of Washington (UW). It integrates a scalable parallel engine with operational tooling and usability features aimed at domain scientists. The paper presents Myria's core design choices, innovations, and deployment lessons from real data-science workloads.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID: 300
- Venue: CIDR
- Year: 2017
- Pagerank: 6.5651188e-05
- Overall Rank: 3,982 | 72.30%
- DOI: -
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 14 of 14 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 2,099 | Axiomatic Foundations and Algorithms for Deciding Semantic Equivalences of SQL Queries | 2018 | VLDB | 9.5479391e-05 |
| 2,919 | RaSQL: Greater Power and Performance for Big Data Analytics with Recursive-aggregate-SQL on Spark | 2019 | SIGMOD | 7.9047279e-05 |
| 2,954 | Magpie: Python at Speed and Scale using Cloud Backends | 2021 | CIDR | 7.8262582e-05 |
| 3,265 | RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! - | 2018 | VLDB | 7.3083672e-05 |
| 3,343 | Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads | 2017 | VLDB | 7.1967343e-05 |
| 3,948 | A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics | 2018 | VLDB | 6.5959084e-05 |
| 4,689 | Algorithmic Aspects of Parallel Query Processing | 2018 | SIGMOD | 5.9980099e-05 |
| 4,920 | Shared Arrangements: practical inter-query sharing for streaming dataflows | 2020 | VLDB | 5.8241888e-05 |
| 7,990 | Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD | 2024 | VLDB | 4.6117441e-05 |
| 9,607 | Polyglot Data Management: State of the Art & Open Challenges | 2022 | VLDB | 4.3177432e-05 |
| 9,608 | Unified Data Analytics: State-of-the-art and Open Problems | 2022 | VLDB | 4.3177432e-05 |
| 9,917 | Check Out the Big Brain on BRAD: Simplifying Cloud Data Processing with Learned Automated Data Meshes | 2023 | VLDB | 4.2561557e-05 |
| 10,591 | Accio: Bolt-on Query Federation | 2025 | VLDB | 4.1945683e-05 |
| 10,810 | DortDB: Bridging Query Languages for Multi-Model Data Ponds | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 15 of 15 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 37 | Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud | 2012 | VLDB | 0.0007522744 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 167 | The Snowflake Elastic Data Warehouse | 2016 | SIGMOD | 0.00039180521 |
| 318 | Overview of SciDB: Large Scale Array Storage, Processing and Analysis | 2010 | SIGMOD | 0.00027795661 |
| 476 | Impala: A Modern, Open-Source SQL Engine for Hadoop | 2015 | CIDR | 0.00022226941 |
| 1,023 | Query Steering for Interactive Data Exploration | 2013 | CIDR | 0.00014611863 |
| 1,411 | Communication Steps for Parallel Query Processing | 2013 | PODS | 0.0001212565 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 2,234 | PerfEnforce Demonstration: Data Analytics with Performance Guarantees | 2016 | SIGMOD | 9.2272296e-05 |
| 3,343 | Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads | 2017 | VLDB | 7.1967343e-05 |
| 3,377 | Demonstration of the Myria Big Data Management Service | 2014 | SIGMOD | 7.1624478e-05 |
| 3,809 | Changing the Face of Database Cloud Services with Personalized Service Level Agreements | 2015 | CIDR | 6.7409982e-05 |
| 4,696 | Asynchronous and Fault-Tolerant Recursive Datalog Evaluation in Shared-Nothing Engines | 2015 | VLDB | 5.9911301e-05 |
| 8,022 | FORWARD: Data-Centric UIs using Declarative Templates that Efficiently Wrap Third-Party JavaScript Components | 2014 | VLDB | 4.6038333e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 7,889 | Cost-Intelligent Data Analytics in the Cloud | 2024 | CIDR | 4.6253386e-05 |
| 8,136 | Big Data and Cloud Computing: New Wine or just New Bottles? | 2010 | VLDB | 4.5775272e-05 |
| 8,416 | Towards Building Autonomous Data Services on Azure | 2023 | SIGMOD | 4.5196199e-05 |
| 11,949 | Big Data Research: Will Industry Solve all the Problems? | 2015 | VLDB | 4.1945683e-05 |
| 11,635 | Automated Performance Management for the Big Data Stack | 2019 | CIDR | 4.1945683e-05 |
| 11,668 | Cost-Effective, Workload-Adaptive Migration of Big Data Applications to the Cloud | 2019 | SIGMOD | 4.1945683e-05 |
| 7,217 | Myriad: Scalable and Expressive Data Generation | 2012 | VLDB | 4.7983955e-05 |
| 3,343 | Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads | 2017 | VLDB | 7.1967343e-05 |
| 13,356 | Big Data Science Needs Big Data Middleware | 2015 | CIDR | - |
| 3,377 | Demonstration of the Myria Big Data Management Service | 2014 | SIGMOD | 7.1624478e-05 |