Database Paper Browser

Back to papers

Computation Reuse in Analytics Job Service at Microsoft

Summary: CloudViews: computation reuse for Microsoft’s SCOPE analytics service. Online materialized views capture recurring workloads; a feedback loop uses compile-time/run-time stats to estimate utility vs. cost, enabling online, no-offline materialization. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
5489
Venue
SIGMOD
Year
2018
Pagerank
6.3856219e-05
Overall Rank
4,174 | 70.97%
DOI
10.1145/3183713.3190656

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 30 of 30 citing papers.

Rank Citing Paper Year Venue Pagerank
1,703 Are We Ready For Learned Cardinality Estimation? 2021 VLDB 0.00010836769
1,922 Selecting Subexpressions to Materialize at Datacenter Scale 2018 VLDB 0.00010082599
2,083 Towards a Learning Optimizer for Shared Clouds 2019 VLDB 9.5834572e-05
3,625 Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings 2020 SIGMOD 6.9055212e-05
3,973 Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing 2019 SIGMOD 6.5758017e-05
4,248 Hyper Dimension Shuffle: Efficient Data Repartition at Petabyte Scale in SCOPE 2019 VLDB 6.3247927e-05
4,690 Deploying a Steered Query Optimizer in Production at Microsoft 2022 SIGMOD 5.997226e-05
5,567 Optimizing Data Pipelines for Machine Learning in Feature Stores 2023 VLDB 5.4305348e-05
6,040 Steering Query Optimizers: A Practical Take on Big Data Workloads 2021 SIGMOD 5.2412035e-05
6,149 Crystal: A Unified Cache Storage System for Analytical Databases 2021 VLDB 5.1847534e-05
6,261 The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward 2021 VLDB 5.1350714e-05
6,885 PilotScope: Steering Databases with Machine Learning Drivers 2024 VLDB 4.895386e-05
6,988 CrocodileDB: Efficient Database Execution through Intelligent Deferment 2020 CIDR 4.8718019e-05
7,476 Lachesis: Automatic Partitioning for UDF-Centric Analytics 2021 VLDB 4.7188928e-05
7,655 Machine Learning for Cloud Data Systems: the Progress so far and the Path Forward 2021 VLDB 4.6872456e-05
7,684 AutoToken: Predicting Peak Parallelism for Big Data Analytics at Microsoft 2020 VLDB 4.6796855e-05
8,002 Pangea: Monolithic Distributed Storage for Data Analytics 2019 VLDB 4.6088289e-05
8,131 Sibyl: Forecasting Time-Evolving Query Workloads 2024 SIGMOD 4.5784634e-05
8,197 SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft 2021 VLDB 4.5607121e-05
8,295 View Selection over Knowledge Graphs in Triple Stores 2021 VLDB 4.5435639e-05
8,416 Towards Building Autonomous Data Services on Azure 2023 SIGMOD 4.5196199e-05
8,582 Towards Query Optimizer as a Service (QOaaS) in a Unified LakeHouse Ecosystem: Can One QO Rule Them All? 2025 CIDR 4.492033e-05
8,758 Hyperspace: The Indexing Subsystem of Azure Synapse 2021 VLDB 4.456315e-05
8,783 GEqO: ML-Accelerated Semantic Equivalence Detection 2023 SIGMOD 4.452825e-05
9,194 Phoebe: A Learning-based Checkpoint Optimizer 2021 VLDB 4.3761777e-05
9,344 Hippo: Sharing Computations in Hyper-Parameter Optimization 2022 VLDB 4.3539442e-05
9,735 SparkCruise: Handsfree Computation Reuse in Spark 2019 VLDB 4.2942813e-05
9,762 QURE: AI-Assisted and Automatically Verified UDF Inlining 2025 SIGMOD 4.2856106e-05
9,800 Cquirrel: Continuous Query Processing over Acyclic Relational Schemas 2021 VLDB 4.2818172e-05
13,196 PikePlace: Generating Intelligence for Marketplace Datasets 2023 VLDB -
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 26 of 26 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
11 Implementing Data Cubes Efficiently 1996 SIGMOD 0.0011708144
22 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets 2008 VLDB 0.0008456613
70 Hive - A Warehousing Solution Over a Map-Reduce Framework 2009 VLDB 0.00059533166
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
158 Automated Selection of Materialized Views and Indexes for SQL Databases 2000 VLDB 0.00040071492
179 Efficient and Extensible Algorithms for Multi Query Optimization 2000 SIGMOD 0.00037672155
182 LEO - DB2's LEarning Optimizer 2001 VLDB 0.00036962631
220 Efficient Mid-Query Re-Optimization of Sub-Optimal Query Execution Plans 1998 SIGMOD 0.00033194808
610 Goods: Organizing Google's Datasets 2016 SIGMOD 0.00019232674
650 Robust Query Processing through Progressive Optimization 2004 SIGMOD 0.00018659177
830 Main-Memory Scan Sharing For Multi-Core CPUs 2008 VLDB 0.00016171897
947 MRShare: Sharing Across Multiple Queries in MapReduce 2010 VLDB 0.00015114576
1,026 Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS 2007 VLDB 0.00014589172
1,272 Proactive Re-Optimization 2005 SIGMOD 0.00012920076
1,353 Data Warehouse Configuration 1997 VLDB 0.00012410919
1,476 Efficient Exploitation of Similar Subexpressions for Query Processing 2007 SIGMOD 0.00011779092
2,205 ReStore: Reusing Results of MapReduce Jobs 2012 VLDB 9.2920002e-05
2,693 An Architecture for Recycling Intermediates in a Column-store 2009 SIGMOD 8.2883398e-05
2,925 Shared Workload Optimization 2014 VLDB 7.888494e-05
3,703 Multi-Query Optimization in MapReduce Framework 2014 VLDB 6.8289978e-05
5,014 Dynamically Optimizing Queries over Large Scale Data Platforms 2014 SIGMOD 5.7586174e-05
5,293 MQJoin: Efficient Shared Execution of Main-Memory Joins 2016 VLDB 5.5815698e-05
5,297 Continuous Cloud-Scale Query Optimization and Processing 2013 VLDB 5.5801669e-05
7,207 Kodiak: Leveraging Materialized Views For Very Low-Latency Analytics Over High-Dimensional Web-Scale Data 2016 VLDB 4.800763e-05
7,689 ROBUS: Fair Cache Allocation for Data-parallel Workloads 2017 SIGMOD 4.6765769e-05
7,833 Dependency-Driven Analytics: a Compass for Uncharted Data Oceans 2017 CIDR 4.6382648e-05
Previous Page 1 / 1 Next

Semantically Similar Papers