Database Paper Browser


Opportunistic Physical Design for Big Data Analytics

Summary: Treats materialized intermediates from exploratory MapReduce jobs as an opportunistic physical design. Proposes a semantic UDF model to reuse UDF-containing views and a minimum-cost rewrite algorithm (with provable guarantees); Hive prototype shows dramatic speedups. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4883
Venue: SIGMOD
Year: 2014
Pagerank: 5.223901e-05
Overall Rank: 6,075 | 57.74%
DOI: 10.1145/2685555.2680152

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 6 of 6 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
82 | Answering Queries Using Views (Extended Abstract) | 1995 | PODS | 0.00054402763
140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404
158 | Automated Selection of Materialized Views and Indexes for SQL Databases | 2000 | VLDB | 0.00040071492
542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648
731 | Optimizing Queries Using Materialized Views: A Practical, Scalable Solution | 2001 | SIGMOD | 0.00017468889
947 | MRShare: Sharing Across Multiple Queries in MapReduce | 2010 | VLDB | 0.00015114576
979 | Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads | 2012 | VLDB | 0.0001488055
1,059 | Answering Complex SQL Queries Using Automatic Summary Tables | 2000 | SIGMOD | 0.00014382575
1,155 | A Scalable Algorithm for Answering Queries Using Views | 2000 | VLDB | 0.00013616518
2,205 | ReStore: Reusing Results of MapReduce Jobs | 2012 | VLDB | 9.2920002e-05
2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05
2,611 | Opening the Black Boxes in Data Flow Optimization | 2012 | VLDB | 8.4536967e-05
3,562 | MISO: Souping Up Big Data Query Processing with a Multistore System | 2014 | SIGMOD | 6.9694564e-05
3,710 | Optimizing Analytic Data Flows for Multiple Execution Engines | 2012 | SIGMOD | 6.8238962e-05
4,082 | On the Content of Materialized Aggregate Views | 2000 | PODS | 6.4639136e-05
5,144 | Scalable Query Rewriting: A Graph-Based Approach | 2011 | SIGMOD | 5.6651982e-05
6,821 | Hadoop's Adolescence: An analysis of Hadoop usage in scientific workloads | 2013 | VLDB | 4.9156923e-05
9,009 | Odyssey: A Multi-Store System for Evolutionary Analytics | 2013 | VLDB | 4.4100992e-05

Semantically Similar Papers