Database Paper Browser

Back to papers

Shark: SQL and Rich Analytics at Scale

Summary: Shark unifies SQL and analytics on clusters via a distributed memory abstraction into a single scalable engine. In-memory columnar storage, replanning, and fault tolerance enable SQL and ML, 100x faster than Hive/Hadoop, competitive with MPP. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4697
Venue
SIGMOD
Year
2013
Pagerank
0.00020595648
Overall Rank
542 | 96.24%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 4 of 54 citing papers.

Rank Citing Paper Year Venue Pagerank
11,948 Tutorial: SQL-on-Hadoop Systems 2015 VLDB 4.1945683e-05
11,974 DoomDB - Kill the Query 2014 SIGMOD 4.1945683e-05
11,993 A Partitioning Framework for Aggressive Data Skipping 2014 VLDB 4.1945683e-05
11,999 Getting Your Big Data Priorities Straight: A Demonstration of Priority-based QoS using Social-network-driven Stock Recommendation 2014 VLDB 4.1945683e-05
Previous Page 2 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
4 Pregel: A System for Large-Scale Graph Processing 2010 SIGMOD 0.0019005923
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
22 SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets 2008 VLDB 0.0008456613
37 Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud 2012 VLDB 0.0007522744
42 A Comparison of Approaches to Large-Scale Data Analysis 2009 SIGMOD 0.00073498298
109 Dremel: Interactive Analysis of Web-Scale Datasets 2010 VLDB 0.00048186983
115 Eddies: Continuously Adaptive Query Processing 2000 SIGMOD 0.00046221215
157 HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads 2009 VLDB 0.00040397359
168 MAD Skills: New Analysis Practices for Big Data 2009 VLDB 0.00038946305
220 Efficient Mid-Query Re-Optimization of Sub-Optimal Query Execution Plans 1998 SIGMOD 0.00033194808
413 HaLoop: Efficient Iterative Data Processing on Large Clusters 2010 VLDB 0.00023904409
456 Cost-based Query Scrambling for Initial Delays 1998 SIGMOD 0.00022717134
658 Towards a Unified Architecture for in-RDBMS Analytics 2012 SIGMOD 0.00018506577
913 Tenzing A SQL Implementation On The MapReduce Framework 2011 VLDB 0.00015408131
1,334 SkewTune: Mitigating Skew in MapReduce Applications 2012 SIGMOD 0.0001250413
1,470 Processing a Trillion Cells per Mouse Click 2012 VLDB 0.00011833779
1,721 Distributed Data-Parallel Computing Using a High-Level Programming Language 2009 SIGMOD 0.00010762918
1,863 Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce 2010 VLDB 0.00010286531
Previous Page 1 / 1 Next

Semantically Similar Papers