Shark: Fast Data Analysis Using Coarse-grained Distributed Memory
Summary: Shark is a data-analysis system built on a coarse-grained distributed shared-memory abstraction that unifies SQL querying with analytics run close to the data. It scales to thousands of nodes in a fault-tolerant manner, and it answers queries 40x faster than Hive and runs machine-learning programs 25x faster than MapReduce on large datasets. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
1. Cliff Engle
2. Antonio Lupher
3. Reynold Xin
4. Matei Zaharia
5. Michael J. Franklin
6. Scott Shenker
7. Ion Stoica
Incoming Citations (Sorted by Pagerank)
Showing 12 of 12 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,889 | SHARQL: Shape Analysis of Recursive SPARQL Queries | 2020 | SIGMOD | 4.2617199e-05 |
| 9,801 | Amoeba: A Shape changing Storage System for Big Data | 2016 | VLDB | 4.2815507e-05 |
| 8,464 | Piranha: Optimizing Short Jobs in Hadoop | 2013 | VLDB | 4.5052127e-05 |
| 10,307 | SHARD: A Scalable and Resize-optimized Hash Index on Disaggregated Memory | 2026 | VLDB | 4.1945683e-05 |
| 7,511 | Hone: "Scaling Down" Hadoop on Shared-Memory Systems | 2013 | VLDB | 4.7180617e-05 |
| 4,120 | Husky: Towards a More Efficient and Expressive Distributed Computing Framework | 2016 | VLDB | 6.4364588e-05 |
| 1,152 | Blink and It's Done: Interactive Queries on Very Large Data | 2012 | VLDB | 0.00013645792 |
| 1,071 | Starfish: A Self-tuning System for Big Data Analytics | 2011 | CIDR | 0.00014312777 |
| 4,713 | SharkDB: An In-Memory Storage System for Massive Trajectory Data | 2015 | SIGMOD | 5.9786915e-05 |
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |