SparkR: Scaling R Programs with Spark
Summary: R's single-threaded runtime and single-machine memory limits constrain interactive analysis of large datasets. SparkR adds an R frontend to Apache Spark, enabling scalable, distributed data analysis from the R shell through Spark's DataFrame API and distributed computation engine. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Shivaram Venkataraman
2. Zongheng Yang
3. Davies Liu
4. Eric Liang
5. Hossein Falaki
6. Xiangrui Meng
7. Reynold Xin
8. Ali Ghodsi
9. Michael Franklin
10. Ion Stoica
11. Matei Zaharia
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,377 | Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics | 2021 | CIDR | 0.00012296941 |
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 2,194 | Enabling and Optimizing Non-linear Feature Interactions in Factorized Linear Algebra | 2019 | SIGMOD | 9.3138337e-05 |
| 5,731 | Babelfish: Efficient Execution of Polyglot Queries | 2022 | VLDB | 5.3502065e-05 |
| 9,584 | Introduction to Spark 2.0 for Database Researchers | 2016 | SIGMOD | 4.3218691e-05 |
| 10,969 | Query Compilation Without Regrets | 2024 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 476 | Impala: A Modern, Open-Source SQL Engine for Hadoop | 2015 | CIDR | 0.00022226941 |
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 1,495 | Ricardo: Integrating R and Hadoop | 2010 | SIGMOD | 0.00011691049 |
| 3,535 | Scaling Spark in the Real World: Performance and Usability | 2015 | VLDB | 6.9992495e-05 |
| 5,395 | Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction | 2015 | SIGMOD | 5.5318806e-05 |
| 6,870 | Stat! - An Interactive Analytics Environment for Big Data | 2013 | SIGMOD | 4.9004414e-05 |