Database Paper Browser

Ricardo: Integrating R and Hadoop

Summary: Ricardo integrates R and Hadoop for scalable deep analytics. It decomposes each analysis into R computations and Hadoop data-management steps to minimize data transfer between the two systems. This avoids re-implementing statistical or data-management functionality and lets analysts run large analyses inside familiar tools. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 4311
Venue: SIGMOD
Year: 2010
Pagerank: 0.00011691049
Overall Rank: 1,495 | 89.61%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 19 of 19 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
1,158 | Simulation of Database-Valued Markov Chains Using SimSQL | 2013 | SIGMOD | 0.0001361064
1,167 | Learning Generalized Linear Models Over Normalized Data | 2015 | SIGMOD | 0.00013547713
1,265 | Jaql: A Scripting Language for Large Scale Semistructured Data Analysis | 2011 | VLDB | 0.00012947629
1,402 | Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML | 2014 | VLDB | 0.00012180605
1,967 | Compressed Linear Algebra for Large-Scale Machine Learning | 2016 | VLDB | 9.9131712e-05
2,255 | LINVIEW: Incremental View Maintenance for Complex Analytical Queries | 2014 | SIGMOD | 9.1884983e-05
2,667 | Cumulon: Optimizing Statistical Data Analysis in the Cloud | 2013 | SIGMOD | 8.3413995e-05
3,345 | QuickFOIL: Scalable Inductive Logic Programming | 2015 | VLDB | 7.1958815e-05
3,455 | A Comparison of Platforms for Implementing and Running Very Large Scale Machine Learning Algorithms | 2014 | SIGMOD | 7.0771839e-05
4,077 | Towards High-Throughput Gibbs Sampling at Scale: A Study across Storage Managers | 2013 | SIGMOD | 6.4678697e-05
5,395 | Large-scale Predictive Analytics in Vertica: Fast Data Transfer, Distributed Model Creation, and In-database Prediction | 2015 | SIGMOD | 5.5318806e-05
5,964 | Bridging Two Worlds with RICE: Integrating R into the SAP In-Memory Computing Engine | 2011 | VLDB | 5.2520617e-05
6,322 | The BUDS Language for Distributed Bayesian Machine Learning | 2017 | SIGMOD | 5.1124615e-05
6,541 | ConnectorX: Accelerating Data Loading From Databases to Dataframes | 2022 | VLDB | 5.0216945e-05
6,542 | Profiling R on a Contemporary Processor | 2015 | VLDB | 5.0216639e-05
6,784 | SparkR: Scaling R Programs with Spark | 2016 | SIGMOD | 4.9265155e-05
7,306 | DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines | 2022 | CIDR | 4.7678574e-05
11,859 | dmapply: A functional primitive to express distributed machine learning algorithms in R | 2016 | VLDB | 4.1945683e-05
12,062 | Next Generation Data Analytics at IBM Research | 2013 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 6 of 6 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614
70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166
168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305
928 | Requirements for Science Data Bases and SciDB | 2009 | CIDR | 0.00015247726
1,076 | RIOT: I/O-Efficient Numerical Computing without SQL | 2009 | CIDR | 0.00014248449
2,217 | Introduction to Recommender Systems | 2008 | SIGMOD | 9.2690171e-05

Semantically Similar Papers