A Comparison of Approaches to Large-Scale Data Analysis
Summary: Compare MapReduce with parallel DBMSs for large-scale data analysis, tying MR to decades of parallel-SQL work. A 100-node benchmark finds DBMSs load/tune longer but run faster than MR; discusses causes and future-system implications. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Andrew Pavlo
- 2. Erik Paulson
- 3. Alexander Rasin
- 4. Daniel J. Abadi
- 5. David J. DeWitt
- 6. Samuel Madden
- 7. Michael Stonebraker
Incoming Citations (Sorted by Pagerank)
Showing 22 of 72 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 20 | GAMMA - A High Performance Dataflow Database Machine | 1986 | VLDB | 0.00086459551 |
| 22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613 |
| 78 | Multiprocessor Hash-Based Join Algorithms | 1985 | VLDB | 0.00056413752 |
| 168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305 |
| 202 | LINQ: Reconciling Objects, Relations and XML in the .NET Framework | 2006 | SIGMOD | 0.00034920912 |
| 520 | An Overview of The System Software of A Parallel Relational Database Machine GRACE | 1986 | VLDB | 0.00021152636 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,061 | Towards Energy-Efficient Database Cluster Design | 2012 | VLDB | 5.2304505e-05 |
| 13,513 | Database Systems Research on Data Mining | 2010 | SIGMOD | - |
| 658 | Towards a Unified Architecture for in-RDBMS Analytics | 2012 | SIGMOD | 0.00018506577 |
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 979 | Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads | 2012 | VLDB | 0.0001488055 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |