Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications
Summary: Casper translates Java programs into MapReduce-style implementations for Hadoop, Spark, and Flink. It uses program synthesis to infer a MapReduce summary, verified by a theorem prover, then emits executable code; benchmarks show up to 48.2x speedups. (summarized by gpt-5-nano on Feb 09 2026)
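To make the summary concrete, here is a minimal sketch of the kind of rewrite it describes: a sequential Java loop alongside a MapReduce-style equivalent. This is not Casper's output. The class name `WordCountSketch` and both method names are hypothetical, and plain Java streams stand in for Hadoop, Spark, or Flink. The equality check in `main` tests a single input only; it is not a substitute for the theorem-prover verification mentioned above.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.stream.Collectors;

// Hypothetical illustration; none of these names come from Casper.
class WordCountSketch {

    // Sequential form: an imperative loop that updates a shared map.
    static Map<String, Integer> sequentialCount(List<String> words) {
        Map<String, Integer> counts = new HashMap<>();
        for (String w : words) {
            counts.put(w, counts.getOrDefault(w, 0) + 1);
        }
        return counts;
    }

    // MapReduce-style form: map each word to a (word, 1) pair,
    // then reduce the values of equal keys with addition.
    static Map<String, Integer> mapReduceCount(List<String> words) {
        return words.stream()
                .map(w -> Map.entry(w, 1))                 // map phase
                .collect(Collectors.toMap(
                        Map.Entry::getKey,
                        Map.Entry::getValue,
                        Integer::sum));                    // reduce phase
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("a", "b", "a", "c", "b", "a");
        // Spot-check that both forms agree on this one input.
        System.out.println(sequentialCount(words).equals(mapReduceCount(words)));
    }
}
```

The reduce step uses addition, which is associative and commutative, so a framework can combine partial counts in any order across workers.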
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,804 | Extending Relational Query Processing with ML Inference | 2020 | CIDR | 8.0935487e-05 |
| 4,648 | Aggify: Lifting the Curse of Cursor Loops using Custom Aggregates | 2020 | SIGMOD | 6.0247446e-05 |
| 5,614 | New Directions in Cloud Programming | 2021 | CIDR | 5.4101976e-05 |
| 8,534 | Translation of Array-Based Loops to Distributed Data-Parallel Programs | 2020 | VLDB | 4.4937074e-05 |
| 8,645 | Predicate Pushdown for Data Science Pipelines | 2023 | SIGMOD | 4.4772518e-05 |
| 11,300 | Towards Auto-Generated Data Systems | 2023 | VLDB | 4.1945683e-05 |
| 11,513 | TraNCE: Transforming Nested Collections Efficiently | 2021 | VLDB | 4.1945683e-05 |
| 11,542 | View-Driven Optimization of Database-Backed Web Applications | 2020 | CIDR | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1 | Access Path Selection in a Relational Database Management System | 1979 | SIGMOD | 0.0040449103 |
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 704 | Building Efficient Query Engines in a High-Level Language | 2014 | VLDB | 0.00017900583 |
| 1,750 | Weld: A Common Runtime for High Performance Data Analytics | 2017 | CIDR | 0.00010683647 |
| 2,818 | Implicit Parallelism through Deep Language Embedding | 2015 | SIGMOD | 8.0665558e-05 |
| 3,296 | Extracting Equivalent SQL from Imperative Code in Database Applications | 2016 | SIGMOD | 7.2596583e-05 |