Nova: Continuous Pig/Hadoop Workflows
Summary: Nova is a workflow manager that pushes continually arriving data through graphs of Pig programs running on Hadoop. It supports stateful, incremental processing, but handles data in large batches using scalable, disk-based processing. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Christopher Olston
2. Greg Chiou
3. Laukik Chitnis
4. Francis Liu
5. Yiping Han
6. Mattias Larsson
7. Andreas Neumann
8. Vellanki B. N. Rao
9. Vijayanand Sankarasubramanian
10. Siddharth Seth
11. Chao Tian
12. Topher (listed as "Topher")
13. Xiaodan Wang
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,605 | Muppet: MapReduce-Style Processing of Fast Data | 2012 | VLDB | 8.4646171e-05 |
| 8,981 | Data Stream Warehousing | 2013 | SIGMOD | 4.4167397e-05 |
| 9,114 | Data Stream Warehousing in Tidalrace | 2015 | CIDR | 4.3935469e-05 |
| 9,232 | AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes | 2025 | SIGMOD | 4.3690661e-05 |
| 9,266 | Redoop Infrastructure for Recurring Big Data Queries | 2014 | VLDB | 4.3667196e-05 |
| 10,577 | Agamotto: Scheduling of Deadline-Oriented Incremental Query Execution under Uncertain Resource Price | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,400 | Ad-Hoc Data Processing in the Cloud | 2008 | VLDB | 4.1945683e-05 |
| 522 | Differential dataflow | 2013 | CIDR | 0.00021099241 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 3,601 | Large-Scale Machine Learning at Twitter | 2012 | SIGMOD | 6.9315087e-05 |
| 4,857 | The "Big Data" Ecosystem at LinkedIn | 2013 | SIGMOD | 5.8736144e-05 |
| 2,736 | Online Aggregation and Continuous Query support in MapReduce | 2010 | SIGMOD | 8.2043187e-05 |
| 12,125 | ReStore: Reusing Results of MapReduce Jobs in Pig | 2012 | SIGMOD | 4.1945683e-05 |
| 2,205 | ReStore: Reusing Results of MapReduce Jobs | 2012 | VLDB | 9.2920002e-05 |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |
| 3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614 |