Pig Latin: A Not-So-Foreign Language for Data Processing
Summary: Pig Latin sits between SQL and MapReduce, letting analysts who prefer a procedural style express data flows without writing MapReduce code. The Pig system compiles Pig Latin into Hadoop MapReduce plans and includes an integrated debugger. Pig is open source as an Apache Incubator project and is deployed at Yahoo. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Christopher Olston
2. Benjamin Reed
3. Utkarsh Srivastava
4. Ravi Kumar
5. Andrew Tomkins
Incoming Citations (Sorted by Pagerank)
Showing 4 of 154 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,125 | ReStore: Reusing Results of MapReduce Jobs in Pig | 2012 | SIGMOD | 4.1945683e-05 |
| 12,287 | LifeRaft: Data-Driven, Batch Processing for the Exploration of Scientific Databases | 2009 | CIDR | 4.1945683e-05 |
| 12,290 | From Declarative Languages to Declarative Processing in Computer Games | 2009 | CIDR | 4.1945683e-05 |
| 12,401 | Large-Scale Collaborative Analysis and Extraction of Web Data | 2008 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 15 | Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters | 2007 | SIGMOD | 0.0010654262 |
| 18 | On Random Sampling over Joins | 1999 | SIGMOD | 0.00092385438 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,790 | From SPARQL to MapReduce: The Journey Using a Nested TripleGroup Algebra | 2011 | VLDB | 4.4508494e-05 |
| 4,857 | The "Big Data" Ecosystem at LinkedIn | 2013 | SIGMOD | 5.8736144e-05 |
| 11,690 | Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology | 2019 | VLDB | 4.1945683e-05 |
| 1,265 | Jaql: A Scripting Language for Large Scale Semistructured Data Analysis | 2011 | VLDB | 0.00012947629 |
| 12,125 | ReStore: Reusing Results of MapReduce Jobs in Pig | 2012 | SIGMOD | 4.1945683e-05 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 13,426 | The Farm - where Pig Scripts are bred and raised | 2013 | SIGMOD | - |
| 3,601 | Large-Scale Machine Learning at Twitter | 2012 | SIGMOD | 6.9315087e-05 |
| 4,425 | Nova: Continuous Pig/Hadoop Workflows | 2011 | SIGMOD | 6.198382e-05 |
| 780 | Building a High-Level Dataflow System on top of Map-Reduce: The Pig Experience | 2009 | VLDB | 0.00016775082 |