Data Warehousing and Analytics Infrastructure at Facebook
Summary: Facebook's scalable data warehouse uses Scribe, Hadoop, Hive to unify log collection, storage, and analytics for BI dashboards and feature services. Stores >15PB (2.5PB compressed), ingests ~60TB/day (10TB compressed); discusses design choices, day-to-day operations, and planned improvements. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ashish Thusoo
- 2. Zheng Shao
- 3. Suresh Anthony
- 4. Dhruba Borthakur
- 5. Namit Jain
- 6. Joydeep Sen Sarma
- 7. Raghotham Murthy
- 8. Hao Liu
Incoming Citations (Sorted by Pagerank)
Showing 17 of 17 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 1 of 1 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 80 | Weaving Relations for Cache Performance | 2001 | VLDB | 0.00055721729 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,337 | Efficient Processing of Data Warehousing Queries in a Split Execution Environment | 2011 | SIGMOD | 9.0098186e-05 |
| 3,601 | Large-Scale Machine Learning at Twitter | 2012 | SIGMOD | 6.9315087e-05 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 7,877 | Emerging Trends in the Enterprise Data Analytics: Connecting Hadoop and DB2 Warehouse | 2011 | SIGMOD | 4.6297559e-05 |
| 1,499 | Apache Hadoop Goes Realtime at Facebook | 2011 | SIGMOD | 0.00011675192 |
| 4,857 | The "Big Data" Ecosystem at LinkedIn | 2013 | SIGMOD | 5.8736144e-05 |
| 3,973 | Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing | 2019 | SIGMOD | 6.5758017e-05 |
| 6,850 | Petabyte Scale Databases and Storage Systems at Facebook | 2013 | SIGMOD | 4.9085019e-05 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |