Petabyte Scale Databases and Storage Systems at Facebook
Summary: Facebook's petabyte-scale data stack combines sharded MySQL with Memcache for real-time access, TAO for consistency across geographic regions, Haystack for storing billions of photos, and Hadoop/HBase for analytics. The paper examines workload-driven design choices, ACID requirements, and graph-to-relational mapping for geographically distributed deployments. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,913 | BF-Tree: Approximate Tree Indexing | 2014 | VLDB | 1.0113937e-04 |
| 3,131 | FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems | 2022 | VLDB | 7.4985793e-05 |
| 8,222 | Sieve: A Learned Data-Skipping Index for Data Analytics | 2023 | VLDB | 4.5555621e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,232 | Managing Large Dynamic Graphs Efficiently | 2012 | SIGMOD | 7.336861e-05 |
| 1,613 | Realtime Data Processing at Facebook | 2016 | SIGMOD | 1.1140777e-04 |
| 5,463 | TAOBench: An End-to-End Benchmark for Social Network Workloads | 2022 | VLDB | 5.4938614e-05 |
| 189 | Megastore: Providing Scalable, Highly Available Storage for Interactive Services | 2011 | CIDR | 3.5925334e-04 |
| 2,337 | Efficient Processing of Data Warehousing Queries in a Split Execution Environment | 2011 | SIGMOD | 9.0098186e-05 |
| 3,402 | TAO: How Facebook Serves the Social Graph | 2012 | SIGMOD | 7.1378698e-05 |
| 1,711 | SCADS: Scale-Independent Storage for Social Computing Applications | 2009 | CIDR | 1.080509e-04 |
| 10,413 | RedTAO: A Trillion-edge High-throughput Graph Store | 2025 | SIGMOD | 4.1945683e-05 |
| 2,658 | Data Warehousing and Analytics Infrastructure at Facebook | 2010 | SIGMOD | 8.3607429e-05 |
| 1,499 | Apache Hadoop Goes Realtime at Facebook | 2011 | SIGMOD | 1.1675192e-04 |