Scuba: Diving into Data at Facebook
Summary: Scuba: Facebook's in-memory, distributed DB for real-time analytics on live data. Ingests millions of rows per second, RAM-resident on hundreds of servers, delivering sub-second interactive queries for code regression, bug monitoring, and performance debugging. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Lior Abraham
- 2. Vinayak Borkar
- 3. Daniel Merl
- 4. Subbu Subramanian
- 5. John Allen
- 6. Bhuwan Chopra
- 7. Josh Metzler
- 8. Janet L. Wiener
- 9. Oleksandr Barykin
- 10. Ciprian Gerea
- 11. David Reiss
- 12. Okay Zed
Incoming Citations (Sorted by Pagerank)
Showing 22 of 22 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 109 | Dremel: Interactive Analysis of Web-Scale Datasets | 2010 | VLDB | 0.00048186983 |
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 858 | Efficient Transaction Processing in SAP HANA Database – The End of a Column Store Myth | 2012 | SIGMOD | 0.000158756 |
| 1,470 | Processing a Trillion Cells per Mouse Click | 2012 | VLDB | 0.00011833779 |
| 2,488 | Shark: Fast Data Analysis Using Coarse-grained Distributed Memory | 2012 | SIGMOD | 8.6683713e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,357 | Cubrick: Indexing Millions of Records per Second for Interactive Analytics | 2016 | VLDB | 4.5373339e-05 |
| 3,402 | TAO: How Facebook Serves the Social Graph | 2012 | SIGMOD | 7.1378698e-05 |
| 281 | LinkBench: a Database Benchmark Based on the Facebook Social Graph | 2013 | SIGMOD | 0.0002906793 |
| 1,711 | SCADS: Scale-Independent Storage for Social Computing Applications | 2009 | CIDR | 0.0001080509 |
| 6,850 | Petabyte Scale Databases and Storage Systems at Facebook | 2013 | SIGMOD | 4.9085019e-05 |
| 210 | Gorilla: A Fast, Scalable, In-Memory Time Series Database | 2015 | VLDB | 0.0003404384 |
| 1,613 | Realtime Data Processing at Facebook | 2016 | SIGMOD | 0.00011140777 |
| 2,658 | Data Warehousing and Analytics Infrastructure at Facebook | 2010 | SIGMOD | 8.3607429e-05 |
| 5,120 | Fast Database Restarts at Facebook | 2014 | SIGMOD | 5.6803959e-05 |
| 9,111 | Meta's Next-generation Realtime Monitoring and Analytics Platform | 2022 | VLDB | 4.3942367e-05 |