Back to papers
Kodiak: Leveraging Materialized Views For Very Low-Latency Analytics Over High-Dimensional Web-Scale Data
Summary: Kodiak is a distributed analytics platform for web-scale, high-dimensional data using thousands of precomputed, partitioned materialized views to deliver interactive queries. In Turn production, 2490 views over 3PB, 200K daily queries; median 8ms, 99th 252ms; 3 orders of magnitude faster and 4 orders cheaper than state-of-the-art platforms via auto view selection.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 11236
- Venue
- VLDB
- Year
- 2016
- Pagerank
- 4.800763e-05
- Overall Rank
- 7,207 | 49.87%
- DOI
-
-
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 17 of 17 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 3 |
Pig Latin: A Not-So-Foreign Language for Data Processing |
2008 |
SIGMOD |
0.0024183614 |
| 53 |
PNUTS: Yahoo!'s Hosted Data Serving Platform |
2008 |
VLDB |
0.00066144767 |
| 66 |
Spark SQL: Relational Data Processing in Spark |
2015 |
SIGMOD |
0.00061639801 |
| 70 |
Hive - A Warehousing Solution Over a Map-Reduce Framework |
2009 |
VLDB |
0.00059533166 |
| 476 |
Impala: A Modern, Open-Source SQL Engine for Hadoop |
2015 |
CIDR |
0.00022226941 |
| 542 |
Shark: SQL and Rich Analytics at Scale |
2013 |
SIGMOD |
0.00020595648 |
| 779 |
Materialized View Maintenance and Integrity Constraint Checking: Trading Space for Time |
1996 |
SIGMOD |
0.00016786961 |
| 906 |
F1: A Distributed SQL Database That Scales |
2013 |
VLDB |
0.00015448884 |
| 962 |
Maintenance of Data Cubes and Summary Tables in a Warehouse |
1997 |
SIGMOD |
0.00014986226 |
| 1,421 |
Algorithms for Deferred View Maintenance |
1996 |
SIGMOD |
0.0001205793 |
| 1,588 |
Druid: A Real-time Analytical Data Store |
2014 |
SIGMOD |
0.00011239313 |
| 1,853 |
On Brewing Fresh Espresso: LinkedIn’s Distributed Data Serving Platform |
2013 |
SIGMOD |
0.00010320369 |
| 1,863 |
Cheetah: A High Performance, Custom Data Warehouse on Top of MapReduce |
2010 |
VLDB |
0.00010286531 |
| 3,037 |
How To Roll a Join: Asynchronous Incremental View Maintenance |
2000 |
SIGMOD |
7.6731715e-05 |
| 3,092 |
Asynchronous View Maintenance for VLSD Databases |
2009 |
SIGMOD |
7.5800633e-05 |
| 8,464 |
Piranha: Optimizing Short Jobs in Hadoop |
2013 |
VLDB |
4.5052127e-05 |
| 9,387 |
Overview of Turn Data Management Platform for Digital Advertising |
2013 |
VLDB |
4.3443757e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 11,411 |
High-dimensional Data Cubes |
2022 |
VLDB |
4.1945683e-05 |
| 7,534 |
Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams |
2022 |
VLDB |
4.7180004e-05 |
| 10,385 |
Optimizing Block Skipping for High-Dimensional Data with Learned Adaptive Curve |
2025 |
SIGMOD |
4.1945683e-05 |
| 3,157 |
High-Dimensional OLAP: A Minimal Cubing Approach |
2004 |
VLDB |
7.4656511e-05 |
| 11,197 |
QaaD (Query-as-a-Data): Scalable Execution of Massive Number of Small Queries in Spark |
2023 |
SIGMOD |
4.1945683e-05 |
| 3,388 |
Analytics in Motion: High Performance Event-Processing AND Real-Time Analytics in the Same Database |
2015 |
SIGMOD |
7.1571148e-05 |
| 2,418 |
Tupleware: "Big" Data, Big Analytics, Small Clusters |
2015 |
CIDR |
8.8556595e-05 |
| 8,094 |
Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms |
2021 |
VLDB |
4.5867812e-05 |
| 8,357 |
Cubrick: Indexing Millions of Records per Second for Interactive Analytics |
2016 |
VLDB |
4.5373339e-05 |
| 9,504 |
Supporting Scalable Analytics with Latency Constraints |
2015 |
VLDB |
4.3341665e-05 |