Vortex: A Stream-oriented Storage Engine For Big Data Analytics
Summary: Vortex is a stream-oriented storage engine in Google BigQuery for real-time analytics on continuous data. It supports both streaming and batch workloads, delivering petabyte-scale ingestion with sub-second freshness and low-latency queries. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Pavan Edara
- 2. Jonathan Forbes
- 3. Bigang Li
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,402 | BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse | 2024 | SIGMOD | 5.079818e-05 |
| 10,766 | Scribe: How Meta transports terabytes per second in real time | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 109 | Dremel: Interactive Analysis of Web-Scale Datasets | 2010 | VLDB | 0.00048186983 |
| 131 | Integrating Compression and Execution in Column-Oriented Database Systems | 2006 | SIGMOD | 0.0004370331 |
| 368 | Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing | 1998 | VLDB | 0.000254931 |
| 538 | The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing | 2015 | VLDB | 0.00020678804 |
| 2,062 | Dremel: A Decade of Interactive SQL Analysis at Web Scale | 2020 | VLDB | 9.6481955e-05 |
| 4,530 | Big Metadata: When Metadata is Big Data | 2021 | VLDB | 6.1075429e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,548 | Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark | 2018 | SIGMOD | 0.00011431383 |
| 3,333 | SnappyData: A Unified Cluster for Streaming, Transactions, and Interactive Analytics | 2017 | CIDR | 7.2093479e-05 |
| 7,573 | Squall: Scalable Real-time Analytics | 2016 | VLDB | 4.7071608e-05 |
| 11,668 | Cost-Effective, Workload-Adaptive Migration of Big Data Applications to the Cloud | 2019 | SIGMOD | 4.1945683e-05 |
| 4,530 | Big Metadata: When Metadata is Big Data | 2021 | VLDB | 6.1075429e-05 |
| 1,820 | A Demonstration of the BigDAWG Polystore System | 2015 | VLDB | 0.00010428281 |
| 6,123 | Data Ingestion for the Connected World | 2017 | CIDR | 5.1991194e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
| 6,402 | BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse | 2024 | SIGMOD | 5.079818e-05 |
| 6,453 | Vortex: Overcoming Memory Capacity Limitations in GPU-Accelerated Large-Scale Data Analytics | 2025 | VLDB | 5.0571108e-05 |