Back to papers
Helios: Hyperscale Indexing for the Cloud & Edge
Summary: Helios is a hyperscale cloud/edge indexing system that feeds real-time streams into relational engines. It ingests quadrillions of events and indexes trillions of keys daily across dozens of data centers, using a simple data model and asynchronous indexing - a scalable blueprint for Microsoft-scale analytics.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 12205
- Venue
- VLDB
- Year
- 2020
- Pagerank
- 5.1408379e-05
- Overall Rank
- 6,242 | 56.58%
- DOI
-
10.14778/3415478.3415547
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 26 of 26 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 2 |
R-Trees: A Dynamic Index Structure For Spatial Searching |
1984 |
SIGMOD |
0.0032169493 |
| 22 |
SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets |
2008 |
VLDB |
0.0008456613 |
| 53 |
PNUTS: Yahoo!'s Hosted Data Serving Platform |
2008 |
VLDB |
0.00066144767 |
| 66 |
Spark SQL: Relational Data Processing in Spark |
2015 |
SIGMOD |
0.00061639801 |
| 87 |
Hekaton: SQL Server’s Memory-Optimized OLTP Engine |
2013 |
SIGMOD |
0.00052389723 |
| 102 |
The Case for Learned Index Structures |
2018 |
SIGMOD |
0.00049545203 |
| 156 |
Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases |
2017 |
SIGMOD |
0.00040504295 |
| 158 |
Automated Selection of Materialized Views and Indexes for SQL Databases |
2000 |
VLDB |
0.00040071492 |
| 189 |
Megastore: Providing Scalable, Highly Available Storage for Interactive Services |
2011 |
CIDR |
0.00035925334 |
| 237 |
An Efficient, Cost-Driven Index Selection Tool for Microsoft SQL Server |
1997 |
VLDB |
0.00031726304 |
| 242 |
Generalized Search Trees for Database Systems (Extended Abstract) |
1995 |
VLDB |
0.00031110894 |
| 286 |
Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design |
2004 |
SIGMOD |
0.00028990057 |
| 288 |
Storm @Twitter |
2014 |
SIGMOD |
0.00028939871 |
| 314 |
MillWheel: Fault-Tolerant Stream Processing at Internet Scale |
2013 |
VLDB |
0.00028084774 |
| 359 |
Self-Driving Database Management Systems |
2017 |
CIDR |
0.0002592783 |
| 521 |
Hyder - A Transactional Record Manager for Shared Flash |
2011 |
CIDR |
0.00021139547 |
| 538 |
The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing |
2015 |
VLDB |
0.00020678804 |
| 1,098 |
Trill: A High-Performance Incremental Query Processor for Diverse Analytics |
2015 |
VLDB |
0.00014114442 |
| 1,548 |
Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark |
2018 |
SIGMOD |
0.00011431383 |
| 1,977 |
Split Query Processing in Polybase |
2013 |
SIGMOD |
9.8824589e-05 |
| 2,516 |
Concurrency and Recovery in Generalized Search Trees |
1997 |
SIGMOD |
8.6106981e-05 |
| 3,038 |
Azure Data Lake Store: A Hyperscale Distributed File Service for Big Data Analytics |
2017 |
SIGMOD |
7.6717218e-05 |
| 3,052 |
Deuteronomy: Transaction Support for Cloud Data |
2011 |
CIDR |
7.6507181e-05 |
| 3,550 |
Chi: A Scalable and Programmable Control Plane for Distributed Stream Processing Systems |
2018 |
VLDB |
6.9843512e-05 |
| 7,599 |
Quill: Efficient, Transferable, and Rich Analytics at Scale |
2016 |
VLDB |
4.7003593e-05 |
| 8,633 |
Demonstration: MacroBase, A Fast Data Analysis Engine |
2017 |
SIGMOD |
4.4802036e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 6,261 |
The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward |
2021 |
VLDB |
5.1350714e-05 |
| 5,297 |
Continuous Cloud-Scale Query Optimization and Processing |
2013 |
VLDB |
5.5801669e-05 |
| 9,480 |
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB |
2025 |
VLDB |
4.3341665e-05 |
| 6,316 |
HydraList: A Scalable In-Memory Index Using Asynchronous Updates and Partial Replication |
2020 |
VLDB |
5.1141977e-05 |
| 3,792 |
Schema-Agnostic Indexing with Azure DocumentDB |
2015 |
VLDB |
6.7618051e-05 |
| 9,349 |
A Framework for Supporting DBMS-like Indexes in the Cloud |
2011 |
VLDB |
4.3526413e-05 |
| 5,376 |
Holistic Indexing in Main-memory Column-stores |
2015 |
SIGMOD |
5.5417421e-05 |
| 2,047 |
Automatically Indexing Millions of Databases in Microsoft Azure SQL Database |
2019 |
SIGMOD |
9.6920209e-05 |
| 8,758 |
Hyperspace: The Indexing Subsystem of Azure Synapse |
2021 |
VLDB |
4.456315e-05 |
| 11,651 |
Helios: An Adaptive and Query Workload-driven Partitioning Framework for Distributed Graph Stores |
2019 |
SIGMOD |
4.1945683e-05 |