[Demo] Low-latency Spark Queries on Updatable Data
Summary: Demo: Indexed DataFrame—cached Spark DataFrame with an integrated index for fast lookups and joins on updatable data. Supports multi-version concurrency for updates; evaluated on growing social-network graphs with microbenchmarks and real-world queries. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Alexandru Uta
- 2. Bogdan Ghit
- 3. Ankur Dave
- 4. Peter Boncz
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,758 | Hyperspace: The Indexing Subsystem of Azure Synapse | 2021 | VLDB | 4.456315e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 536 | The LDBC Social Network Benchmark: Interactive Workload | 2015 | SIGMOD | 0.00020722862 |
| 1,548 | Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark | 2018 | SIGMOD | 0.00011431383 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,124 | Dynamic Speculative Optimizations for SQL Compilation in Apache Spark | 2020 | VLDB | 4.391961e-05 |
| 4,128 | Are Updatable Learned Indexes Ready? | 2022 | VLDB | 6.4292373e-05 |
| 11,405 | SparkCAD: Caching Anomalies Detector for Spark Applications | 2022 | VLDB | 4.1945683e-05 |
| 1,548 | Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark | 2018 | SIGMOD | 0.00011431383 |
| 3,200 | Big Data Analytics with Datalog Queries on Spark | 2016 | SIGMOD | 7.3912411e-05 |
| 4,755 | Indexing for Interactive Exploration of Big Data Series | 2014 | SIGMOD | 5.946863e-05 |
| 11,197 | QaaD (Query-as-a-Data): Scalable Execution of Massive Number of Small Queries in Spark | 2023 | SIGMOD | 4.1945683e-05 |
| 3,535 | Scaling Spark in the Real World: Performance and Usability | 2015 | VLDB | 6.9992495e-05 |
| 4,650 | LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data | 2016 | VLDB | 6.0234336e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |