| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 746 | Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores | 2020 | VLDB | 0.00017326979 |
| 1,377 | Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics | 2021 | CIDR | 0.00012296941 |
| 1,477 | Fine-grained Partitioning for Aggressive Data Skipping | 2014 | SIGMOD | 0.00011770865 |
| 1,548 | Structured Streaming: A Declarative API for Real-Time Applications in Apache Spark | 2018 | SIGMOD | 0.00011431383 |
| 2,355 | G-OLA: Generalized On-Line Aggregation for Interactive Analysis on Big Data | 2015 | SIGMOD | 8.9677847e-05 |
| 2,473 | Photon: A Fast Query Engine for Lakehouse Systems | 2022 | SIGMOD | 8.7237281e-05 |
| 2,700 | Filter Before You Parse: Faster Analytics on Raw Data with Sparser | 2018 | VLDB | 8.2728509e-05 |
| 3,535 | Scaling Spark in the Real World: Performance and Usability | 2015 | VLDB | 6.9992495e-05 |
| 4,239 | The Composable Data Management System Manifesto | 2023 | VLDB | 6.3318452e-05 |
| 4,641 | VIVA: An End-to-End System for Interactive Video Analytics | 2022 | CIDR | 6.027004e-05 |
| 5,318 | Analyzing and Comparing Lakehouse Storage Systems | 2023 | CIDR | 5.5715872e-05 |
| 6,400 | iOLAP: Managing Uncertainty for Efficient Incremental OLAP | 2016 | SIGMOD | 5.0803518e-05 |
| 6,784 | SparkR: Scaling R Programs with Spark | 2016 | SIGMOD | 4.9265155e-05 |
| 7,059 | Adaptive and Robust Query Execution for Lakehouses at Scale | 2024 | VLDB | 4.8477825e-05 |
| 8,181 | Foreign Keys Open the Door for Faster Incremental View Maintenance | 2023 | SIGMOD | 4.5660166e-05 |
| 8,197 | SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft | 2021 | VLDB | 4.5607121e-05 |
| 8,506 | New Query Optimization Techniques in the Spark Engine of Azure Synapse | 2022 | VLDB | 4.4957661e-05 |
| 8,608 | Unity Catalog: Open and Universal Governance for the Lakehouse and Beyond | 2025 | SIGMOD | 4.4853979e-05 |
| 9,016 | Making Data Engineering Declarative | 2023 | CIDR | 4.4094312e-05 |
| 9,093 | Databricks Lakeguard: Supporting Fine-grained Access Control and Multi-user Capabilities for Apache Spark Workloads | 2025 | SIGMOD | 4.398149e-05 |
| 9,516 | [Demo] Low-latency Spark Queries on Updatable Data | 2019 | SIGMOD | 4.3335877e-05 |
| 9,555 | Bringing the Operational and Analytical Worlds Together with Lakebase | 2025 | VLDB | 4.3254416e-05 |
| 9,584 | Introduction to Spark 2.0 for Database Researchers | 2016 | SIGMOD | 4.3218691e-05 |
| 9,747 | Still Asking: How Good Are Query Optimizers, Really? | 2025 | VLDB | 4.2897489e-05 |
| 9,808 | Photon: A High-Performance Query Engine for the Lakehouse | 2022 | CIDR | 4.2794025e-05 |
| 10,394 | Ultraverse: An Efficient What-if Analysis Framework for Software Applications Interacting with Database Systems | 2025 | SIGMOD | 4.1945683e-05 |
| 10,496 | Physical Visualization Design: Decoupling Interface and System Design | 2025 | SIGMOD | 4.1945683e-05 |
| 10,774 | Automatic Indexing in Oracle | 2025 | VLDB | 4.1945683e-05 |
| 11,077 | A Flexible Forecasting Stack | 2024 | VLDB | 4.1945683e-05 |
| 11,194 | A Step Toward Deep Online Aggregation | 2023 | SIGMOD | 4.1945683e-05 |
| 11,366 | Statistical Schema Learning using Occam's Razor | 2022 | SIGMOD | 4.1945683e-05 |
| 13,096 | Blink Twice - Automatic Workload Pinning and Regression Detection for Versionless Apache Spark using Retries | 2025 | SIGMOD | - |
| 13,124 | Delta Sharing: An Open Protocol for Cross-Platform Data Sharing | 2025 | VLDB | - |
| 13,269 | Designing Production-Friendly Machine Learning | 2021 | VLDB | - |
| 13,277 | The Challenge of Building Effective Data Lakes | 2020 | SIGMOD | - |