Database Paper Browser

Back to papers

SageDB: An Instance-Optimized Data Analytics System

Summary: First instance-optimized data system auto-tunes for a workload by synthesizing indexes, estimators, and analytics components into a cohesive stack. Outperforms a commercial cloud analytics system by 3x on end-to-end workloads and 250x on queries. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
12915
Venue
VLDB
Year
2022
Pagerank
4.5120602e-05
Overall Rank
8,442 | 41.28%
DOI
10.14778/3565838.3565857

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 7 of 7 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 30 of 30 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
7 Optimal Aggregation Algorithms for Middleware [Extended Abstract] 2001 PODS 0.0015496097
102 The Case for Learned Index Structures 2018 SIGMOD 0.00049545203
158 Automated Selection of Materialized Views and Indexes for SQL Databases 2000 VLDB 0.00040071492
179 Efficient and Extensible Algorithms for Multi Query Optimization 2000 SIGMOD 0.00037672155
183 Automatic Database Management System Tuning Through Large-scale Machine Learning 2017 SIGMOD 0.00036721403
209 Schism: a Workload-Driven Approach to Database Replication and Partitioning 2010 VLDB 0.00034468292
286 Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design 2004 SIGMOD 0.00028990057
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
424 Tuning Database Configuration Parameters with iTuned 2009 VLDB 0.00023616398
640 Bao: Making Learned Query Optimization Practical 2021 SIGMOD 0.00018759152
679 Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems 2012 SIGMOD 0.00018215154
716 Query-based Workload Forecasting for Self-Driving Database Management Systems 2018 SIGMOD 0.00017723171
731 Optimizing Queries Using Materialized Views: A Practical, Scalable Solution 2001 SIGMOD 0.00017468889
735 Umbra: A Disk-Based System with In-Memory Performance 2020 CIDR 0.00017452467
801 SageDB: A Learned Database System 2019 CIDR 0.00016505496
826 ALEX: An Updatable Adaptive Learned Index 2020 SIGMOD 0.00016224841
874 Index Selection in a Self-Adaptive Data Base Management System 1976 SIGMOD 0.00015728533
981 DynaMat: A Dynamic View Management System for Data Warehouses 1999 SIGMOD 0.00014879532
1,478 Learning Multi-dimensional Indexes 2020 SIGMOD 0.00011762542
1,611 Qd-tree: Learning Data Layouts for Big Data Analytics 2020 SIGMOD 0.00011147324
1,810 SQL Memory Management in Oracle9i 2002 VLDB 0.0001047003
1,814 Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing 2014 VLDB 0.00010458107
1,889 Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads 2021 VLDB 0.00010200865
1,922 Selecting Subexpressions to Materialize at Datacenter Scale 2018 VLDB 0.00010082599
3,779 Instance-Optimized Data Layouts for Cloud Analytics Workloads 2021 SIGMOD 6.7747205e-05
4,227 Cosine: A Cloud-Cost Optimized Self-Designing Key-Value Storage Engine 2022 VLDB 6.3434324e-05
4,240 Make Your Database System Dream of Electric Sheep: Towards Self-Driving Operation 2021 VLDB 6.3318228e-05
4,590 MB2: Decomposed Behavior Modeling for Self-Driving Database Management Systems 2021 SIGMOD 6.0620053e-05
4,670 Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google 2021 VLDB 6.0104466e-05
6,984 Replicated Layout for In-Memory Database Systems 2022 VLDB 4.873081e-05
Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
10,411 OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML 2025 SIGMOD 4.1945683e-05
5,861 Machine Learning for Databases 2021 VLDB 5.298883e-05
2,568 Towards Cost-Optimal Query Processing in the Cloud 2021 VLDB 8.5239227e-05
4,549 Database-Agnostic Workload Management 2019 CIDR 6.0926728e-05
7,889 Cost-Intelligent Data Analytics in the Cloud 2024 CIDR 4.6253386e-05
658 Towards a Unified Architecture for in-RDBMS Analytics 2012 SIGMOD 0.00018506577
5,264 SeeDB: Visualizing Database Queries Efficiently 2014 VLDB 5.597302e-05
6,456 From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems 2019 SIGMOD 5.0564619e-05
6,297 Towards instance-optimized data systems 2021 VLDB 5.1227886e-05
801 SageDB: A Learned Database System 2019 CIDR 0.00016505496