Back to papers
Optimizing Block Skipping for High-Dimensional Data with Learned Adaptive Curve
Summary: Learned adaptive curve for SMA block skipping in high-dimensional data via an attention-based network and end-to-end training. Scales to 1000 columns with a 2.8x block-skipping improvement over static space-filling curves on real Spark workloads.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 7053
- Venue
- SIGMOD
- Year
- 2025
- Pagerank
- 4.1945683e-05
- Overall Rank
- 10,385 | 27.76%
- DOI
-
10.1145/3709710
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 24 of 24 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 2 |
R-Trees: A Dynamic Index Structure For Spatial Searching |
1984 |
SIGMOD |
0.0032169493 |
| 66 |
Spark SQL: Relational Data Processing in Spark |
2015 |
SIGMOD |
0.00061639801 |
| 102 |
The Case for Learned Index Structures |
2018 |
SIGMOD |
0.00049545203 |
| 204 |
Learned Cardinalities: Estimating Correlated Joins with Deep Learning |
2019 |
CIDR |
0.00034784455 |
| 241 |
DB2 with BLU Acceleration: So Much More than Just a Column Store |
2013 |
VLDB |
0.00031420034 |
| 290 |
Linear Clustering of Objects with Multiple Attributes |
1990 |
SIGMOD |
0.00028919734 |
| 368 |
Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing |
1998 |
VLDB |
0.000254931 |
| 746 |
Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores |
2020 |
VLDB |
0.00017326979 |
| 758 |
Deep Unsupervised Cardinality Estimation |
2020 |
VLDB |
0.0001706608 |
| 1,477 |
Fine-grained Partitioning for Aggressive Data Skipping |
2014 |
SIGMOD |
0.00011770865 |
| 1,478 |
Learning Multi-dimensional Indexes |
2020 |
SIGMOD |
0.00011762542 |
| 1,611 |
Qd-tree: Learning Data Layouts for Big Data Analytics |
2020 |
SIGMOD |
0.00011147324 |
| 1,889 |
Tsunami: A Learned Multi-dimensional Index for Correlated Data and Skewed Workloads |
2021 |
VLDB |
0.00010200865 |
| 2,678 |
Effectively Learning Spatial Indices |
2020 |
VLDB |
8.3252088e-05 |
| 2,837 |
Correlation Maps: A Compressed Access Method for Exploiting Soft Functional Dependencies |
2009 |
VLDB |
8.0414149e-05 |
| 3,737 |
Skipping-oriented Partitioning for Columnar Layouts |
2017 |
VLDB |
6.8033227e-05 |
| 3,779 |
Instance-Optimized Data Layouts for Cloud Analytics Workloads |
2021 |
SIGMOD |
6.7747205e-05 |
| 4,956 |
Dimensions Based Data Clustering and Zone Maps |
2017 |
VLDB |
5.8040891e-05 |
| 5,334 |
LEON: A New Framework for ML-Aided Query Optimization |
2023 |
VLDB |
5.5649836e-05 |
| 6,947 |
QUILTS: Multidimensional Partitioning Framework Based on Query-Aware and Skew-Tolerant Space-Filling Curves |
2017 |
SIGMOD |
4.8909129e-05 |
| 7,042 |
LMSFC: A Novel Multidimensional Index based on Learned Monotonic Space Filling Curves |
2023 |
VLDB |
4.8541986e-05 |
| 8,225 |
Automated Multidimensional Data Layouts in Amazon Redshift |
2024 |
SIGMOD |
4.555289e-05 |
| 8,405 |
Towards Designing and Learning Piecewise Space-Filling Curves |
2023 |
VLDB |
4.5224126e-05 |
| 9,108 |
BASE: Bridging the Gap between Cost and Latency for Query Optimization |
2023 |
VLDB |
4.3950066e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 1,478 |
Learning Multi-dimensional Indexes |
2020 |
SIGMOD |
0.00011762542 |
| 9,701 |
Towards Functional Decomposition of Storage Formats |
2025 |
CIDR |
4.3008468e-05 |
| 1,477 |
Fine-grained Partitioning for Aggressive Data Skipping |
2014 |
SIGMOD |
0.00011770865 |
| 10,748 |
Benchmarking Adaptive Multidimensional Indices |
2025 |
VLDB |
4.1945683e-05 |
| 1,611 |
Qd-tree: Learning Data Layouts for Big Data Analytics |
2020 |
SIGMOD |
0.00011147324 |
| 11,993 |
A Partitioning Framework for Aggressive Data Skipping |
2014 |
VLDB |
4.1945683e-05 |
| 8,617 |
A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning |
2024 |
VLDB |
4.4846425e-05 |
| 5,118 |
AdaptDB: Adaptive Partitioning for Distributed Joins |
2017 |
VLDB |
5.6820984e-05 |
| 3,779 |
Instance-Optimized Data Layouts for Cloud Analytics Workloads |
2021 |
SIGMOD |
6.7747205e-05 |
| 6,809 |
Adaptive Data Skipping in Main-Memory Systems |
2016 |
SIGMOD |
4.9206606e-05 |