Database Paper Browser

Back to papers

Skipping-oriented Partitioning for Columnar Layouts

Summary: Introduces Generalized Skipping-Oriented Partitioning (GSOP) for columnar layouts. Jointly optimizes horizontal skipping and vertical partitioning to balance skip opportunities with tuple reconstruction, reducing scanned data and improving end-to-end queries vs. state-of-the-art on benchmarks and real workloads. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
11539
Venue
VLDB
Year
2017
Pagerank
6.8033227e-05
Overall Rank
3,737 | 74.01%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 20 of 20 citing papers.

Rank Citing Paper Year Venue Pagerank
1,377 Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics 2021 CIDR 0.00012296941
1,611 Qd-tree: Learning Data Layouts for Big Data Analytics 2020 SIGMOD 0.00011147324
3,488 Optimal Column Layout for Hybrid Workloads 2019 VLDB 7.0479329e-05
3,779 Instance-Optimized Data Layouts for Cloud Analytics Workloads 2021 SIGMOD 6.7747205e-05
3,922 Pushing Data-Induced Predicates Through Joins in Big-Data Clusters 2020 VLDB 6.6291079e-05
5,749 BinDex: A Two-Layered Index for Fast and Robust Scans 2020 SIGMOD 5.3418923e-05
6,398 Endure: A Robust Tuning Paradigm for LSM Trees Under Workload Uncertainty 2022 VLDB 5.0819209e-05
6,466 Pando: Enhanced Data Skipping with Logical Data Partitioning 2023 VLDB 5.0528281e-05
7,053 Statisticum: Data Statistics Management in SAP HANA 2017 VLDB 4.8497195e-05
7,128 Jigsaw: A Data Storage and Query Processing Engine for Irregular Table Partitioning 2021 SIGMOD 4.8230171e-05
7,483 RTScan: Efficient Scan with Ray Tracing Cores 2024 VLDB 4.7180617e-05
8,222 Sieve: A Learned Data-Skipping Index for Data Analytics 2023 VLDB 4.5555621e-05
8,415 Pruning in Snowflake: Working Smarter, Not Harder 2025 SIGMOD 4.5197687e-05
8,886 Provenance-based Data Skipping 2022 VLDB 4.4279829e-05
10,230 Breaking the Isolation-Freshness Trade-off: Joint Adaptive Storage Optimization for HTAP Systems 2026 VLDB 4.1945683e-05
10,385 Optimizing Block Skipping for High-Dimensional Data with Learned Adaptive Curve 2025 SIGMOD 4.1945683e-05
10,404 Dynamic Pruning for Recursive Joins 2025 SIGMOD 4.1945683e-05
10,761 SIEVE: Effective Filtered Vector Search with Collection of Indexes 2025 VLDB 4.1945683e-05
11,067 Partition, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines 2024 VLDB 4.1945683e-05
11,212 SH2O: Efficient Data Access for Work-Sharing Databases 2023 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 30 of 30 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
35 MonetDB/X100: Hyper-Pipelining Query Execution 2005 CIDR 0.00076197749
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
109 Dremel: Interactive Analysis of Web-Scale Datasets 2010 VLDB 0.00048186983
131 Integrating Compression and Execution in Column-Oriented Database Systems 2006 SIGMOD 0.0004370331
158 Automated Selection of Materialized Views and Indexes for SQL Databases 2000 VLDB 0.00040071492
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
209 Schism: a Workload-Driven Approach to Database Replication and Partitioning 2010 VLDB 0.00034468292
241 DB2 with BLU Acceleration: So Much More than Just a Column Store 2013 VLDB 0.00031420034
285 Automating Physical Database Design in a Parallel Database 2002 SIGMOD 0.0002899128
286 Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design 2004 SIGMOD 0.00028990057
310 The Vertica Analytic Database: C-Store 7 Years Later 2012 VLDB 0.00028132402
368 Small Materialized Aggregates: A Light Weight Index Structure for Data Warehousing 1998 VLDB 0.000254931
408 Database Cracking 2007 CIDR 0.00023953844
426 Amazon Redshift and the Case for Simpler Data Warehouses 2015 SIGMOD 0.00023594359
596 HYRISE—A Main Memory Hybrid Storage Engine 2011 VLDB 0.00019481482
1,470 Processing a Trillion Cells per Mouse Click 2012 VLDB 0.00011833779
1,477 Fine-grained Partitioning for Aggressive Data Skipping 2014 SIGMOD 0.00011770865
1,807 H2O: A Hands-free Adaptive Store 2014 SIGMOD 0.00010487796
1,999 Data Morphing: An Adaptive, Cache-Conscious Storage Technique 2003 VLDB 9.8235392e-05
2,229 Self-organizing Tuple Reconstruction in Column-stores 2009 SIGMOD 9.2350274e-05
2,444 Brighthouse: An Analytic Data Warehouse for Ad-hoc Queries 2008 VLDB 8.8076551e-05
2,773 JSON Data Management – Supporting Schema-less Development in RDBMS 2014 SIGMOD 8.1386587e-05
2,987 The Uncracked Pieces in Database Cracking 2014 VLDB 7.7787088e-05
2,998 Major Technical Advancements in Apache Hive 2014 SIGMOD 7.753765e-05
3,028 Efficient Query Processing for Multi-Dimensionally Clustered Tables in DB2 2003 VLDB 7.6816205e-05
3,208 Column-Oriented Storage Techniques for MapReduce 2011 VLDB 7.3781897e-05
4,061 Advanced Partitioning Techniques for Massively Distributed Computation 2012 SIGMOD 6.483587e-05
6,802 Understanding Insights into the Basic Structure and Essential Issues of Table Placement Methods in Clusters 2013 VLDB 4.9226626e-05
7,114 A Comparison of Knives for Bread Slicing 2013 VLDB 4.827351e-05
Previous Page 1 / 1 Next

Semantically Similar Papers