Jigsaw: A Data Storage and Query Processing Engine for Irregular Table Partitioning

Summary: Jigsaw is a storage and query processing engine that supports irregular, non-rectangular table partitions to reduce I/O. Its partition-at-a-time execution model avoids repeated reads of irregular partitions, delivering speedups of up to 4.2x over columnar layouts and reducing data transfer to ~21% on HAP/TPC-H. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6237
Venue: SIGMOD
Year: 2021
Pagerank: 4.8184276e-05
Overall Rank: 7,127 | 50.47%
DOI: 10.1145/3448016.3457547

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
5,571	A Deep Dive into Common Open Formats for Analytical DBMSs	2023	VLDB	5.4279553e-05
6,788	Proteus: Autonomous Adaptive Storage for Mixed Workloads	2022	SIGMOD	4.9207259e-05
11,070	Partition, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines	2024	VLDB	4.1905499e-05
11,178	Grouping Time Series for Efficient Columnar Storage	2023	SIGMOD	4.1905499e-05
11,214	SH2O: Efficient Data Access for Work-Sharing Databases	2023	SIGMOD	4.1905499e-05

Outgoing Citations (Sorted by Pagerank)

Showing 35 of 35 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
20	C-Store: A Column-oriented DBMS	2005	VLDB	0.00086163998
35	MonetDB/X100: Hyper-Pipelining Query Execution	2005	CIDR	0.00076209479
160	Automated Selection of Materialized Views and Indexes for SQL Databases	2000	VLDB	0.00040053897
179	Efficient and Extensible Algorithms for Multi Query Optimization	2000	SIGMOD	0.00037637319
208	Schism: a Workload-Driven Approach to Database Replication and Partitioning	2010	VLDB	0.00034478612
241	DB2 with BLU Acceleration: So Much More than Just a Column Store	2013	VLDB	0.00031314629
258	DB2 Design Advisor: Integrated Automatic Physical Database Design	2004	VLDB	0.00030196528
283	Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design	2004	SIGMOD	0.00029024583
285	Automating Physical Database Design in a Parallel Database	2002	SIGMOD	0.00028978423
286	High-Performance Concurrency Control Mechanisms for Main-Memory Databases	2012	VLDB	0.0002894802
310	The Vertica Analytic Database: C-Store 7 Years Later	2012	VLDB	0.0002815547
466	A Case for Fractured Mirrors	2002	VLDB	0.00022455567
496	Column-Stores vs. Row-Stores: How Different Are They Really?	2008	SIGMOD	0.00021705611
517	AutoAdmin "What-if" Index Analysis Utility	1998	SIGMOD	0.00021193179
594	HYRISE—A Main Memory Hybrid Storage Engine	2011	VLDB	0.00019515008
679	Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems	2012	SIGMOD	0.00018211621
708	Performance Tradeoffs in Read-Optimized Databases	2006	VLDB	0.00017753572
1,046	Buffering Database Operations for Enhanced Instruction Cache Performance	2004	SIGMOD	0.00014446882
1,109	Sybase IQ Multiplex – Designed For Analytics	2004	VLDB	0.00013927106
1,473	Fine-grained Partitioning for Aggressive Data Skipping	2014	SIGMOD	0.00011786148
1,475	Efficient Exploitation of Similar Subexpressions for Query Processing	2007	SIGMOD	0.00011765071
1,608	Qd-tree: Learning Data Layouts for Big Data Analytics	2020	SIGMOD	0.00011169837
1,697	Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads	2016	SIGMOD	0.00010859294
1,801	H2O: A Hands-free Adaptive Store	2014	SIGMOD	0.00010485628
1,921	Selecting Subexpressions to Materialize at Datacenter Scale	2018	VLDB	0.00010085899
2,003	Data Morphing: An Adaptive, Cache-Conscious Storage Technique	2003	VLDB	9.8198787e-05
2,410	Automated Partitioning Design in Parallel Database Systems	2011	SIGMOD	8.8643562e-05
3,482	Optimal Column Layout for Hybrid Workloads	2019	VLDB	7.0514808e-05
3,731	Skipping-oriented Partitioning for Columnar Layouts	2017	VLDB	6.8074069e-05
3,761	Read-Optimized Databases, In Depth	2008	VLDB	6.7777865e-05
4,068	Advanced Partitioning Techniques for Massively Distributed Computation	2012	SIGMOD	6.4748133e-05
4,160	Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe?	2017	SIGMOD	6.3886736e-05
4,621	Automated Generation of Materialized Views in Oracle	2020	VLDB	6.036749e-05
7,111	A Comparison of Knives for Bread Slicing	2013	VLDB	4.8228024e-05
9,002	Chasing Similarity: Distribution-aware Aggregation Scheduling	2019	VLDB	4.4077753e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
679	Skew-Aware Automatic Database Partitioning in Shared-Nothing, Parallel OLTP Systems	2012	SIGMOD	0.00018211621
5,116	AdaptDB: Adaptive Partitioning for Distributed Joins	2017	VLDB	5.6805476e-05
549	Hash-Partitioned Join Method Using Dynamic Destaging Strategy	1988	VLDB	0.00020348047
7,713	Query Centric Partitioning and Allocation for Partially Replicated Database Systems	2017	SIGMOD	4.6662571e-05
1,473	Fine-grained Partitioning for Aggressive Data Skipping	2014	SIGMOD	0.00011786148
4,107	Cracking the Database Store	2005	CIDR	6.4384924e-05
12,001	A Partitioning Framework for Aggressive Data Skipping	2014	VLDB	4.1905499e-05
7,791	Jigsaw: Efficient Optimization Over Uncertain Enterprise Data	2011	SIGMOD	4.6457822e-05
2,231	Self-organizing Tuple Reconstruction in Column-stores	2009	SIGMOD	9.2367968e-05
3,731	Skipping-oriented Partitioning for Columnar Layouts	2017	VLDB	6.8074069e-05