Fast Multi-Column Sorting in Main-Memory Column-Stores
Summary: Proposes code massaging, a bit-level cross-column reordering technique to reduce the number of sorting rounds for multi-column ORDER BY/GROUP BY in main-memory column-stores. Delivers up to 4.7x (TPC-H), 4.7x (TPC-H skew), 4x (TPC-DS), and 3.2x (real workloads) speedups by increasing SIMD parallelism.
(summarized by gpt-5-nano on Feb 09 2026)
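The summary's core idea, fewer sorting rounds by rearranging bits across columns, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names (`massage`, `sort_two_rounds`, `sort_massaged`), the 4-bit code width, and the sample data are all assumptions chosen for illustration, and plain Python sorting stands in for the SIMD sort the paper targets. It only shows that packing two fixed-width column codes into one key lets a single sort reproduce the order of a two-column sort.

```python
def massage(col_a, col_b, width_b):
    """Pack each (a, b) pair of fixed-width codes into a single integer key.

    Assumes every value in col_b fits in width_b bits (hypothetical widths).
    """
    return [(a << width_b) | b for a, b in zip(col_a, col_b)]


def sort_two_rounds(col_a, col_b):
    """Baseline: order row ids by column a, breaking ties by column b."""
    return sorted(range(len(col_a)), key=lambda i: (col_a[i], col_b[i]))


def sort_massaged(col_a, col_b, width_b):
    """Order row ids with one sort over the packed keys."""
    keys = massage(col_a, col_b, width_b)
    return sorted(range(len(keys)), key=keys.__getitem__)


# Illustrative dictionary-encoded columns (made-up data).
col_a = [3, 1, 3, 0, 1]
col_b = [5, 9, 2, 7, 9]
WIDTH_B = 4  # assumed code width of col_b in bits

# The packing is only order-preserving if col_b values fit in WIDTH_B bits.
assert all(0 <= b < (1 << WIDTH_B) for b in col_b)

order = sort_massaged(col_a, col_b, WIDTH_B)
print(order)
```

Because the high bits of each packed key come from the first column and the low bits from the second, integer order on the packed key matches lexicographic order on the column pair, so the single sort and the two-column baseline return the same row order.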
- Paper ID: 5232
- Venue: SIGMOD
- Year: 2016
- Pagerank: 4.8336115e-05
- Overall Rank: 7,097 | 50.63%
- DOI: 10.1145/2882903.2915205
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
| --- | --- | --- | --- | --- |
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 35 | MonetDB/X100: Hyper-Pipelining Query Execution | 2005 | CIDR | 0.00076197749 |
| 351 | Sort vs. Hash Revisited: Fast Join Implementation on Modern Multi-Core CPUs | 2009 | VLDB | 0.0002636504 |
| 404 | Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited | 2014 | VLDB | 0.00024143076 |
| 585 | Massively Parallel Sort-Merge Joins in Main Memory Multi-Core Database Systems | 2012 | VLDB | 0.00019706145 |
| 596 | HYRISE—A Main Memory Hybrid Storage Engine | 2011 | VLDB | 0.00019481482 |
| 757 | Database Architecture Evolution: Mammals Flourished long before Dinosaurs became Extinct | 2009 | VLDB | 0.00017078358 |
| 868 | Profiling, What-if Analysis, and Cost-based Optimization of MapReduce Programs | 2011 | VLDB | 0.00015789681 |
| 930 | Fast Sort on CPUs and GPUs: A Case for Bandwidth Oblivious SIMD Sort | 2010 | SIGMOD | 0.00015238545 |
| 946 | Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture | 2008 | VLDB | 0.0001513324 |
| 958 | Rethinking SIMD Vectorization for In-Memory Databases | 2015 | SIGMOD | 0.00015045316 |
| 1,134 | Dictionary-based Order-preserving String Compression for Main Memory Column Stores | 2009 | SIGMOD | 0.00013761456 |
| 1,270 | BitWeaving: Fast Scans for Main Memory Data Processing | 2013 | SIGMOD | 0.00012926086 |
| 1,607 | A Comprehensive Study of Main-Memory Partitioning and its Application to Large-Scale Comparison- and Radix-Sort | 2014 | SIGMOD | 0.00011162682 |
| 1,618 | Row-wise Parallel Predicate Evaluation | 2008 | VLDB | 0.00011114015 |
| 1,731 | Fast Updates on Read-Optimized Databases Using Multi-Core CPUs | 2012 | VLDB | 0.0001073454 |
| 2,390 | ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout | 2015 | SIGMOD | 8.9084657e-05 |
| 2,412 | WideTable: An Accelerator for Analytical Data Processing | 2014 | VLDB | 8.8726508e-05 |
| 2,882 | Database Compression on Graphics Processors | 2010 | VLDB | 7.9661218e-05 |
| 4,042 | PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort | 2015 | VLDB | 6.5026989e-05 |
| 4,655 | SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures | 2015 | VLDB | 6.0221672e-05 |
| 5,532 | A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew | 2015 | SIGMOD | 5.4548897e-05 |
Semantically Similar Papers