CellSort: High Performance Sorting on the Cell Processor
Summary: Three-tiered distributed bitonic sort on Cell: SIMD in-SPE sort for 128KB, cross-SPE DMA-driven in-core merge, and out-of-core distributed merge. Achieves up to 1.7× faster than QuickSort on Xeon for in-SPE, up to 10× with 16 SPEs, and up to 4× on 0.5GB. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Buğra Gedik
- 2. Rajesh R. Bordawekar
- 3. Philip S. Yu
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 404 | Multi-Core, Main-Memory Joins: Sort vs. Hash Revisited | 2014 | VLDB | 0.00024143076 |
| 946 | Efficient Implementation of Sorting on Multi-Core SIMD CPU Architecture | 2008 | VLDB | 0.0001513324 |
| 950 | Data Processing on FPGAs | 2009 | VLDB | 0.00015108484 |
| 1,467 | SPADE: The System S Declarative Stream Processing Engine | 2008 | SIGMOD | 0.00011849864 |
| 4,042 | PARADIS: An Efficient Parallel Algorithm for In-place Radix Sort | 2015 | VLDB | 6.5026989e-05 |
| 4,655 | SIMD- and Cache-Friendly Algorithm for Sorting an Array of Structures | 2015 | VLDB | 6.0221672e-05 |
| 6,041 | FPGA: What's in it for a Database? | 2009 | SIGMOD | 5.2407055e-05 |
| 8,927 | An Application-Specific Instruction Set for Accelerating Set-Oriented Database Primitives | 2014 | SIGMOD | 4.427232e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 2 of 2 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 239 | GPUTeraSort: High Performance Graphics Co-processor Sorting for Large Database Management | 2006 | SIGMOD | 0.00031617428 |
| 343 | Implementing Database Operations Using SIMD Instructions | 2002 | SIGMOD | 0.00026768139 |
Previous
Page 1 / 1
Next