Templating Shuffles
Summary: TeShu is a unified, extensible shuffle service that represents optimization choices as parameterized shuffle templates, which capture application, workload, and data-center variability. Templates are instantiated via efficient sampling to quickly find near-optimal shuffles.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID: 500
- Venue: CIDR
- Year: 2023
- Pagerank: 4.1945683e-05
- Overall Rank: 11,154 | 22.41%
- DOI:
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Outgoing Citations (Sorted by Pagerank)
Showing 18 of 18 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4 | Pregel: A System for Large-Scale Graph Processing | 2010 | SIGMOD | 0.0019005923 |
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 288 | Storm @Twitter | 2014 | SIGMOD | 0.00028939871 |
| 396 | One Trillion Edges: Graph Processing at Facebook-Scale | 2015 | VLDB | 0.00024424102 |
| 557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988 |
| 1,326 | Starling: A Scalable Query Engine on Cloud Functions | 2020 | SIGMOD | 0.00012576952 |
| 1,543 | NUMA-aware algorithms: the case of data shuffling | 2013 | CIDR | 0.0001145318 |
| 2,424 | Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud Infrastructure | 2020 | SIGMOD | 8.8380822e-05 |
| 2,840 | Understanding the Effect of Data Center Resource Disaggregation on Production DBMSs | 2020 | VLDB | 8.0349523e-05 |
| 3,200 | Big Data Analytics with Datalog Queries on Spark | 2016 | SIGMOD | 7.3912411e-05 |
| 4,247 | Rethinking Data Management Systems for Disaggregated Data Centers | 2020 | CIDR | 6.325596e-05 |
| 4,483 | DFI: The Data Flow Interface for High-Speed Networks | 2021 | SIGMOD | 6.148188e-05 |
| 4,800 | Boxer: Data Analytics on Network-enabled Serverless Platforms | 2021 | CIDR | 5.9117077e-05 |
| 5,118 | AdaptDB: Adaptive Partitioning for Distributed Joins | 2017 | VLDB | 5.6820984e-05 |
| 6,282 | Cheetah: Accelerating Database Queries with Switch Pruning | 2020 | SIGMOD | 5.128797e-05 |
| 8,396 | Optimizing Declarative Graph Queries at Large Scale | 2019 | SIGMOD | 4.5276541e-05 |
| 8,462 | Topology-aware Parallel Data Processing: Models, Algorithms and Systems at Scale | 2020 | CIDR | 4.5056381e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,142 | A Software-Defined Networking based Approach for Performance Management of Analytical Queries on Distributed Data Stores | 2014 | SIGMOD | 5.6673393e-05 |
| 6,104 | Automating Distributed Tiered Storage Management in Cluster Computing | 2020 | VLDB | 5.2080102e-05 |
| 7,484 | Privacy Amplification via Shuffling: Unified, Simplified, and Tightened | 2024 | VLDB | 4.7180617e-05 |
| 6,388 | Optimizing Data-intensive Systems in Disaggregated Data Centers with TELEPORT | 2022 | SIGMOD | 5.0851841e-05 |
| 5,888 | Magnet: Push-based Shuffle Service for Large-scale Data Processing | 2020 | VLDB | 5.2873617e-05 |
| 8,512 | Network Shuffling: Privacy Amplification via Random Walks | 2022 | SIGMOD | 4.4947966e-05 |
| 8,649 | Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs | 2024 | SIGMOD | 4.4762914e-05 |
| 9,155 | Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance | 2024 | VLDB | 4.3849295e-05 |
| 1,543 | NUMA-aware algorithms: the case of data shuffling | 2013 | CIDR | 0.0001145318 |
| 4,248 | Hyper Dimension Shuffle: Efficient Data Repartition at Petabyte Scale in SCOPE | 2019 | VLDB | 6.3247927e-05 |