Templating Shuffles

Summary: TeShu: a unified, extensible shuffle service that represents optimization choices as parameterized shuffle templates capturing application, workload, and data‑center variability. Templates are instantiated via efficient sampling to quickly find near‑optimal shuffles. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 500
Venue: CIDR
Year: 2023
Pagerank: 4.1905499e-05
Overall Rank: 11,157 | 22.46%
DOI: -

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank

Outgoing Citations (Sorted by Pagerank)

Showing 18 of 18 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
4	Pregel: A System for Large-Scale Graph Processing	2010	SIGMOD	0.0019040811
66	Spark SQL: Relational Data Processing in Spark	2015	SIGMOD	0.00061707583
287	Storm @Twitter	2014	SIGMOD	0.00028917909
395	One Trillion Edges: Graph Processing at Facebook-Scale	2015	VLDB	0.00024440144
557	SystemML: Declarative Machine Learning on Spark	2016	VLDB	0.00020186115
1,324	Starling: A Scalable Query Engine on Cloud Functions	2020	SIGMOD	0.00012585081
1,540	NUMA-aware algorithms: the case of data shuffling	2013	CIDR	0.00011451745
2,420	Lambada: Interactive Data Analytics on Cold Data Using Serverless Cloud Infrastructure	2020	SIGMOD	8.8474951e-05
2,829	Understanding the Effect of Data Center Resource Disaggregation on Production DBMSs	2020	VLDB	8.0542308e-05
3,207	Big Data Analytics with Datalog Queries on Spark	2016	SIGMOD	7.3847098e-05
3,932	Rethinking Data Management Systems for Disaggregated Data Centers	2020	CIDR	6.6190727e-05
4,449	DFI: The Data Flow Interface for High-Speed Networks	2021	SIGMOD	6.1740574e-05
4,803	Boxer: Data Analytics on Network-enabled Serverless Platforms	2021	CIDR	5.906363e-05
5,116	AdaptDB: Adaptive Partitioning for Distributed Joins	2017	VLDB	5.6805476e-05
6,280	Cheetah: Accelerating Database Queries with Switch Pruning	2020	SIGMOD	5.1239052e-05
8,394	Optimizing Declarative Graph Queries at Large Scale	2019	SIGMOD	4.5233733e-05
8,458	Topology-aware Parallel Data Processing: Models, Algorithms and Systems at Scale	2020	CIDR	4.5013189e-05
9,505	Supporting Scalable Analytics with Latency Constraints	2015	VLDB	4.3300131e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
5,142	A Software-Defined Networking based Approach for Performance Management of Analytical Queries on Distributed Data Stores	2014	SIGMOD	5.6618933e-05
6,108	Automating Distributed Tiered Storage Management in Cluster Computing	2020	VLDB	5.2030492e-05
7,484	Privacy Amplification via Shuffling: Unified, Simplified, and Tightened	2024	VLDB	4.7135369e-05
5,649	Optimizing Data-intensive Systems in Disaggregated Data Centers with TELEPORT	2022	SIGMOD	5.3897976e-05
5,587	Magnet: Push-based Shuffle Service for Large-scale Data Processing	2020	VLDB	5.4193445e-05
8,509	Network Shuffling: Privacy Amplification via Random Walks	2022	SIGMOD	4.4904878e-05
8,647	Zero-sided RDMA: Network-driven Data Shuffling for Disaggregated Heterogeneous Cloud DBMSs	2024	SIGMOD	4.4720005e-05
8,454	Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance	2024	VLDB	4.5022073e-05
1,540	NUMA-aware algorithms: the case of data shuffling	2013	CIDR	0.00011451745
4,226	Hyper Dimension Shuffle: Efficient Data Repartition at Petabyte Scale in SCOPE	2019	VLDB	6.3382156e-05