Parallelism-Optimizing Data Placement for Faster Data-Parallel Computations
Summary: Shows that minimizing tail latency for data-parallel queries requires placing items so each query's accesses are spread across many machines to maximize per-query parallelism. Proposes a linear computable parallelism metric and a scalable partitioning-based placement optimizer; 7–64% p99 improvements on Solr/MongoDB. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Nirvik Baruah
- 2. Peter Kraft
- 3. Fiodar Kazhamiaka
- 4. Peter Bailis
- 5. Matei Zaharia
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,854 | Optimizing the cloud? Don't train models. Build oracles! | 2024 | CIDR | 4.4349047e-05 |
| 9,601 | SkyPIE: A Fast & Accurate Oracle for Object Placement | 2024 | SIGMOD | 4.3177432e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 209 | Schism: a Workload-Driven Approach to Database Replication and Partitioning | 2010 | VLDB | 0.00034468292 |
| 1,092 | E-Store: Fine-Grained Elastic Partitioning for Distributed Transaction Processing Systems | 2015 | VLDB | 0.00014135961 |
| 1,588 | Druid: A Real-time Analytical Data Store | 2014 | SIGMOD | 0.00011239313 |
| 3,005 | Clay: Fine-Grained Adaptive Partitioning for General Database Schemas | 2017 | VLDB | 7.7303579e-05 |
| 3,675 | Accordion: Elastic Scalability for Database Systems Supporting Distributed Transactions | 2014 | VLDB | 6.8555664e-05 |
| 5,451 | NashDB: An End-to-End Economic Method for Elastic Database Fragmentation, Replication, and Provisioning | 2018 | SIGMOD | 5.5002949e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,110 | Parallel Evaluation of Conjunctive Queries | 2011 | PODS | 0.00013968198 |
| 4,261 | Parallelizing Query Optimization | 2008 | VLDB | 6.31244e-05 |
| 6,337 | Parallelizing Extensible Query Optimizers | 2009 | SIGMOD | 5.1053757e-05 |
| 3,124 | Parallel Query Scheduling and Optimization with Time- and Space-Shared Resources | 1997 | VLDB | 7.5201555e-05 |
| 6,304 | Elastic Pipelining in an In-Memory Database Cluster | 2016 | SIGMOD | 5.1210182e-05 |
| 2,413 | Automated Partitioning Design in Parallel Database Systems | 2011 | SIGMOD | 8.8672223e-05 |
| 7,913 | Resource Bricolage for Parallel Database Systems | 2015 | VLDB | 4.6180739e-05 |
| 438 | Query Optimization for Parallel Execution | 1992 | SIGMOD | 0.00023199245 |
| 1,825 | Optimization Algorithms for Exploiting the Parallelism-Communication Tradeoff in Pipelined Parallelism | 1994 | VLDB | 0.00010401739 |
| 9,305 | Parallelizing Query Optimization on Shared-Nothing Architectures | 2016 | VLDB | 4.3577129e-05 |