Skew-Aware Join Optimization for Array Databases
Summary: Skew-aware, tile-based join optimization for distributed arrays handles nonuniformity via data-aware reorganization. Two-phase planner selects algorithm and tile granularity; then maps tiles to cluster nodes with a cost model, delivering 2.5x speedups. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,574 | Incremental View Maintenance over Array Data | 2017 | SIGMOD | 6.0738556e-05 |
| 6,507 | Similarity Join over Array Data | 2016 | SIGMOD | 5.0337166e-05 |
| 6,619 | Near-Optimal Distributed Band-Joins through Recursive Partitioning | 2020 | SIGMOD | 4.9910152e-05 |
| 7,153 | Submodularity of Distributed Join Computation | 2018 | SIGMOD | 4.8153963e-05 |
| 7,476 | Lachesis: Automatic Partitioning for UDF-Centric Analytics | 2021 | VLDB | 4.7188928e-05 |
| 8,957 | Adaptive Quotient Filters | 2024 | SIGMOD | 4.4211093e-05 |
| 10,662 | ArrayMorph: Optimizing Hyperslab Queries on the Cloud for Machine Learning Pipelines | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 16 of 16 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,132 | Advanced Join Strategies for Large-Scale Distributed Computation | 2014 | VLDB | 6.4241067e-05 |
| 588 | Practical Skew Handling in Parallel Joins | 1992 | VLDB | 0.00019604754 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 1,619 | Adaptive Optimization of Very Large Join Queries | 2018 | SIGMOD | 0.00011111678 |
| 2,275 | Adopting Worst-Case Optimal Joins in Relational Database Systems | 2020 | VLDB | 9.1262202e-05 |
| 11,890 | Let's Rethink Join Optimization in Distributed Systems | 2015 | CIDR | 4.1945683e-05 |
| 2,044 | Optimization of Multi-Way Join Queries for Parallel Execution | 1991 | VLDB | 9.6953608e-05 |
| 861 | A Taxonomy and Performance Model of Data Skew Effects in Parallel Joins | 1991 | VLDB | 0.00015848554 |
| 6,214 | Skew Handling Techniques in Sort-Merge Join | 2002 | SIGMOD | 5.1546943e-05 |
| 6,507 | Similarity Join over Array Data | 2016 | SIGMOD | 5.0337166e-05 |