Optimizing I/O for Big Array Analytics
Summary: Introduces a declarative framework for big array analytics via nested-loop tasks, exposing shared I/O opportunities. An optimizer finds execution plans that exploit cross-step I/O sharing, yielding notable data-movement savings. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,402 | Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML | 2014 | VLDB | 0.00012180605 |
| 1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681 |
| 4,376 | Just-in-time compilation for SQL query processing | 2013 | VLDB | 6.2424797e-05 |
| 5,667 | Code generation for efficient query processing in managed runtimes | 2014 | VLDB | 5.3806399e-05 |
| 7,823 | Measuring and Optimizing Distributed Array Programs | 2016 | VLDB | 4.6419393e-05 |
| 8,620 | PreVision: An Out-of-Core Matrix Computation System with Optimal Buffer Replacement | 2024 | SIGMOD | 4.4837361e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 35 | MonetDB/X100: Hyper-Pipelining Query Execution | 2005 | CIDR | 0.00076197749 |
| 36 | Fast Algorithms for Mining Association Rules | 1994 | VLDB | 0.00076161096 |
| 179 | Efficient and Extensible Algorithms for Multi Query Optimization | 2000 | SIGMOD | 0.00037672155 |
| 318 | Overview of SciDB: Large Scale Array Storage, Processing and Analysis | 2010 | SIGMOD | 0.00027795661 |
| 515 | QPipe: A Simultaneously Pipelined Relational Query Engine | 2005 | SIGMOD | 0.00021214633 |
| 1,026 | Cooperative Scans: Dynamic Bandwidth Sharing in a DBMS | 2007 | VLDB | 0.00014589172 |
| 1,076 | RIOT: I/O-Efficient Numerical Computing without SQL | 2009 | CIDR | 0.00014248449 |
| 1,299 | The DataPath System: A Data-Centric Analytic Processing Engine for Large Data Warehouses | 2010 | SIGMOD | 0.00012751522 |
| 9,426 | Storing Matrices on Disk: Theory and Practice Revisited | 2011 | VLDB | 4.3441378e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 5,014 | Dynamically Optimizing Queries over Large Scale Data Platforms | 2014 | SIGMOD | 5.7586174e-05 |
| 5,209 | Explaining Outputs in Modern Data Analytics | 2016 | VLDB | 5.629362e-05 |
| 5,960 | Skew-Aware Join Optimization for Array Databases | 2015 | SIGMOD | 5.2559595e-05 |
| 3,779 | Instance-Optimized Data Layouts for Cloud Analytics Workloads | 2021 | SIGMOD | 6.7747205e-05 |
| 1,876 | ArrayStore: A Storage Manager for Complex Parallel Array Processing | 2011 | SIGMOD | 0.00010239284 |
| 5,368 | Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing | 2022 | VLDB | 5.5457532e-05 |
| 2,611 | Opening the Black Boxes in Data Flow Optimization | 2012 | VLDB | 8.4536967e-05 |
| 2,172 | Spinning Fast Iterative Data Flows | 2012 | VLDB | 9.3706587e-05 |
| 1,939 | From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System | 2015 | SIGMOD | 0.00010025655 |
| 8,534 | Translation of Array-Based Loops to Distributed Data-Parallel Programs | 2020 | VLDB | 4.4937074e-05 |