Database Paper Browser

Back to papers

Overview of SciDB: Large Scale Array Storage, Processing and Analysis

Summary: Massively parallel, open-source array DB for petabyte-scale scientific data; introduces a novel storage manager, array data model, and extensible query language. Architecture and design enable parallelized array processing across astronomy, climate, biology, and large-scale log analytics. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4308
Venue
SIGMOD
Year
2010
Pagerank
0.00027795661
Overall Rank
318 | 97.79%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 53 citing papers.

Rank Citing Paper Year Venue Pagerank
734 The TileDB Array Data Storage Manager 2017 VLDB 0.00017455248
761 Materialization Optimizations for Feature Selection Workloads 2014 SIGMOD 0.00017053783
1,876 ArrayStore: A Storage Manager for Complex Parallel Array Processing 2011 SIGMOD 0.00010239284
2,667 Cumulon: Optimizing Statistical Data Analysis in the Cloud 2013 SIGMOD 8.3413995e-05
2,757 Parallel Data Analysis Directly on Scientific File Formats 2014 SIGMOD 8.1679384e-05
2,848 Exploiting Matrix Dependency for Efficient Distributed Matrix Computation 2015 SIGMOD 8.0208832e-05
3,058 Rethinking Data-Intensive Science Using Scalable Analytics Systems 2015 SIGMOD 7.6410159e-05
3,147 Searchlight: Enabling Integrated Search and Exploration over Large Multidimensional Data 2015 VLDB 7.4771804e-05
3,254 Query Processing on Tensor Computation Runtimes 2022 VLDB 7.3161051e-05
3,343 Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads 2017 VLDB 7.1967343e-05
3,763 Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System 2022 VLDB 6.7801795e-05
3,904 Progressive Top-k Subarray Query Processing in Array Databases 2019 VLDB 6.6424961e-05
3,948 A Comparative Evaluation of Systems for Scalable Linear Algebra-based Analytics 2018 VLDB 6.5959084e-05
3,958 MLog: Towards Declarative In-Database Machine Learning 2017 VLDB 6.5897636e-05
3,982 The Myria Big Data Management and Analytics System and Cloud Service 2017 CIDR 6.5651188e-05
4,014 Exploiting Correlations for Expensive Predicate Evaluation 2015 SIGMOD 6.5273084e-05
4,197 Incremental View Maintenance with Triple Lock Factorization Benefits 2018 SIGMOD 6.367895e-05
4,259 Optimizing I/O for Big Array Analytics 2012 VLDB 6.3147285e-05
4,574 Incremental View Maintenance over Array Data 2017 SIGMOD 6.0738556e-05
4,820 SciQL: Array Data Processing Inside an RDBMS 2013 SIGMOD 5.8972557e-05
4,839 ChronosDB: Distributed, File Based, Geospatial Array DBMS 2018 VLDB 5.8875955e-05
5,039 VisualWorldDB: A DBMS for the Visual World 2020 CIDR 5.7425824e-05
5,290 LightDB: A DBMS for Virtual Reality Video 2018 VLDB 5.5828169e-05
5,821 Tensor Relational Algebra for Distributed Machine Learning System Design 2021 VLDB 5.3134851e-05
5,960 Skew-Aware Join Optimization for Array Databases 2015 SIGMOD 5.2559595e-05
6,123 Data Ingestion for the Connected World 2017 CIDR 5.1991194e-05
6,322 The BUDS Language for Distributed Bayesian Machine Learning 2017 SIGMOD 5.1124615e-05
6,507 Similarity Join over Array Data 2016 SIGMOD 5.0337166e-05
6,745 DistME: A Fast and Elastic Distributed Matrix Computation Engine using GPUs 2019 SIGMOD 4.9417155e-05
7,134 Incremental Elasticity For Array Databases 2014 SIGMOD 4.822331e-05
7,369 Using VDMS to Index and Search 100M Images 2021 VLDB 4.750437e-05
7,823 Measuring and Optimizing Distributed Array Programs 2016 VLDB 4.6419393e-05
7,903 A Demonstration of Iterative Parallel Array Processing in Support of Telescope Image Analysis 2013 VLDB 4.6215911e-05
8,262 FuseME: Distributed Matrix Computation Engine based on Cuboid-based Fused Operator and Plan Generation 2022 SIGMOD 4.5467867e-05
8,534 Translation of Array-Based Loops to Distributed Data-Parallel Programs 2020 VLDB 4.4937074e-05
8,796 Interactive Search and Exploration of Waveform Data with Searchlight 2016 SIGMOD 4.4494067e-05
8,922 Enabling Signal Processing over Data Streams 2017 SIGMOD 4.427232e-05
9,332 PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development 2018 SIGMOD 4.3556432e-05
9,382 Hephaestus: Data Reuse for Accelerating Scientific Discovery 2015 CIDR 4.3457368e-05
9,426 Storing Matrices on Disk: Theory and Practice Revisited 2011 VLDB 4.3441378e-05
9,437 BlockJoin: Efficient Matrix Partitioning Through Joins 2017 VLDB 4.3425552e-05
10,378 HyperMR: Efficient Hypergraph-enhanced Matrix Storage on Compute-in-Memory Architecture 2025 SIGMOD 4.1945683e-05
10,621 BLAEQ: A Multigrid Index for Spatial Query on Geometry Data 2025 VLDB 4.1945683e-05
10,662 ArrayMorph: Optimizing Hyperslab Queries on the Cloud for Machine Learning Pipelines 2025 VLDB 4.1945683e-05
10,757 Polaris: An Interactive and Scalable Data Infrastructure for Polar Science 2025 VLDB 4.1945683e-05
10,864 RDPro: Distributed Processing of Big Raster Data 2025 VLDB 4.1945683e-05
11,339 Redundancy Elimination in Distributed Matrix Computation 2022 SIGMOD 4.1945683e-05
11,402 ReMac: A Matrix Computation System with Redundancy Elimination 2022 VLDB 4.1945683e-05
11,472 Hybrid Evaluation for Distributed Iterative Matrix Computation 2021 SIGMOD 4.1945683e-05
11,758 Demonstrating the BigDAWG Polystore System for Ocean Metagenomic Analysis 2017 CIDR 4.1945683e-05
Previous Page 1 / 2 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
21 C-Store: A Column-oriented DBMS 2005 VLDB 0.00086087497
44 The Design Of Postgres 1986 SIGMOD 0.00071838587
1,239 A Demonstration of SciDB: A Science-Oriented DBMS 2009 VLDB 0.00013102195
Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
4,820 SciQL: Array Data Processing Inside an RDBMS 2013 SIGMOD 5.8972557e-05
1,909 SciBORQ: Scientific data management with Bounds On Runtime and Quality 2011 CIDR 0.00010121304
11,885 The Case for Small Data Management 2015 CIDR 4.1945683e-05
5,470 Cloud Databases: What’s New? 2010 VLDB 5.4894049e-05
11,860 Database System Support of Simulation Data 2016 VLDB 4.1945683e-05
928 Requirements for Science Data Bases and SciDB 2009 CIDR 0.00015247726
2,757 Parallel Data Analysis Directly on Scientific File Formats 2014 SIGMOD 8.1679384e-05
734 The TileDB Array Data Storage Manager 2017 VLDB 0.00017455248
13,486 Managing Scientific Data: Lessons, Challenges, and Opportunities 2011 SIGMOD -
1,239 A Demonstration of SciDB: A Science-Oriented DBMS 2009 VLDB 0.00013102195