Database Paper Browser

Back to papers

DAPHNE: An Open and Extensible System Infrastructure for Integrated Data Analysis Pipelines

Summary: DAPHNE: extensible infra unifying DM, HPC and ML pipelines via shared language abstractions, compiler/runtime integration and multi-level scheduling. Key novelty: vectorized engine for computational storage and accelerators to cut data-movement/format overheads for local and distributed ops; prelim results show notable speedups. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
451
Venue
CIDR
Year
2022
Pagerank
4.7678574e-05
Overall Rank
7,306 | 49.18%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 37 of 37 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
4 Pregel: A System for Large-Scale Graph Processing 2010 SIGMOD 0.0019005923
35 MonetDB/X100: Hyper-Pipelining Query Execution 2005 CIDR 0.00076197749
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
418 Morsel-Driven Parallelism: A NUMA-Aware Query Evaluation Framework for the Many-Core Age 2014 SIGMOD 0.00023729211
543 MLbase: A Distributed Machine-learning System 2013 CIDR 0.00020526854
658 Towards a Unified Architecture for in-RDBMS Analytics 2012 SIGMOD 0.00018506577
821 Designing And Mining Multi-Terabyte Astronomy Archives: The Sloan Digital Sky Survey 2000 SIGMOD 0.00016272349
1,402 Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML 2014 VLDB 0.00012180605
1,495 Ricardo: Integrating R and Hadoop 2010 SIGMOD 0.00011691049
1,532 Data Management in Machine Learning: Challenges, Techniques, and Systems 2017 SIGMOD 0.00011472681
1,630 Garlic: A New Flavor of Federated Query Processing for DB2 2002 SIGMOD 0.0001108111
1,882 Tuplex: Data Science in Python at Native Code Speed 2021 SIGMOD 0.0001021625
1,940 SliceLine: Fast, Linear-Algebra-based Slice Finding for ML Model Debugging 2021 SIGMOD 0.00010020173
2,122 SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle 2020 CIDR 9.4989076e-05
2,443 Data Management for Data Science: Towards Embedded Analytics 2020 CIDR 8.8078476e-05
2,573 Query Optimization for Dynamic Imputation 2017 VLDB 8.518235e-05
2,611 Opening the Black Boxes in Data Flow Optimization 2012 VLDB 8.4536967e-05
3,327 Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects 2020 SIGMOD 7.2205738e-05
3,330 Adapting to Source Properties in Processing Data Integration Queries 2004 SIGMOD 7.2150831e-05
3,918 On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML 2018 VLDB 6.6315176e-05
4,397 Estimating Compilation Time of a Query Optimizer 2003 SIGMOD 6.2230918e-05
4,484 The bionic DBMS is coming, but what will it look like? 2013 CIDR 6.1475055e-05
4,701 Tensors: An abstraction for general data processing 2021 VLDB 5.9866564e-05
4,774 LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems 2021 SIGMOD 5.9316087e-05
5,067 Not your Grandpa's SSD: The Era of Co-Designed Storage Devices 2021 SIGMOD 5.7215985e-05
5,087 Accelerating Queries with Group-By and Join by Groupjoin 2011 VLDB 5.7075009e-05
5,216 Computational Storage: Where Are We Today? 2021 CIDR 5.6228313e-05
5,586 QuERy: A Framework for Integrating Entity Resolution with Query Processing 2016 VLDB 5.4219548e-05
5,720 BAGUA: Scaling up Distributed Learning with System Relaxations 2022 VLDB 5.3527734e-05
5,964 Bridging Two Worlds with RICE: Integrating R into the SAP In-Memory Computing Engine 2011 VLDB 5.2520617e-05
6,964 A Morsel-Driven Query Execution Engine for Heterogeneous Multi-Cores 2019 VLDB 4.8815971e-05
7,209 GPU-accelerated data management under the test of time 2020 CIDR 4.7996023e-05
7,243 Data Integration and Machine Learning: A Natural Synergy 2018 VLDB 4.7913666e-05
7,704 ExDRa: Exploratory Data Science on Federated Raw Data 2021 SIGMOD 4.6733838e-05
7,811 Hardware-Oblivious SIMD Parallelism for In-Memory Column-Stores 2020 CIDR 4.6445165e-05
8,462 Topology-aware Parallel Data Processing: Models, Algorithms and Systems at Scale 2020 CIDR 4.5056381e-05
9,001 The Power of Nested Parallelism in Big Data Processing – Hitting Three Flies with One Slap – 2021 SIGMOD 4.4107627e-05
Previous Page 1 / 1 Next

Semantically Similar Papers