Database Paper Browser

ExDRa: Exploratory Data Science on Federated Raw Data

Summary: ExDRa enables exploratory data science on federated raw data through ad-hoc integration, reuse of intermediates, and lifecycle optimization for partially accessible data. It adds a federated backend to SystemDS for linear algebra, parameter servers (PS), and data preparation, enabling enterprise federated ML and privacy-aware data management. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 6238
Venue: SIGMOD
Year: 2021
Pagerank: 4.6733838e-05
Overall Rank: 7,704 | 46.41%
DOI: 10.1145/3448016.3457549

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 7 of 7 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 38 of 38 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801
168 | MAD Skills: New Analysis Practices for Big Data | 2009 | VLDB | 0.00038946305
192 | HoloClean: Holistic Data Repairs with Probabilistic Inference | 2017 | VLDB | 0.00035728858
328 | An Architecture for Parallel Topic Models | 2010 | VLDB | 0.0002728514
408 | Database Cracking | 2007 | CIDR | 0.00023953844
667 | Incremental Knowledge Base Construction Using DeepDive | 2015 | VLDB | 0.00018440557
761 | Materialization Optimizations for Feature Selection Workloads | 2014 | SIGMOD | 0.00017053783
921 | Democratizing Data Science through Interactive Curation of ML Pipelines | 2019 | SIGMOD | 0.00015337438
1,143 | Privacy Preserving Vertical Federated Learning for Tree-based Models | 2020 | VLDB | 0.00013710269
1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695
1,337 | HoloDetect: Few-Shot Learning for Error Detection | 2019 | SIGMOD | 0.00012497164
1,343 | NoDB: Efficient Query Execution on Raw Data Files | 2012 | SIGMOD | 0.00012482538
1,630 | Garlic: A New Flavor of Federated Query Processing for DB2 | 2002 | SIGMOD | 0.0001108111
1,666 | HELIX: Holistic Optimization for Accelerating Iterative Machine Learning | 2019 | VLDB | 0.0001096361
1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691
1,967 | Compressed Linear Algebra for Large-Scale Machine Learning | 2016 | VLDB | 9.9131712e-05
2,022 | Lazy Maintenance of Materialized Views | 2007 | VLDB | 9.754634e-05
2,122 | SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle | 2020 | CIDR | 9.4989076e-05
2,152 | MISTIQUE: A System to Store and Query Model Intermediates for Model Diagnosis | 2018 | SIGMOD | 9.4239787e-05
2,255 | LINVIEW: Incremental View Maintenance for Complex Analytical Queries | 2014 | SIGMOD | 9.1884983e-05
2,302 | Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions | 2021 | VLDB | 9.0668832e-05
2,367 | Here are my Data Files. Here are my Queries. Where are my Results? | 2011 | CIDR | 8.9511058e-05
2,573 | Query Optimization for Dynamic Imputation | 2017 | VLDB | 8.518235e-05
2,693 | An Architecture for Recycling Intermediates in a Column-store | 2009 | SIGMOD | 8.2883398e-05
2,928 | WANalytics: Analytics for a Geo-Distributed Data-Intensive World | 2015 | CIDR | 7.8812874e-05
2,934 | AIDA - Abstraction for Advanced In-Database Analytics | 2018 | VLDB | 7.8595778e-05
3,023 | Helix: Accelerating Human-in-the-loop Machine Learning | 2018 | VLDB | 7.6929986e-05
3,481 | Building High Throughput Permissioned Blockchain Fabrics: Challenges and Opportunities | 2020 | VLDB | 7.0534352e-05
4,196 | Overton: A Data System for Monitoring and Improving Machine-Learned Products | 2020 | CIDR | 6.3686231e-05
4,326 | Fast Queries Over Heterogeneous Data Through Engine Customization | 2016 | VLDB | 6.288323e-05
4,409 | Declarative Recursive Computation on an RDBMS | 2019 | VLDB | 6.2104034e-05
4,774 | LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems | 2021 | SIGMOD | 5.9316087e-05
5,427 | The NebulaStream Platform: Data and Application Management for the Internet of Things | 2020 | CIDR | 5.509468e-05
5,433 | "Amnesia" - A Selection of Machine Learning Models That Can Forget User Data Very Fast | 2020 | CIDR | 5.5051607e-05
6,053 | Optimizing Machine Learning Workloads in Collaborative Environments | 2020 | SIGMOD | 5.2326838e-05
6,538 | Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent | 2019 | SIGMOD | 5.023239e-05
6,648 | Grizzly: Efficient Stream Processing Through Adaptive Query Compilation | 2020 | SIGMOD | 4.9771723e-05
7,414 | State of Public and Private Blockchains: Myths and Reality | 2019 | SIGMOD | 4.7356435e-05

Semantically Similar Papers