Database Paper Browser

Back to papers

Magpie: Python at Speed and Scale using Cloud Backends

Summary: Magpie exposes the Pandas API but lazily pushes dataframe work into cloud query engines (SQL DW, Spark, SCOPE) through a common data layer, avoiding cross-engine transfer and leveraging DB-grade features. It auto-selects optimal backends to deliver database-scale performance to Python analytics; production traces show ~25% of internal computations could benefit. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
407
Venue
CIDR
Year
2021
Pagerank
7.8262582e-05
Overall Rank
2,954 | 79.46%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 16 of 16 citing papers.

Rank Citing Paper Year Venue Pagerank
3,393 Lux: Always-on Visualization Recommendations for Exploratory Dataframe Workflows 2022 VLDB 7.1483239e-05
3,407 End-to-end Optimization of Machine Learning Prediction Queries 2022 SIGMOD 7.1295646e-05
3,763 Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System 2022 VLDB 6.7801795e-05
4,773 PolyFrame: A Retargetable Query-based Approach to Scaling Dataframes 2021 VLDB 5.9320139e-05
5,731 Babelfish: Efficient Execution of Polyglot Queries 2022 VLDB 5.3502065e-05
6,261 The Cosmos Big Data Platform at Microsoft: Over a Decade of Progress and a Decade to Look Forward 2021 VLDB 5.1350714e-05
6,541 ConnectorX: Accelerating Data Loading From Databases to Dataframes 2022 VLDB 5.0216945e-05
6,701 YeSQL: “You extend SQL” with Rich and Highly Performant User-Defined Functions in Relational Databases 2022 VLDB 4.9561066e-05
6,895 Decentralized Actor Scheduling and Reference-based Storage in Xorbits: a Native Scalable Data Science Engine 2025 VLDB 4.8925595e-05
8,583 Efficient Execution of User-Defined Functions in SQL Queries 2023 VLDB 4.4919445e-05
8,645 Predicate Pushdown for Data Science Pipelines 2023 SIGMOD 4.4772518e-05
9,343 The Key to Effective UDF Optimization: Before Inlining, First Perform Outlining 2025 VLDB 4.3546206e-05
9,762 QURE: AI-Assisted and Automatically Verified UDF Inlining 2025 SIGMOD 4.2856106e-05
9,911 Dias: Dynamic Rewriting of Pandas Code 2024 SIGMOD 4.2565279e-05
10,931 Proactive Resume and Pause of Resources for Microsoft Azure SQL Database Serverless 2024 SIGMOD 4.1945683e-05
11,024 SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis 2024 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 14 of 14 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers