Decentralized Actor Scheduling and Reference-based Storage in Xorbits: a Native Scalable Data Science Engine
Summary: Xorbits’ decentralized actor model (Xoscar) avoids a global scheduler to enable fine-grained distribution of pipeline operators. Reference-based distributed storage unifies heterogeneous memory and key-level intermediates, yielding up to 3.22× speedups. (summarized by gpt-5-mini on Feb 09 2026)
Authors
1. Weizheng Lu
2. Chao Hui
3. Yunhai Wang
4. Feng Zhang
5. Yueguo Chen
6. Bao Liu
7. Chengjie Li
8. Zhaoxin Wu
9. Xuye Qin
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,243 | TPCx-AI under the Microscope: A Benchmarking Debt Analysis | 2026 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 1,427 | Towards Scalable Dataframe Systems | 2020 | VLDB | 0.0001204248 |
| 2,754 | Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems | 2015 | VLDB | 8.169411e-05 |
| 2,954 | Magpie: Python at Speed and Scale using Cloud Backends | 2021 | CIDR | 7.8262582e-05 |
| 3,252 | Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks | 2020 | SIGMOD | 7.3178277e-05 |
| 3,763 | Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System | 2022 | VLDB | 6.7801795e-05 |
| 4,773 | PolyFrame: A Retargetable Query-based Approach to Scaling Dataframes | 2021 | VLDB | 5.9320139e-05 |
| 5,605 | TPCx-AI - An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems | 2023 | VLDB | 5.4142007e-05 |
| 7,059 | Adaptive and Robust Query Execution for Lakehouses at Scale | 2024 | VLDB | 4.8477825e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 11,679 | I Can't Believe It's Not (Only) Software! Bionic Distributed Storage for Parquet Files | 2019 | VLDB | 4.1945683e-05 |
| 2,848 | Exploiting Matrix Dependency for Efficient Distributed Matrix Computation | 2015 | SIGMOD | 8.0208832e-05 |
| 4,557 | Distributed Deep Learning on Data Systems: A Comparative Analysis of Approaches | 2021 | VLDB | 6.087611e-05 |
| 7,813 | GraphScope: A One-Stop Large Graph Processing System | 2021 | VLDB | 4.6441779e-05 |
| 6,541 | ConnectorX: Accelerating Data Loading From Databases to Dataframes | 2022 | VLDB | 5.0216945e-05 |
| 3,058 | Rethinking Data-Intensive Science Using Scalable Analytics Systems | 2015 | SIGMOD | 7.6410159e-05 |
| 543 | MLbase: A Distributed Machine-learning System | 2013 | CIDR | 0.00020526854 |
| 2,172 | Spinning Fast Iterative Data Flows | 2012 | VLDB | 9.3706587e-05 |
| 6,836 | An Algebraic Approach for Data-Centric Scientific Workflows | 2011 | VLDB | 4.9114673e-05 |
| 10,482 | Fast and Scalable Data Transfer Across Data Systems | 2025 | SIGMOD | 4.1945683e-05 |