An Experimental Evaluation of Large Scale GBDT Systems
Summary: A systematic evaluation of data management policies in distributed GBDT systems. The paper introduces a quadrant model that classifies systems by data partitioning scheme (horizontal or vertical) and storage layout (row-store or column-store). It proposes Vero (vertical partitioning + row-store), benchmarks the quadrants against state-of-the-art systems, and derives workload-driven guidelines for choosing a policy. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Fangcheng Fu
2. Jiawei Jiang
3. Yingxia Shao
4. Bin Cui
Incoming Citations (Sorted by Pagerank)
Showing 8 of 8 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,143 | Privacy Preserving Vertical Federated Learning for Tree-based Models | 2020 | VLDB | 0.00013710269 |
| 1,895 | VF2Boost: Very Fast Vertical Federated Gradient Boosting for Cross-Enterprise Learning | 2021 | SIGMOD | 0.00010180896 |
| 2,791 | Towards Demystifying Serverless Machine Learning Training | 2021 | SIGMOD | 8.1206618e-05 |
| 3,506 | BlindFL: Vertical Federated Machine Learning without Peeking into Your Data | 2022 | SIGMOD | 7.0291192e-05 |
| 6,566 | Reliable Data Distillation on Graph Convolutional Network | 2020 | SIGMOD | 5.0074274e-05 |
| 8,808 | FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement | 2023 | SIGMOD | 4.4454035e-05 |
| 9,222 | Towards an Optimized GROUP BY Abstraction for Large-Scale Machine Learning | 2021 | VLDB | 4.3698672e-05 |
| 9,806 | The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format | 2024 | SIGMOD | 4.2805224e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 126 | Space-Efficient Online Computation of Quantile Summaries | 2001 | SIGMOD | 0.00044744986 |
| 131 | Integrating Compression and Execution in Column-Oriented Database Systems | 2006 | SIGMOD | 0.0004370331 |
| 286 | Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design | 2004 | SIGMOD | 0.00028990057 |
| 497 | Column-Stores vs. Row-Stores: How Different Are They Really? | 2008 | SIGMOD | 0.00021716559 |
| 1,044 | DimmWitted: A Study of Main-Memory Statistical Analytics | 2014 | VLDB | 0.00014475229 |
| 1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691 |
| 2,953 | Moment-Based Quantile Sketches for Efficient High Cardinality Aggregation Queries | 2018 | VLDB | 7.8267643e-05 |
| 4,964 | PS2: Parameter Server on Spark | 2019 | SIGMOD | 5.7965988e-05 |
| 9,469 | DimBoost: Boosting Gradient Boosting Decision Tree to Higher Dimensions | 2018 | SIGMOD | 4.3342363e-05 |