Database Paper Browser

Back to papers

ByteCard: Enhancing ByteDance’s Data Warehouse with Learned Cardinality Estimation

Summary: ByteCard trains and deploys cardinality estimators in ByteHouse, replacing Selinger-style estimates for huge-scale queries. Balancing accuracy and practicality (latency, size, training cost), it yields up to 30% lower 99th percentile latency on real data. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6784
Venue
SIGMOD
Year
2024
Pagerank
4.4394021e-05
Overall Rank
8,834 | 38.55%
DOI
10.1145/3626246.3653376

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 38 of 38 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1 Access Path Selection in a Relational Database Management System 1979 SIGMOD 0.0040449103
71 How Good Are Query Optimizers, Really? 2016 VLDB 0.00059038975
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
183 Automatic Database Management System Tuning Through Large-scale Machine Learning 2017 SIGMOD 0.00036721403
204 Learned Cardinalities: Estimating Correlated Joins with Deep Learning 2019 CIDR 0.00034784455
310 The Vertica Analytic Database: C-Store 7 Years Later 2012 VLDB 0.00028132402
378 Towards Estimation Error Guarantees for Distinct Values 2000 PODS 0.0002497492
488 TiDB: A Raft-based HTAP Database 2020 VLDB 0.000220409
514 An End-to-End Automatic Cloud Database Tuning System Using Deep Reinforcement Learning 2019 SIGMOD 0.0002124895
608 DeepDB: Learn from Data, not from Queries! 2020 VLDB 0.00019235898
758 Deep Unsupervised Cardinality Estimation 2020 VLDB 0.0001706608
806 An End-to-End Learning-based Cost Estimator 2020 VLDB 0.00016434274
910 NeuroCard: One Cardinality Estimator for All Tables 2021 VLDB 0.00015423056
953 Runtime Measurements in the Cloud: Observing, Analyzing, and Reducing Variance 2010 VLDB 0.00015095431
1,254 Selectivity Estimation for Range Predicates using Lightweight Models 2019 VLDB 0.00013027411
1,284 Amazon Redshift Re-invented 2022 SIGMOD 0.00012837822
1,313 Cost-Based Optimization for Magic: Algebra and Implementation 1996 SIGMOD 0.0001263831
1,590 Column-oriented Database Systems 2009 VLDB 0.00011233838
1,638 Cardinality Estimation in DBMS: A Comprehensive Benchmark Evaluation 2022 VLDB 0.00011049779
2,762 FLAT: Fast, Lightweight and Accurate Method for Cardinality Estimation 2021 VLDB 8.1585394e-05
3,152 AnalyticDB: Real-time OLAP Database System at Alibaba Cloud 2019 VLDB 7.4711766e-05
3,449 Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation 2022 VLDB 7.0824319e-05
3,499 Fauce: Fast and Accurate Deep Ensembles with Uncertainty for Cardinality Estimation 2021 VLDB 7.0376445e-05
3,522 ResTune: Resource Oriented Tuning Boosted by Meta-Learning for Cloud Databases 2021 SIGMOD 7.0096727e-05
3,625 Cost Models for Big Data Query Processing: Learning, Retrofitting, and Our Findings 2020 SIGMOD 6.9055212e-05
3,793 Constructing and Analyzing the LSM Compaction Design Space 2021 VLDB 6.7617833e-05
3,828 Zero-Shot Cost Models for Out-of-the-box Learned Cost Prediction 2022 VLDB 6.7208524e-05
3,990 FactorJoin: A New Cardinality Estimation Framework for Join Queries 2023 SIGMOD 6.5581983e-05
4,152 openGauss: An Autonomous Database System 2021 VLDB 6.4060406e-05
4,417 Robust Query Driven Cardinality Estimation under Changing Workloads 2023 VLDB 6.2037371e-05
4,543 FACE: A Normalizing Flow based Cardinality Estimator 2022 VLDB 6.1011198e-05
4,593 Auto-WLM: Machine Learning Enhanced Workload Management in Amazon Redshift 2023 SIGMOD 6.0606891e-05
5,258 One Model to Rule them All: Towards Zero-Shot Learning for Databases 2022 CIDR 5.5998705e-05
5,531 Presto: A Decade of SQL Analytics at Meta 2023 SIGMOD 5.4549499e-05
5,633 Analyzing the Impact of Cardinality Estimation on Execution Plans in Microsoft SQL Server 2023 VLDB 5.4011156e-05
6,775 A Unified Transferable Model for ML-Enhanced DBMS 2022 CIDR 4.9299192e-05
7,610 Learning to be a Statistician: Learned Estimator for Number of Distinct Values 2022 VLDB 4.6965039e-05
8,442 SageDB: An Instance-Optimized Data Analytics System 2022 VLDB 4.5120602e-05
Previous Page 1 / 1 Next

Semantically Similar Papers