Database Paper Browser

Grep: A Graph Learning Based Database Partitioning System

Summary: Grep encodes data and queries as a graph (columns as nodes, query relations as edges, with weights derived from data diversity and joins) and uses graph neural networks to learn partitioning keys. An evaluation model predicts the performance of a partitioning without repartitioning. Deployed in a commercial database, Grep delivers a 68% throughput gain on 30K banking queries. (summarized by gpt-5-nano on Feb 09 2026)
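
Illustration: the graph encoding named in the summary can be pictured with a toy sketch. This is not the paper's implementation. It assumes that "data diversity" means a per-column distinct-value ratio used as a node weight, and that the join weight is a count of how often two columns are joined in the workload. The function name, column names, and numbers are all hypothetical, and the graph neural network stage is omitted.

```python
from collections import defaultdict


def build_column_graph(distinct_ratio, join_pairs):
    """Build a weighted column graph (illustrative only).

    distinct_ratio: column name -> assumed diversity score (node weight).
    join_pairs: (column, column) pairs, one per join seen in the workload.
    Returns (nodes, edges), where edges maps an unordered column pair
    to the number of times that pair was joined.
    """
    nodes = dict(distinct_ratio)
    edges = defaultdict(int)
    for left, right in join_pairs:
        # Sort the pair so (a, b) and (b, a) count as the same edge.
        edges[tuple(sorted((left, right)))] += 1
    return nodes, dict(edges)


# Hypothetical toy workload; none of these values come from the paper.
distinct_ratio = {
    "accounts.id": 1.0,
    "txns.account_id": 0.2,
    "txns.branch_id": 0.01,
    "branches.id": 1.0,
}
join_pairs = [
    ("txns.account_id", "accounts.id"),
    ("accounts.id", "txns.account_id"),
    ("txns.branch_id", "branches.id"),
]

nodes, edges = build_column_graph(distinct_ratio, join_pairs)
for pair, weight in sorted(edges.items()):
    print(pair, weight)
```

In this toy setup, a heavily joined column with a high diversity score would be a natural candidate partitioning key; how Grep actually scores candidates is described in the paper, not here.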

Paper ID: 6597
Venue: SIGMOD
Year: 2023
Pagerank: 4.5852201e-05
Overall Rank: 8,103 | 43.63%
DOI: 10.1145/3588948

Authors

Incoming Citations (Sorted by Pagerank)

Showing 6 of 6 citing papers.

Outgoing Citations (Sorted by Pagerank)

Showing 34 of 34 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
71 | How Good Are Query Optimizers, Really? | 2016 | VLDB | 0.00059038975
209 | Schism: a Workload-Driven Approach to Database Replication and Partitioning | 2010 | VLDB | 0.00034468292
285 | Automating Physical Database Design in a Parallel Database | 2002 | SIGMOD | 0.0002899128
661 | Database Tuning Advisor for Microsoft SQL Server 2005 | 2004 | VLDB | 0.00018481174
782 | QTune: A Query-Aware Database Tuning System with Deep Reinforcement Learning | 2019 | VLDB | 0.00016729063
806 | An End-to-End Learning-based Cost Estimator | 2020 | VLDB | 0.00016434274
1,477 | Fine-grained Partitioning for Aggressive Data Skipping | 2014 | SIGMOD | 0.00011770865
1,683 | Cardinality Estimation: An Experimental Survey | 2018 | VLDB | 0.00010922679
1,700 | Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads | 2016 | SIGMOD | 0.00010858865
1,902 | Black or White? How to Develop an AutoTuner for Memory-based Analytics | 2020 | SIGMOD | 0.00010157713
2,165 | Self-Tuning, GPU-Accelerated Kernel Density Models for Multidimensional Selectivity Estimation | 2015 | SIGMOD | 9.389622e-05
2,564 | Combating Web Spam with TrustRank | 2004 | VLDB | 8.5277793e-05
2,983 | Supporting Table Partitioning By Reference in Oracle | 2008 | SIGMOD | 7.7796493e-05
3,005 | Clay: Fine-Grained Adaptive Partitioning for General Database Schemas | 2017 | VLDB | 7.7303579e-05
3,076 | Learning a Partitioning Advisor for Cloud Databases | 2020 | SIGMOD | 7.6107677e-05
3,214 | Query Optimization Techniques for Partitioned Tables | 2011 | SIGMOD | 7.3661891e-05
3,248 | A Learned Query Rewrite System using Monte Carlo Tree Search | 2022 | VLDB | 7.3258782e-05
3,449 | Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation | 2022 | VLDB | 7.0824319e-05
3,473 | AI Meets Database: AI4DB and DB4AI | 2021 | SIGMOD | 7.062864e-05
3,580 | Query Performance Prediction for Concurrent Queries using Graph Embedding | 2020 | VLDB | 6.9500996e-05
3,727 | Cost-based or Learning-based? A Hybrid Query Optimizer for Query Plan Selection | 2022 | VLDB | 6.8141709e-05
3,821 | Locality-aware Partitioning in Parallel Database Systems | 2015 | SIGMOD | 6.7281515e-05
4,152 | openGauss: An Autonomous Database System | 2021 | VLDB | 6.4060406e-05
4,240 | Make Your Database System Dream of Electric Sheep: Towards Self-Driving Operation | 2021 | VLDB | 6.3318228e-05
4,399 | HUNTER: An Online Cloud Database Hybrid Tuning System for Personalized Requirements | 2022 | SIGMOD | 6.2225151e-05
4,543 | FACE: A Normalizing Flow based Cardinality Estimator | 2022 | VLDB | 6.1011198e-05
5,118 | AdaptDB: Adaptive Partitioning for Distributed Joins | 2017 | VLDB | 5.6820984e-05
5,371 | LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning | 2022 | SIGMOD | 5.5428776e-05
5,685 | Exact Cardinality Query Optimization with Bounded Execution Cost | 2019 | SIGMOD | 5.3717535e-05
6,456 | From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems | 2019 | SIGMOD | 5.0564619e-05
6,659 | Fast and Effective Distribution-Key Recommendation for Amazon Redshift | 2020 | VLDB | 4.9710856e-05
7,715 | Query Centric Partitioning and Allocation for Partially Replicated Database Systems | 2017 | SIGMOD | 4.6699261e-05
8,180 | Demonstrating UDO: A Unified Approach for Optimizing Transaction Code, Physical Design, and System Parameters via Reinforcement Learning | 2021 | SIGMOD | 4.5663204e-05
8,442 | SageDB: An Instance-Optimized Data Analytics System | 2022 | VLDB | 4.5120602e-05

Semantically Similar Papers