BABOONS: Black-Box Optimization of Data Summaries in Natural Language
Summary: BABOONS treats the generation of natural-language data summaries as a black-box optimization problem under a user-defined utility function. It uses reinforcement learning with LLM scoring, and the utility can be specified in natural language or through a domain model. Evaluation is made scalable through proactive query merging, sampling, and batching.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 12780
- Venue: VLDB
- Year: 2022
- Pagerank: 4.1945683e-05
- Overall Rank: 11,384 | 20.81%
- DOI: 10.14778/3551793.3551846
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 14 | Online Aggregation | 1997 | SIGMOD | 0.0010801504 |
| 214 | Scorpion: Explaining Away Outliers in Aggregate Queries | 2013 | VLDB | 0.0003363692 |
| 942 | A Formal Approach to Finding Explanations for Database Queries | 2014 | SIGMOD | 0.00015155714 |
| 943 | Wander Join: Online Aggregation via Random Walks | 2016 | SIGMOD | 0.00015145883 |
| 1,137 | User-adaptive exploration of multidimensional data | 2000 | VLDB | 0.00013730532 |
| 2,154 | DIFF: A Relational Interface for Large-Scale Data Explanation | 2019 | VLDB | 9.4208667e-05 |
| 3,546 | Extracting Top-K Insights from Multi-dimensional Data | 2017 | SIGMOD | 6.9870745e-05 |
| 3,819 | Promotion Analysis in Multi-Dimensional Space | 2009 | VLDB | 6.7299866e-05 |
| 5,107 | SeeDB: Automatically Generating Query Visualizations | 2014 | VLDB | 5.6925578e-05 |
| 7,094 | Data In, Fact Out: Automated Monitoring of Facts by FactWatcher | 2014 | VLDB | 4.8366704e-05 |
| 7,586 | Maverick: Discovering Exceptional Facts from Knowledge Graphs | 2018 | SIGMOD | 4.7036704e-05 |
| 11,646 | A Holistic Approach for Query Evaluation and Result Vocalization in Voice-Based OLAP | 2019 | SIGMOD | 4.1945683e-05 |
| 11,800 | Data Vocalization: Optimizing Voice Output of Relational Data | 2017 | VLDB | 4.1945683e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,217 | This is Going to Sound Crazy, But What If We Used Large Language Models to Boost Automatic Database Tuning Algorithms By Leveraging Prior History? We Will Find Better Configurations More Quickly Than Retraining From Scratch! | 2026 | SIGMOD | 4.1945683e-05 |
| 6,329 | Utility-Driven Graph Summarization | 2019 | VLDB | 5.1077685e-05 |
| 4,614 | Interactive Summarization and Exploration of Top Aggregate Query Answers | 2018 | VLDB | 6.0467204e-05 |
| 10,595 | Optimized Batch Prompting for Cost-effective LLMs | 2025 | VLDB | 4.1945683e-05 |
| 7,222 | Guided Exploration of Data Summaries | 2022 | VLDB | 4.797186e-05 |
| 9,219 | Intelligent Agents for Data Exploration | 2024 | VLDB | 4.3702863e-05 |
| 10,064 | Cut Costs, Not Accuracy: LLM-Powered Data Processing with Guarantees | 2026 | SIGMOD | 4.1945683e-05 |
| 9,521 | SuDocu: Summarizing Documents by Example | 2020 | VLDB | 4.3319585e-05 |
| 5,171 | Abacus: A Cost-Based Optimizer for Semantic Operator Systems | 2026 | VLDB | 5.6464993e-05 |
| 640 | Bao: Making Learned Query Optimization Practical | 2021 | SIGMOD | 0.00018759152 |