Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks
Summary: nvBench is the first NL2VIS benchmark, synthesized from the NL2SQL benchmark Spider via an AST bridge that unifies SQL and visualization trees and supports multiple visualization languages. Synthesis reduces benchmark creation to 5.7% of the manual effort, and a seq2vis model trained on nvBench outperforms prior NL2VIS methods, with extensive validation.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 6152
- Venue: SIGMOD
- Year: 2021
- Pagerank: 5.8946721e-05
- Overall Rank: 4,825 | 66.44%
- DOI: 10.1145/3448016.3457261
Incoming Citations (Sorted by Pagerank)
Showing 13 of 13 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,662 | The Dawn of Natural Language to SQL: Are We Fully Ready? | 2024 | VLDB | 6.8672143e-05 |
| 3,970 | HAIChart: Human and AI Paired Visualization System | 2024 | VLDB | 6.5784767e-05 |
| 4,102 | GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data | 2023 | SIGMOD | 6.4522929e-05 |
| 5,381 | Selective Data Acquisition in the Wild for Model Charging | 2022 | VLDB | 5.5399508e-05 |
| 5,963 | Automatic Data Acquisition for Deep Learning | 2021 | VLDB | 5.2526794e-05 |
| 8,155 | Automated Data Visualization from Natural Language via Large Language Models: An Exploratory Study | 2024 | SIGMOD | 4.5745248e-05 |
| 8,268 | Learned Data-aware Image Representations of Line Charts for Similarity Search | 2023 | SIGMOD | 4.5456668e-05 |
| 8,281 | Optimizing Data Acquisition to Enhance Machine Learning Performance | 2024 | VLDB | 4.5435639e-05 |
| 9,829 | Sevi: Speech-to-Visualization through Neural Machine Translation | 2022 | SIGMOD | 4.2751057e-05 |
| 9,994 | BridgeScope: A Universal Toolkit for Bridging Large Language Models and Databases | 2026 | CIDR | 4.1945683e-05 |
| 10,185 | MultiVis-Agent: A Multi-Agent Framework with Logic Rules for Reliable and Comprehensive Cross-Modal Data Visualization | 2026 | SIGMOD | 4.1945683e-05 |
| 10,289 | LEAD: Iterative Data Selection for Efficient LLM Instruction Tuning | 2026 | VLDB | 4.1945683e-05 |
| 10,837 | Natural Language to SQL: State of the Art and Open Problems | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 206 | Constructing an Interactive Natural Language Interface for Relational Databases | 2015 | VLDB | 0.00034667032 |
| 460 | SeeDB: Efficient Data-Driven Visualization Recommendations to Support Visual Analytics | 2015 | VLDB | 0.00022516069 |
| 984 | Natural language to SQL: Where are we today? | 2020 | VLDB | 0.00014857465 |
| 991 | Effortless Data Exploration with zenvisage: An Expressive and Interactive Visual Analytics System | 2017 | VLDB | 0.00014807273 |
| 2,129 | IDEBench: A Benchmark for Interactive Data Exploration | 2020 | SIGMOD | 9.480002e-05 |
| 2,321 | DBPal: A Fully Pluggable NL2SQL Training Pipeline | 2020 | SIGMOD | 9.03609e-05 |
| 2,415 | VizQL: A Language for Query, Analysis and Visualization | 2006 | SIGMOD | 8.8639497e-05 |
| 5,484 | DeepEye: Creating Good Data Visualizations by Keyword Search | 2018 | SIGMOD | 5.4826544e-05 |
| 5,810 | Database Benchmarking for Supporting Real-Time Interactive Querying of Large Data | 2020 | SIGMOD | 5.3178017e-05 |
| 6,842 | Towards Democratizing Relational Data Visualization | 2019 | SIGMOD | 4.9103931e-05 |
| 7,575 | Human-in-the-loop Outlier Detection | 2020 | SIGMOD | 4.7068909e-05 |
| 9,221 | VisClean: Interactive Cleaning for Progressive Visualization | 2020 | VLDB | 4.3699444e-05 |
| 13,283 | DeepTrack: Monitoring and Exploring Spatio-Temporal Data – A Case of Tracking COVID-19 – | 2020 | VLDB | - |
Semantically Similar Papers