ScienceBenchmark: A Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems
Summary: ScienceBenchmark: NL-to-SQL benchmark on three complex, domain-specific scientific databases with expert-curated NL/SQL pairs. Augments limited human data with GPT-3 syntheses and shows top Spider models fail, stressing need for domain-aware, data-efficient NL-to-SQL systems. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 206 | Constructing an Interactive Natural Language Interface for Relational Databases | 2015 | VLDB | 0.00034667032 |
| 535 | ATHENA: An Ontology-Driven System for Natural Language Querying over Relational Data Stores | 2016 | VLDB | 0.00020727678 |
| 567 | NaLIR: An Interactive Natural Language Interface for Querying Relational Databases | 2014 | SIGMOD | 0.00019966681 |
| 1,168 | SODA: Generating SQL for Business Users | 2012 | VLDB | 0.00013541143 |
| 1,591 | The SDSS SkyServer - Public Access to the Sloan Digital Sky Survey Data | 2002 | SIGMOD | 0.00011226338 |
| 2,321 | DBPal: A Fully Pluggable NL2SQL Training Pipeline | 2020 | SIGMOD | 9.03609e-05 |
| 3,501 | MT-TeQL: Evaluating and Augmenting Neural NLIDB on Real-world Linguistic and Schema Variations | 2022 | VLDB | 7.0366785e-05 |
Previous
Page 1 / 1
Next