Database Paper Browser

Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes

Summary: Symphony supports natural-language (NL) queries over multi-modal data lakes (tables and text) without upfront integration, using cross-modality representation learning to locate relevant sources. It decomposes each NL question into per-source subqueries, executes them, and fuses the results. The approach is preliminarily validated on Wikipedia tables and text. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID: 484
Venue: CIDR
Year: 2023
Pagerank: 0.00011456579
Overall Rank: 1,541 | 89.29%
DOI: -

Authors

Incoming Citations (Sorted by Pagerank)

Showing 18 of 18 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
1,082 | CAESURA: Language Models as Multi-Modal Query Planners | 2024 | CIDR | 0.00014214232
1,116 | Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes | 2024 | VLDB | 0.00013890154
4,212 | Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration | 2023 | SIGMOD | 6.3555142e-05
4,238 | Panda: Performance Debugging for Databases using LLM Agents | 2024 | CIDR | 6.331901e-05
4,908 | Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL | 2024 | VLDB | 5.8339245e-05
5,214 | ThalamusDB: Approximate Query Processing on Multi-Modal Data | 2024 | SIGMOD | 5.624434e-05
5,462 | RetClean: Retrieval-Based Data Cleaning Using LLMs and Data Lakes | 2024 | VLDB | 5.494769e-05
5,840 | Logical and Physical Optimizations for SQL Query Execution over Large Language Models | 2025 | SIGMOD | 5.3042561e-05
7,705 | AOP: Automated and Interactive LLM Pipeline Orchestration for Answering Complex Queries | 2025 | CIDR | 4.6730494e-05
8,204 | ELEET: Efficient Learned Query Execution over Text and Tables | 2024 | VLDB | 4.5594273e-05
8,716 | nsDB: Architecting the Next Generation Database by Integrating Neural and Symbolic Systems | 2024 | VLDB | 4.4618187e-05
9,077 | VerifAI: Verified Generative AI | 2024 | CIDR | 4.4010762e-05
9,152 | Doctopus: Budget-aware Structural Table Extraction from Unstructured Documents | 2025 | VLDB | 4.3849295e-05
9,972 | KathDB: Explainable Multimodal Database Management System with Human-AI Collaboration | 2026 | CIDR | 4.1945683e-05
10,455 | Sentence to Model: Cost-Effective Data Collection LLM Agent | 2025 | SIGMOD | 4.1945683e-05
10,711 | Cracking Vector Search Indexes | 2025 | VLDB | 4.1945683e-05
10,752 | QUEST: Query Optimization in Unstructured Document Analysis | 2025 | VLDB | 4.1945683e-05
10,800 | Unify: A System For Unstructured Data Analytics | 2025 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers