Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes
Summary: Symphony enables NL queries over multi-modal data lakes (tables + text) without upfront integration, using cross-modality representation learning to locate relevant sources. It decomposes NL questions into per-source subqueries, executes them and fuses results; preliminarily validated on Wikipedia tables/text. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 18 of 18 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 221 | Deep Entity Matching with Pre-Trained Language Models | 2021 | VLDB | 0.00033121824 |
| 517 | Can Foundation Models Wrangle Your Data? | 2023 | VLDB | 0.00021169035 |
| 746 | Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores | 2020 | VLDB | 0.00017326979 |
| 984 | Natural language to SQL: Where are we today? | 2020 | VLDB | 0.00014857465 |
| 1,277 | The Data Civilizer System | 2017 | CIDR | 0.00012879695 |
| 2,349 | RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation | 2021 | VLDB | 8.9876423e-05 |
| 8,000 | Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics | 2019 | VLDB | 4.6092803e-05 |
Previous
Page 1 / 1
Next