The Pneuma Project: Reifying Information Needs as Relational Schemas to Automate Discovery, Guide Preparation, and Align Data with Intent
Summary: Reifies evolving information needs as relational schemas to iteratively steer LLM-powered discovery and data-prep toward fit-for-purpose artifacts. Combines context-specialization, a conductor-style planner, and shared-state convergence with RAG/agentic integration to automate discovery, guide preparation, and record institutional knowledge. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 185 | DuckDB: an Embeddable Analytical Database | 2019 | SIGMOD | 0.00036538405 |
| 1,872 | ReAcTable: Enhancing ReAct for Table Question Answering | 2024 | VLDB | 0.00010259702 |
| 2,359 | Data Market Platforms: Trading Data Assets to Solve Data Problems | 2020 | VLDB | 8.9607667e-05 |
| 3,995 | How Large Language Models Will Disrupt Data Management | 2023 | VLDB | 6.5513237e-05 |
| 6,217 | Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System | 2025 | SIGMOD | 5.1534752e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,800 | Unify: A System For Unstructured Data Analytics | 2025 | VLDB | 4.1945683e-05 |
| 10,443 | LLM-Matcher: A Name-Based Schema Matching Tool using Large Language Models | 2025 | SIGMOD | 4.1945683e-05 |
| 10,455 | Sentence to Model: Cost-Effective Data Collection LLM Agent | 2025 | SIGMOD | 4.1945683e-05 |
| 3,876 | The Design of an LLM-powered Unstructured Analytics System | 2025 | CIDR | 6.6741456e-05 |
| 9,449 | An Interactive Multi-modal Query Answering System with Retrieval-Augmented Large Language Models | 2024 | VLDB | 4.3399593e-05 |
| 3,840 | Revisiting Prompt Engineering via Declarative Crowdsourcing | 2024 | CIDR | 6.7106924e-05 |
| 9,219 | Intelligent Agents for Data Exploration | 2024 | VLDB | 4.3702863e-05 |
| 10,973 | Unstructured Data Fusion for Schema and Data Extraction | 2024 | SIGMOD | 4.1945683e-05 |
| 7,020 | LLM for Data Management | 2024 | VLDB | 4.8595728e-05 |
| 6,217 | Pneuma: Leveraging LLMs for Tabular Data Representation and Retrieval in an End-to-End System | 2025 | SIGMOD | 5.1534752e-05 |