OpenForge: Probabilistic Metadata Integration
Summary: Two-stage prior–posterior framework for metadata-concept alignment: fine-tuned LLMs and ensemble scorers produce priors over candidate relationships. A Markov Random Field refines them by maximizing joint assignment probability and encoding constraints (e.g., transitivity), achieving ≈25 F1 gain over GPT‑4. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Tianji Cong
- 2. Fatemeh Nargesian
- 3. Junjie Xing
- 4. H. V. Jagadish
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 15 of 15 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,785 | GalaxyWeaver: Autonomous Table-to-Graph Conversion and Schema Optimization with Large Language Models | 2025 | VLDB | 4.1945683e-05 |
| 2,420 | From Data Fusion to Knowledge Fusion | 2014 | VLDB | 8.8530994e-05 |
| 9,723 | Discovering and Ranking Semantic Associations over a Large RDF Metabase | 2004 | VLDB | 4.2958329e-05 |
| 3,631 | On-the-Fly Entity-Aware Query Processing in the Presence of Linkage | 2010 | VLDB | 6.9014378e-05 |
| 8,765 | Efficient Query Answering in Probabilistic RDF Graphs | 2011 | SIGMOD | 4.456315e-05 |
| 9,175 | Efficient Exploration of Interesting Aggregates in RDF Graphs | 2021 | SIGMOD | 4.383548e-05 |
| 7,026 | Mind the Data Gap: Bridging LLMs to Enterprise Data Integration | 2025 | CIDR | 4.8570811e-05 |
| 2,832 | Intensional Associations Between Data and Metadata | 2007 | SIGMOD | 8.050082e-05 |
| 8,917 | Data Lakes Empowered by Knowledge Graph Technologies | 2021 | SIGMOD | 4.427232e-05 |
| 1,178 | Table Union Search on Open Data | 2018 | VLDB | 0.00013468118 |