Back to papers
Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph
Summary: Auto-BI predicts BI models from input tables via k-Min-Cost-Arborescence (k‑MCA), merging local join prediction with a global schema-graph arborescence. k‑MCA proved intractable/inapproximable; authors provide practical optimal solvers (sub-second, ~100 tables) and validate on 100K real BI models + TPC benchmarks with >0.9 F1.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 13104
- Venue
- VLDB
- Year
- 2023
- Pagerank
- 4.3341665e-05
- Overall Rank
- 9,490 | 33.98%
- DOI
-
10.14778/3603581.3603596
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 13 of 13 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 42 |
A Comparison of Approaches to Large-Scale Data Analysis |
2009 |
SIGMOD |
0.00073498298 |
| 98 |
XMark: A Benchmark for XML Data Management |
2002 |
VLDB |
0.00050023808 |
| 104 |
Inclusion dependencies and their interaction with functional dependencies (Extended abstract) |
1982 |
PODS |
0.00048766186 |
| 1,510 |
Summarizing Relational Databases |
2009 |
VLDB |
0.00011606901 |
| 1,664 |
On Multi-Column Foreign Key Discovery |
2010 |
VLDB |
0.00010976887 |
| 2,415 |
VizQL: A Language for Query, Analysis and Visualization |
2006 |
SIGMOD |
8.8639497e-05 |
| 3,252 |
Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks |
2020 |
SIGMOD |
7.3178277e-05 |
| 3,328 |
Multi-column Substring Matching for Database Schema Translation |
2006 |
VLDB |
7.2174278e-05 |
| 3,735 |
Auto-Join: Joining Tables by Leveraging Transformations |
2017 |
VLDB |
6.8061318e-05 |
| 4,784 |
Divide & Conquer-based Inclusion Dependency Discovery |
2015 |
VLDB |
5.9240851e-05 |
| 4,850 |
SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora |
2015 |
VLDB |
5.8768452e-05 |
| 5,434 |
Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples |
2021 |
SIGMOD |
5.5045402e-05 |
| 5,486 |
Fast Foreign-Key Detection in Microsoft SQL Server PowerPivot for Excel |
2014 |
VLDB |
5.4811603e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 9,614 |
Auto-Approximation of Graph Computing |
2014 |
VLDB |
4.3177432e-05 |
| 3,252 |
Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks |
2020 |
SIGMOD |
7.3178277e-05 |
| 5,304 |
A Scalable AutoML Approach Based on Graph Neural Networks |
2022 |
VLDB |
5.5779335e-05 |
| 4,739 |
AutoTQA: Towards Autonomous Tabular Question Answering through Multi-Agent Large Language Models |
2024 |
VLDB |
5.959592e-05 |
| 9,886 |
Scalable and Usable Relational Learning With Automatic Language Bias |
2021 |
SIGMOD |
4.2621158e-05 |
| 5,275 |
Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples |
2023 |
VLDB |
5.5905507e-05 |
| 5,096 |
Auto-Transform: Learning-to-Transform by Patterns |
2020 |
VLDB |
5.7011825e-05 |
| 5,383 |
Auto-Pipeline: Synthesizing Complex Data Pipelines By-Target Using Reinforcement Learning and Search |
2021 |
VLDB |
5.5393038e-05 |
| 3,735 |
Auto-Join: Joining Tables by Leveraging Transformations |
2017 |
VLDB |
6.8061318e-05 |
| 10,598 |
Auto-Prep: Holistic Prediction of Data Preparation Steps for Self-Service Business Intelligence |
2025 |
VLDB |
4.1945683e-05 |