Causal Data Integration
Summary: Defines Causal Data Integration (CDI): mining missing attributes from external sources and automatically building causal DAGs to enable causal inference over partial datasets. Presents a system architecture, key challenges and algorithms, and preliminary experiments that demonstrate the feasibility of recovering missing covariates and correcting mis-specified variable selection. (summarized by gpt-5-mini on Feb 09 2026)
Authors
1. Brit Youngmann
2. Michael Cafarella
3. Babak Salimi
4. Anna Zeng
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,077 | The Fast and the Private: Task-based Dataset Search | 2024 | CIDR | 5.2229324e-05 |
| 7,172 | Summarized Causal Explanations For Aggregate Views | 2024 | SIGMOD | 4.8114797e-05 |
| 9,644 | Fair and Actionable Causal Prescription Ruleset | 2025 | SIGMOD | 4.3109001e-05 |
| 9,871 | From Logs to Causal Inference: Diagnosing Large Systems | 2025 | VLDB | 4.2667743e-05 |
| 10,101 | Privacy-preserving and Verifiable Causal Prescriptive Analytics | 2026 | SIGMOD | 4.1945683e-05 |
| 10,147 | Causal Explanations for Disparate Trends: Where and Why? | 2026 | SIGMOD | 4.1945683e-05 |
| 10,581 | Causal DAG Summarization | 2025 | VLDB | 4.1945683e-05 |
| 10,715 | What If: Causal Analysis with Graph Databases | 2025 | VLDB | 4.1945683e-05 |
| 10,725 | Suna: Scalable Causal Confounder Discovery over Relational Data | 2025 | VLDB | 4.1945683e-05 |
| 10,954 | Counterfactual Explanation at Will, with Zero Privacy Leakage | 2024 | SIGMOD | 4.1945683e-05 |
| 11,054 | Enriching Relations with Additional Attributes for ER | 2024 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,427 | CausaLens: A System for Summarizing Causal DAGs | 2025 | SIGMOD | 4.1945683e-05 |
| 1,623 | Scalable Techniques for Mining Causal Structures | 1998 | VLDB | 0.00011102927 |
| 13,288 | Demonstration of Inferring Causality from Relational Databases with CaRL | 2020 | VLDB | - |
| 855 | Integrating Conflicting Data: The Role of Source Dependence | 2009 | VLDB | 0.00015906735 |
| 6,565 | Toward Interpretable and Actionable Data Analysis with Explanations and Causality | 2022 | VLDB | 5.0081626e-05 |
| 9,871 | From Logs to Causal Inference: Diagnosing Large Systems | 2025 | VLDB | 4.2667743e-05 |
| 1,449 | Causal Relational Learning | 2020 | SIGMOD | 0.0001193267 |
| 371 | A Bayesian Approach to Discovering Truth from Conflicting Sources for Data Integration | 2012 | VLDB | 0.00025389696 |
| 10,581 | Causal DAG Summarization | 2025 | VLDB | 4.1945683e-05 |
| 10,715 | What If: Causal Analysis with Graph Databases | 2025 | VLDB | 4.1945683e-05 |