Back to papers
Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of Minorities
Summary: Chameleon leverages foundation models to generate minimal, targeted multi-modal synthetic tuples to boost coverage of under-represented groups. It couples prompt-guidance strategies with quality and outlier-detection filters to preserve semantic integrity and significantly reduce downstream unfairness.
(summarized by gpt-5-mini on Feb 09 2026)
- Paper ID
- 13557
- Venue
- VLDB
- Year
- 2024
- Pagerank
- 4.1945683e-05
- Overall Rank
- 11,068 | 23.01%
- DOI
-
10.14778/3681954.3682014
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 1,041 |
Interventional Fairness : Causal Database Repair for Algorithmic Fairness |
2019 |
SIGMOD |
0.00014482047 |
| 1,116 |
Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes |
2024 |
VLDB |
0.00013890154 |
| 4,018 |
Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching |
2023 |
VLDB |
6.5244015e-05 |
| 4,884 |
Relational Data Synthesis using Generative Adversarial Networks: A Design Space Exploration |
2020 |
VLDB |
5.8540287e-05 |
| 5,349 |
PrivLava: Synthesizing Relational Data with Foreign Keys under Differential Privacy |
2023 |
SIGMOD |
5.553869e-05 |
| 5,509 |
Can Large Language Models Predict Data Correlations from Column Names? |
2023 |
VLDB |
5.4703368e-05 |
| 5,777 |
ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection |
2024 |
VLDB |
5.3308813e-05 |
| 5,976 |
Responsible Data Integration: Next-generation Challenges |
2022 |
SIGMOD |
5.245976e-05 |
| 6,467 |
Tailoring Data Source Distributions for Fairness-aware Data Integration |
2021 |
VLDB |
5.0528156e-05 |
| 6,737 |
Demonstrating GPT-DB: Generating Query-Specific and Customizable Code for SQL Processing with GPT-4 |
2023 |
VLDB |
4.9457488e-05 |
| 6,892 |
Identifying Insufficient Data Coverage for Ordinal Continuous-Valued Attributes |
2021 |
SIGMOD |
4.8925683e-05 |
| 7,714 |
Identifying Insufficient Data Coverage in Databases with Multiple Relations |
2020 |
VLDB |
4.6700455e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 13,101 |
Interactive Fairness Auditing: Leveraging AVOIR for Dynamic Evaluation and Mitigation |
2025 |
SIGMOD |
- |
| 7,046 |
Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification |
2022 |
SIGMOD |
4.8525913e-05 |
| 8,847 |
Towards Foundation Database Models |
2025 |
CIDR |
4.4371897e-05 |
| 10,555 |
Mining the Minoria: Unknown, Under-represented, and Under-performing Minority Groups |
2025 |
VLDB |
4.1945683e-05 |
| 8,175 |
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models |
2025 |
VLDB |
4.5676289e-05 |
| 7,602 |
Causal Feature Selection for Algorithmic Fairness |
2022 |
SIGMOD |
4.6988081e-05 |
| 1,041 |
Interventional Fairness : Causal Database Repair for Algorithmic Fairness |
2019 |
SIGMOD |
0.00014482047 |
| 9,365 |
Falcon: Fair Active Learning using Multi-armed Bandits |
2024 |
VLDB |
4.3502315e-05 |
| 4,935 |
OmniFair: A Declarative System for Model-Agnostic Group Fairness in Machine Learning |
2021 |
SIGMOD |
5.8198727e-05 |
| 4,769 |
Automated Feature Engineering for Algorithmic Fairness |
2021 |
VLDB |
5.934329e-05 |