Tailoring Data Source Distributions for Fairness-aware Data Integration
Summary: Tailors the mix of data sources so that the integrated data satisfies fairness-based target distributions under cost constraints. When source distributions are known and costs are equal, the approach yields optimal results; when distributions are unknown or costs are unequal, it relies on approximations that use exploration–exploitation and validation.
(summarized by gpt-5-nano on Feb 09 2026)
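To make the known-distribution, equal-cost setting in the summary concrete, here is a minimal sketch. It is not the paper's algorithm: the function name `tailor`, the dictionary layout, and the greedy selection rule are all assumptions made for illustration. Each draw costs one unit, and the loop always queries the source most likely to return a group that is still under its target count.

```python
import random

def tailor(sources, targets, rng, max_draws=100_000):
    """Draw one record at a time until every group's target count is met.

    sources: list of dicts mapping group -> probability (assumed known).
    targets: dict mapping group -> required number of records.
    Returns (collected counts, number of draws); with unit cost per draw,
    the number of draws is the total cost.
    """
    collected = {g: 0 for g in targets}
    draws = 0
    while any(collected[g] < targets[g] for g in targets):
        if draws >= max_draws:
            raise RuntimeError("draw budget exhausted before targets were met")
        needed = [g for g in targets if collected[g] < targets[g]]
        # Greedy rule (an illustrative assumption): pick the source with the
        # highest probability of returning a still-needed group.
        best = max(sources, key=lambda s: sum(s.get(g, 0.0) for g in needed))
        groups = list(best)
        group = rng.choices(groups, weights=[best[g] for g in groups])[0]
        draws += 1
        # Keep the record only if its group is still under target.
        if group in collected and collected[group] < targets[group]:
            collected[group] += 1
    return collected, draws

# Hypothetical example: two sources with opposite skews, 20 records per group.
sources = [{"a": 0.9, "b": 0.1}, {"a": 0.3, "b": 0.7}]
targets = {"a": 20, "b": 20}
collected, cost = tailor(sources, targets, random.Random(0))
```

When the source distributions are unknown, the `best` selection would have to be driven by estimates learned from earlier draws, which is where the exploration–exploitation trade-off mentioned in the summary comes in.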
- Paper ID: 12426
- Venue: VLDB
- Year: 2021
- Pagerank: 5.0528156e-05
- Overall Rank: 6,467 | 55.02%
- DOI: 10.14778/3476249.3476299
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,018 | Through the Fairness Lens: Experimental Analysis and Evaluation of Entity Matching | 2023 | VLDB | 6.5244015e-05 |
| 5,024 | Towards Distribution-aware Query Answering in Data Markets | 2022 | VLDB | 5.7535043e-05 |
| 5,381 | Selective Data Acquisition in the Wild for Model Charging | 2022 | VLDB | 5.5399508e-05 |
| 5,976 | Responsible Data Integration: Next-generation Challenges | 2022 | SIGMOD | 5.245976e-05 |
| 9,644 | Fair and Actionable Causal Prescription Ruleset | 2025 | SIGMOD | 4.3109001e-05 |
| 9,712 | Maximizing Fair Content Spread via Edge Suggestion in Social Networks | 2022 | VLDB | 4.299267e-05 |
| 9,928 | Fainder: A Fast and Accurate Index for Distribution-Aware Dataset Search | 2024 | VLDB | 4.2511622e-05 |
| 10,341 | A Theoretical Framework for Distribution-Aware Dataset Search | 2025 | PODS | 4.1945683e-05 |
| 10,617 | Deduplicated Sampling On-Demand | 2025 | VLDB | 4.1945683e-05 |
| 10,960 | FairHash: A Fair and Memory/Time-efficient Hashmap | 2024 | SIGMOD | 4.1945683e-05 |
| 11,068 | Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of Minorities | 2024 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 943 | Wander Join: Online Aggregation via Random Walks | 2016 | SIGMOD | 0.00015145883 |
| 1,463 | ARDA: Automatic Relational Data Augmentation for Machine Learning | 2020 | VLDB | 0.00011869295 |
| 1,597 | Designing Fair Ranking Schemes | 2019 | SIGMOD | 0.00011209846 |
| 2,259 | MithraCoverage: A System for Investigating Population Bias for Intersectional Fairness | 2020 | SIGMOD | 9.167331e-05 |
| 4,526 | Responsible Data Science | 2019 | SIGMOD | 6.1092845e-05 |
| 5,555 | On Obtaining Stable Rankings | 2019 | VLDB | 5.4386174e-05 |
| 6,892 | Identifying Insufficient Data Coverage for Ordinal Continuous-Valued Attributes | 2021 | SIGMOD | 4.8925683e-05 |
| 7,685 | Fairly Evaluating and Scoring Items in a Data Set | 2020 | VLDB | 4.6788921e-05 |
| 7,714 | Identifying Insufficient Data Coverage in Databases with Multiple Relations | 2020 | VLDB | 4.6700455e-05 |
| 8,129 | Discovering the Skyline of Web Databases | 2016 | VLDB | 4.5784968e-05 |
| 11,883 | Query Reranking As A Service | 2016 | VLDB | 4.1945683e-05 |
| 13,297 | MithraRanking: A System for Responsible Ranking Design | 2019 | SIGMOD | - |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,041 | Interventional Fairness: Causal Database Repair for Algorithmic Fairness | 2019 | SIGMOD | 0.00014482047 |
| 7,490 | Models and Mechanisms for Spatial Data Fairness | 2023 | VLDB | 4.7180617e-05 |
| 9,246 | Happiness Maximizing Sets under Group Fairness Constraints | 2023 | VLDB | 4.3690661e-05 |
| 4,101 | Less is More: Selecting Sources Wisely for Integration | 2013 | VLDB | 6.4523909e-05 |
| 6,354 | Characterizing and Selecting Fresh Data Sources | 2014 | SIGMOD | 5.0990729e-05 |
| 7,046 | Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification | 2022 | SIGMOD | 4.8525913e-05 |
| 3,750 | Data Acquisition for Improving Machine Learning Models | 2021 | VLDB | 6.7895763e-05 |
| 10,223 | On Fair Epsilon Net and Geometric Hitting Set | 2026 | VLDB | 4.1945683e-05 |
| 7,602 | Causal Feature Selection for Algorithmic Fairness | 2022 | SIGMOD | 4.6988081e-05 |
| 10,961 | Faster Algorithms for Fair Max-Min Diversification in R^d | 2024 | SIGMOD | 4.1945683e-05 |