Back to papers
Enabling SQL-based Training Data Debugging for Federated Learning
Summary: Extends Rain-based SQL-based debugging to federated learning (FedRain) to prune mislabeled data causing unexpected model behavior. Then Frog refines the protocol for federated debugging, delivering security, accuracy, and efficiency over FedRain.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 12899
- Venue
- VLDB
- Year
- 2022
- Pagerank
- 5.6210545e-05
- Overall Rank
- 5,222 | 63.68%
- DOI
-
10.14778/3494124.3494125
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 18 of 18 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 31 |
Provenance Semirings |
2007 |
PODS |
0.0007857786 |
| 192 |
HoloClean: Holistic Data Repairs with Probabilistic Inference |
2017 |
VLDB |
0.00035728858 |
| 214 |
Scorpion: Explaining Away Outliers in Aggregate Queries |
2013 |
VLDB |
0.0003363692 |
| 791 |
ActiveClean: Interactive Data Cleaning For Statistical Modeling |
2016 |
VLDB |
0.00016629664 |
| 942 |
A Formal Approach to Finding Explanations for Database Queries |
2014 |
SIGMOD |
0.00015155714 |
| 1,106 |
Provenance for Aggregate Queries |
2011 |
PODS |
0.0001398766 |
| 1,143 |
Privacy Preserving Vertical Federated Learning for Tree-based Models |
2020 |
VLDB |
0.00013710269 |
| 1,337 |
HoloDetect: Few-Shot Learning for Error Detection |
2019 |
SIGMOD |
0.00012497164 |
| 1,371 |
Tiresias: The Database Oracle for How-To Queries |
2012 |
SIGMOD |
0.00012323502 |
| 1,420 |
Data Management Challenges in Production Machine Learning |
2017 |
SIGMOD |
0.00012057956 |
| 1,699 |
Sensitivity Analysis and Explanations for Robust Query Evaluation in Probabilistic Databases |
2011 |
SIGMOD |
0.00010858983 |
| 2,154 |
DIFF: A Relational Interface for Large-Scale Data Explanation |
2019 |
VLDB |
9.4208667e-05 |
| 2,402 |
Causality and Explanations in Databases |
2014 |
VLDB |
8.8928361e-05 |
| 2,649 |
Explaining Query Answers with Explanation-Ready Databases |
2016 |
VLDB |
8.3719123e-05 |
| 2,753 |
Complaint-driven Training Data Debugging for Query 2.0 |
2020 |
SIGMOD |
8.1724339e-05 |
| 2,968 |
Raha: A Configuration-Free Error Detection System |
2019 |
SIGMOD |
7.7985097e-05 |
| 3,299 |
SCODED: Statistical Constraint Oriented Data Error Detection |
2020 |
SIGMOD |
7.2546659e-05 |
| 6,817 |
Error Diagnosis and Data Profiling with Data X-Ray |
2015 |
VLDB |
4.9171711e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 13,153 |
From Zero to Hero: Detecting Leaked Data through Synthetic Data Injection and Model Querying |
2024 |
VLDB |
- |
| 4,290 |
FedTSC: A Secure Federated Learning System for Interpretable Time Series Classification |
2022 |
VLDB |
6.2885419e-05 |
| 8,747 |
A Blockchain System for Clustered Federated Learning with Peer-to-Peer Knowledge Transfer |
2024 |
VLDB |
4.456315e-05 |
| 8,651 |
FederatedScope: A Flexible Federated Learning Platform for Heterogeneity |
2023 |
VLDB |
4.4757309e-05 |
| 11,263 |
Federated Calibration and Evaluation of Binary Classifiers |
2023 |
VLDB |
4.1945683e-05 |
| 6,502 |
Falcon: A Privacy-Preserving and Interpretable Vertical Federated Learning System |
2023 |
VLDB |
5.0361846e-05 |
| 8,666 |
Contributions Estimation in Federated Learning: A Comprehensive Experimental Evaluation |
2024 |
VLDB |
4.471975e-05 |
| 8,853 |
Complaint-Driven Training Data Debugging at Interactive Speeds |
2022 |
SIGMOD |
4.4350727e-05 |
| 7,072 |
FedSQ: A Secure System for Federated Vector Similarity Queries |
2024 |
VLDB |
4.842703e-05 |
| 2,753 |
Complaint-driven Training Data Debugging for Query 2.0 |
2020 |
SIGMOD |
8.1724339e-05 |