Efficient Sampling Approaches to Shapley Value Approximation
Summary: Treats Shapley value estimation as stratified sampling to accelerate Monte Carlo approximation for data-management tasks. Proposes a novel stratification design with Neyman and empirical Bernstein-based allocations, showing improved efficiency and accuracy on real and synthetic datasets. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Jiayao Zhang
2. Qiheng Sun
3. Jinfei Liu
4. Li Xiong
5. Jian Pei
6. Kui Ren
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,263 | Equitable Data Valuation Meets the Right to Be Forgotten in Model Markets | 2023 | VLDB | 5.1349507e-05 |
| 7,932 | P-Shapley: Shapley Values on Probabilistic Classifiers | 2024 | VLDB | 4.613363e-05 |
| 8,281 | Optimizing Data Acquisition to Enhance Machine Learning Performance | 2024 | VLDB | 4.5435639e-05 |
| 10,107 | Reliable and Private Utility Signaling for Data Markets | 2026 | SIGMOD | 4.1945683e-05 |
| 10,392 | Shapley Value Estimation Based on Differential Matrix | 2025 | SIGMOD | 4.1945683e-05 |
| 10,524 | Understanding the Black Box: A Deep Empirical Dive into Shapley Value Approximations for Tabular Data | 2025 | SIGMOD | 4.1945683e-05 |
| 10,686 | PS-MI: Accurate, Efficient, and Private Data Valuation in Vertical Federated Learning | 2025 | VLDB | 4.1945683e-05 |
| 10,981 | Enabling Adaptive Sampling for Intra-Window Join: Simultaneously Optimizing Quantity and Quality | 2024 | SIGMOD | 4.1945683e-05 |
| 11,119 | DataPrice: An Interactive System for Pricing Datasets in Data Marketplaces | 2024 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,298 | Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms | 2019 | VLDB | 0.00012758104 |
| 1,891 | Towards Model-based Pricing for Machine Learning in a Data Marketplace | 2019 | SIGMOD | 0.00010194092 |
| 2,868 | Computing the Shapley Value of Facts in Query Answering | 2022 | SIGMOD | 7.9816425e-05 |
| 3,836 | Dealer: An End-to-End Model Marketplace with Differential Privacy | 2021 | VLDB | 6.7153977e-05 |
| 4,138 | Protecting Data Markets from Strategic Buyers | 2022 | SIGMOD | 6.4175758e-05 |
| 6,549 | Demonstration of Nimbus: Model-based Pricing for Machine Learning in a Data Marketplace | 2019 | SIGMOD | 5.0175568e-05 |
| 8,053 | Demonstration of Dealer: An End-to-End Model Marketplace with Differential Privacy | 2021 | VLDB | 4.5950705e-05 |