Database Paper Browser

Back to papers

Understanding the Black Box: A Deep Empirical Dive into Shapley Value Approximations for Tabular Data

Summary: Shapley value approximations for tabular data: empirical eval. Evaluates 8 replacement and 17 estimation strategies across 200 datasets; model-based methods beat model-agnostic ones in accuracy and speed, while sampling-based approaches miss interactions. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
7306
Venue
SIGMOD
Year
2025
Pagerank
4.1945683e-05
Overall Rank
10,524 | 26.79%
DOI
10.1145/3725420

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 3 of 3 citing papers.

Rank Citing Paper Year Venue Pagerank
9,599 SPARTAN: Data-Adaptive Symbolic Time-Series Approximation 2025 SIGMOD 4.3177432e-05
10,466 A Structured Study of Multivariate Time-Series Distance Measures 2025 SIGMOD 4.1945683e-05
13,110 ShapX Engine: A Demonstration of Shapley Value Approximations 2025 SIGMOD -
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 33 of 33 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,516 k-Shape: Efficient and Accurate Clustering of Time Series 2015 SIGMOD 0.00011586255
1,867 Interpretable Data-Based Explanations for Fairness Debugging 2022 SIGMOD 0.00010272055
2,029 SAND: Streaming Subsequence Anomaly Detection 2021 VLDB 9.740868e-05
2,381 TSB-UAD: An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection 2022 VLDB 8.9327638e-05
2,613 Decomposed Bounded Floats for Fast Compression and Queries 2021 VLDB 8.4503824e-05
2,868 Computing the Shapley Value of Facts in Query Answering 2022 SIGMOD 7.9816425e-05
2,923 Explaining Black-Box Algorithms Using Probabilistic Contrastive Counterfactuals 2021 SIGMOD 7.8953538e-05
3,162 Looking for Trouble: Analyzing Classifier Behavior via Pattern Divergence 2021 SIGMOD 7.4589576e-05
3,943 Volume Under the Surface: A New Accuracy Evaluation Measure for Time-Series Anomaly Detection 2022 VLDB 6.6099833e-05
4,059 GRAIL: Efficient Time-Series Representation Learning 2019 VLDB 6.4854417e-05
4,079 Choose Wisely: An Extensive Evaluation of Model Selection for Anomaly Detection in Time Series 2023 VLDB 6.4663636e-05
4,591 From Shapley Value to Model Counting and Back 2024 PODS 6.0619399e-05
4,853 Debunking Four Long-Standing Misconceptions of Time-Series Distance Measures 2020 SIGMOD 5.8760276e-05
5,959 Expected Shapley-Like Scores of Boolean Functions: Complexity and Applications to Probabilistic Databases 2024 PODS 5.2562342e-05
5,976 Responsible Data Integration: Next-generation Challenges 2022 SIGMOD 5.245976e-05
6,055 When is Shapley Value Computation a Matter of Counting? 2024 PODS 5.2324399e-05
6,262 Fast Shapley Value Computation in Data Assemblage Tasks as Cooperative Simple Games 2024 SIGMOD 5.1349507e-05
6,263 Equitable Data Valuation Meets the Right to Be Forgotten in Model Markets 2023 VLDB 5.1349507e-05
6,311 VergeDB: A Database for IoT Analytics on Edge Devices 2021 CIDR 5.1161316e-05
6,367 Good to the Last Bit: Data-Driven Encoding with CodecDB 2021 SIGMOD 5.0941072e-05
6,723 On Shapley Value in Data Assemblage Under Independent Utility 2022 VLDB 4.9490816e-05
7,000 Generating Interpretable Data-Based Explanations for Fairness Debugging using Gopher 2022 SIGMOD 4.8676312e-05
7,321 Counterfactual Explanation of Shapley Value in Data Coalitions 2024 VLDB 4.7629325e-05
7,380 Efficient Sampling Approaches to Shapley Value Approximation 2023 SIGMOD 4.746272e-05
8,088 PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage 2020 VLDB 4.5897316e-05
9,294 Theseus: Navigating the Labyrinth of Time-Series Anomaly Detection 2022 VLDB 4.3608061e-05
9,329 Odyssey: An Engine Enabling The Time-Series Clustering Journey 2023 VLDB 4.3556432e-05
9,599 SPARTAN: Data-Adaptive Symbolic Time-Series Approximation 2025 SIGMOD 4.3177432e-05
10,466 A Structured Study of Multivariate Time-Series Distance Measures 2025 SIGMOD 4.1945683e-05
11,094 Time-Series Anomaly Detection: Overview and New Trends 2024 VLDB 4.1945683e-05
11,235 Accelerating Similarity Search for Elastic Measures: A Study and New Generalization of Lower Bounding Distances 2023 VLDB 4.1945683e-05
13,110 ShapX Engine: A Demonstration of Shapley Value Approximations 2025 SIGMOD -
13,261 SAND in Action: Subsequence Anomaly Detection for Streams 2021 VLDB -
Previous Page 1 / 1 Next

Semantically Similar Papers