Database Paper Browser

Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms

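Summary: Data valuation for ML training data via Shapley values. For unweighted KNN, the exact Shapley values of all N points are computed in O(N log N) time, an exponential improvement over the naive O(2^N) enumeration. LSH-based (epsilon, delta)-approximations run in sublinear time O(N^h(epsilon,K) log N), where h(epsilon,K) < 1. Extensions cover weighted KNN, data grouped by multiple curators, and a Monte Carlo approximation that is O(N (log N)^2 / (log K)^2) times more efficient than the baseline approximation. Experiments scale to 10M points. (summarized by gpt-5-nano on Feb 09 2026)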
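
The O(N log N) exact computation mentioned in the summary can be illustrated with a minimal sketch for a single test point. It assumes the utility of a training subset is the fraction of the K nearest neighbors whose label matches the test label, and it uses the sort-then-recurse scheme usually associated with this result. The function name knn_shapley_single and its parameters are hypothetical, chosen for illustration only, and are not taken from the paper's code.

```python
def knn_shapley_single(train_x, train_y, test_x, test_y, k):
    """Exact Shapley values of all training points for one test point
    under an unweighted KNN utility (illustrative sketch, assumes n >= 1)."""
    n = len(train_x)
    # Sort training indices by squared Euclidean distance to the test point.
    # This sort is the O(N log N) step; the recursion below is O(N).
    order = sorted(
        range(n),
        key=lambda j: sum((a - b) ** 2 for a, b in zip(train_x[j], test_x)),
    )
    # match[r] is 1.0 if the point at (0-indexed) rank r has the test label.
    match = [1.0 if train_y[j] == test_y else 0.0 for j in order]

    values = [0.0] * n
    # Base case: the farthest point.
    s = match[-1] / n
    values[order[-1]] = s
    # Walk from the second-farthest point to the nearest one.
    # i is the 1-indexed rank of the point currently being valued.
    for i in range(n - 1, 0, -1):
        s = s + (match[i - 1] - match[i]) / k * min(k, i) / i
        values[order[i - 1]] = s
    return values


# Usage: three 1-D training points, K = 1, test point at 0 with label 1.
vals = knn_shapley_single([(0.1,), (0.5,), (0.9,)], [1, 0, 1], (0.0,), 1, k=1)
print(vals)
```

In this toy case the values sum to the utility of the full training set (1.0, since the nearest neighbor has the correct label), which is the efficiency property a Shapley value should satisfy. For several test points, one would average the per-test-point values.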

Paper ID: 11851
Venue: VLDB
Year: 2019
Pagerank: 0.00012758104
Overall Rank: 1,298 | 90.98%
DOI: 10.14778/3342263.3342637

Incoming Citations (Sorted by Pagerank)

Showing 23 of 23 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
2,302 | Nearest Neighbor Classifiers over Incomplete Information: From Certain Answers to Certain Predictions | 2021 | VLDB | 9.0668832e-05
2,359 | Data Market Platforms: Trading Data Assets to Solve Data Problems | 2020 | VLDB | 8.9607667e-05
3,836 | Dealer: An End-to-End Model Marketplace with Differential Privacy | 2021 | VLDB | 6.7153977e-05
4,138 | Protecting Data Markets from Strategic Buyers | 2022 | SIGMOD | 6.4175758e-05
4,753 | Secure Shapley Value for Cross-Silo Federated Learning | 2023 | VLDB | 5.9469115e-05
4,863 | Data-Sharing Markets: Model, Protocol, and Algorithms to Incentivize the Formation of Data-Sharing Consortia | 2023 | SIGMOD | 5.8697471e-05
6,262 | Fast Shapley Value Computation in Data Assemblage Tasks as Cooperative Simple Games | 2024 | SIGMOD | 5.1349507e-05
6,263 | Equitable Data Valuation Meets the Right to Be Forgotten in Model Markets | 2023 | VLDB | 5.1349507e-05
6,723 | On Shapley Value in Data Assemblage Under Independent Utility | 2022 | VLDB | 4.9490816e-05
7,380 | Efficient Sampling Approaches to Shapley Value Approximation | 2023 | SIGMOD | 4.746272e-05
7,932 | P-Shapley: Shapley Values on Probabilistic Classifiers | 2024 | VLDB | 4.613363e-05
8,053 | Demonstration of Dealer: An End-to-End Model Marketplace with Differential Privacy | 2021 | VLDB | 4.5950705e-05
8,114 | mlwhatif: What If You Could Stop Re-Implementing Your Machine Learning Pipeline Analyses Over and Over? | 2023 | VLDB | 4.5823351e-05
8,257 | Automating and Optimizing Data-Centric What-If Analyses on Native Machine Learning Pipelines | 2023 | SIGMOD | 4.5487511e-05
8,281 | Optimizing Data Acquisition to Enhance Machine Learning Performance | 2024 | VLDB | 4.5435639e-05
8,666 | Contributions Estimation in Federated Learning: A Comprehensive Experimental Evaluation | 2024 | VLDB | 4.471975e-05
10,107 | Reliable and Private Utility Signaling for Data Markets | 2026 | SIGMOD | 4.1945683e-05
10,392 | Shapley Value Estimation Based on Differential Matrix | 2025 | SIGMOD | 4.1945683e-05
10,533 | WeShap: Weak Supervision Source Evaluation with Shapley Values | 2025 | VLDB | 4.1945683e-05
10,686 | PS-MI: Accurate, Efficient, and Private Data Valuation in Vertical Federated Learning | 2025 | VLDB | 4.1945683e-05
10,816 | mlidea: Interactively Improving ML Data Preparation Code via "Shadow Pipelines" | 2025 | VLDB | 4.1945683e-05
11,003 | Performance-Based Pricing for Federated Learning via Auction | 2024 | VLDB | 4.1945683e-05
11,431 | Ease.ML: A Lifecycle Management System for MLDev and MLOps | 2021 | CIDR | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 8 of 8 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
34 | Similarity Search in High Dimensions via Hashing | 1999 | VLDB | 0.00076637636
1,771 | On Arbitrage-free Pricing for General Data Queries | 2014 | VLDB | 0.00010617356
2,743 | Toward Practical Query Pricing with QueryMarket | 2013 | SIGMOD | 8.1897331e-05
2,820 | Price-Optimal Querying with Data APIs | 2016 | VLDB | 8.062913e-05
4,477 | How to Price Shared Optimizations in the Cloud | 2012 | VLDB | 6.1509882e-05
5,800 | QueryMarket Demonstration: Pricing for Online Data Markets | 2012 | VLDB | 5.3211601e-05
6,344 | QIRANA Demonstration: Real Time Scalable Query Pricing | 2017 | VLDB | 5.1023673e-05
7,044 | A Demonstration of Sterling: A Privacy-Preserving Data Marketplace | 2018 | VLDB | 4.8529797e-05

Semantically Similar Papers