Adaptive Sampling for Rapidly Matching Histograms
Summary: FastMatch is an end-to-end system that uses adaptive sampling to interactively retrieve the histograms most similar to a user-specified target. HistSim is a probabilistic, sampling-based algorithm for top-k histogram matching under L1 distance, used with asynchronous block-based sampling. Reported speedups reach 35× with near-perfect accuracy compared with brute-force matching. (summarized by gpt-5-nano on Feb 09 2026)
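To make the idea of sampling-based top-k L1 matching concrete, here is a minimal Python sketch. It is not HistSim or FastMatch: it draws one uniform sample instead of adaptive, block-based samples, and it gives no probabilistic guarantees. The function names (`normalize`, `l1_distance`, `approx_top_k`) and the `(candidate, bin)` row layout are assumptions made for illustration only.

```python
import random
from collections import Counter, defaultdict


def normalize(counts, bins):
    """Turn raw bin counts into a probability vector over `bins`."""
    total = sum(counts.get(b, 0) for b in bins)
    if total == 0:
        return [0.0] * len(bins)
    return [counts.get(b, 0) / total for b in bins]


def l1_distance(p, q):
    """L1 distance between two equal-length vectors."""
    return sum(abs(a - b) for a, b in zip(p, q))


def approx_top_k(rows, target, bins, k, sample_size, seed=0):
    """Estimate the k candidates whose histograms are closest to `target`.

    rows: list of (candidate, bin_label) pairs (hypothetical layout).
    target: dict mapping bin_label -> count or weight.
    """
    rng = random.Random(seed)
    # One uniform sample without replacement (a simplification; the
    # summarized system samples adaptively and in blocks).
    sample = rng.sample(rows, min(sample_size, len(rows)))

    # Build one estimated histogram per candidate from the sample.
    per_candidate = defaultdict(Counter)
    for candidate, bin_label in sample:
        per_candidate[candidate][bin_label] += 1

    target_dist = normalize(target, bins)
    scored = sorted(
        (l1_distance(normalize(counts, bins), target_dist), name)
        for name, counts in per_candidate.items()
    )
    return [name for _, name in scored[:k]]
```

With `sample_size` equal to the number of rows, the sketch reduces to exact brute-force matching; smaller samples trade accuracy for speed, which is the trade-off an adaptive scheme tries to control.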
Authors
- 1. Stephen Macke
- 2. Yiming Zhang
- 3. Silu Huang
- 4. Aditya Parameswaran
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,350 | Northstar: An Interactive Data Science System | 2018 | VLDB | 0.00012431059 |
| 1,427 | Towards Scalable Dataframe Systems | 2020 | VLDB | 0.0001204248 |
| 2,825 | Smile: A System to Support Machine Learning on EEG Data at Scale | 2019 | VLDB | 8.0563426e-05 |
| 4,468 | Comprehensive and Efficient Workload Compression | 2021 | VLDB | 6.1584035e-05 |
| 9,850 | COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics | 2021 | VLDB | 4.2721228e-05 |
| 10,685 | LakeVisage: Towards Scalable, Flexible and Interactive Visualization Recommendation for Data Discovery over Data Lakes | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 26 of 26 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,706 | Extensible and Robust Evaluation of Similarity Queries | 2025 | VLDB | 4.1945683e-05 |
| 996 | Approximating Multi-Dimensional Aggregate Range Queries Over Real Attributes | 2000 | SIGMOD | 0.00014741524 |
| 1,797 | Effective Use of Block-Level Sampling in Statistics Estimation | 2004 | SIGMOD | 0.00010523169 |
| 3,619 | Fast Algorithms For Hierarchical Range Histogram Construction | 2002 | PODS | 6.9084829e-05 |
| 471 | FastMap: A Fast Algorithm for Indexing, Data-Mining and Visualization of Traditional and Multimedia Datasets | 1995 | SIGMOD | 0.00022364776 |
| 269 | Fast Incremental Maintenance of Approximate Histograms | 1997 | VLDB | 0.00029656549 |
| 361 | Histogram-Based Approximation of Set-Valued Query Answers | 1999 | VLDB | 0.00025775749 |
| 852 | Dynamic Multidimensional Histograms | 2002 | SIGMOD | 0.00015941524 |
| 2,011 | Rapid Sampling for Visualizations with Ordering Guarantees | 2015 | VLDB | 9.7964875e-05 |
| 5,879 | Fast and Near–Optimal Algorithms for Approximating Distributions by Histograms | 2015 | PODS | 5.2908101e-05 |