Building Wavelet Histograms on Large Data in MapReduce
Summary: Proposes exact and approximate wavelet histogram algorithms for MapReduce that reduce communication and running time compared with naive approaches. The algorithms are implemented in Hadoop and evaluated on a 16-node cluster with real and synthetic data, where they show large improvements. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Jeffrey Jestes
2. Ke Yi
3. Feifei Li
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Overall Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,737 | QuickSel: Quick Selectivity Learning with Mixture Models | 2020 | SIGMOD | 0.00010720294 |
| 2,674 | Minimal MapReduce Algorithms | 2013 | SIGMOD | 8.3328645e-05 |
| 3,703 | Multi-Query Optimization in MapReduce Framework | 2014 | VLDB | 6.8289978e-05 |
| 7,534 | Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams | 2022 | VLDB | 4.7180004e-05 |
| 8,948 | One Seed, Two Birds: A Unified Learned Structure for Exact and Approximate Counting | 2024 | SIGMOD | 4.423786e-05 |
| 9,950 | Distributed Wavelet Thresholding for Maximum Error Metrics | 2016 | SIGMOD | 4.2421586e-05 |
| 11,751 | Efficient Haar+ Synopsis Construction for the Maximum Absolute Error Measure | 2018 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 3,310 | Optimal and Approximate Computation of Summary Statistics for Range Aggregates | 2001 | PODS | 7.2408955e-05 |
| 222 | Wavelet-Based Histograms for Selectivity Estimation | 1998 | SIGMOD | 0.00032828302 |
| 1,615 | The Performance of MapReduce: An In-depth Study | 2010 | VLDB | 0.00011132319 |
| 344 | Surfing Wavelets on Streams: One-Pass Summaries for Approximate Aggregate Queries | 2001 | VLDB | 0.00026702512 |
| 5,783 | Extended Wavelets for Multiple Measures | 2003 | SIGMOD | 5.3289633e-05 |
| 4,659 | One-Pass Wavelet Synopses for Maximum-Error Metrics | 2005 | VLDB | 6.0160083e-05 |
| 1,127 | Dynamic Maintenance of Wavelet-Based Histograms | 2000 | VLDB | 0.00013819179 |
| 273 | Approximate Computation of Multidimensional Aggregates of Sparse Data Using Wavelets | 1999 | SIGMOD | 0.00029390945 |
| 9,950 | Distributed Wavelet Thresholding for Maximum Error Metrics | 2016 | SIGMOD | 4.2421586e-05 |