HypDB: A Demonstration of Detecting, Explaining and Resolving Bias in OLAP queries
Summary: Demonstrates HypDB, the first system to detect, explain, and resolve bias in OLAP queries. It uses real-world datasets to expose biases (e.g., Simpson’s paradox) and rewrites queries to remove bias, enabling unbiased decision-support insights. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Babak Salimi
- 2. Corey Cole
- 3. Peter Li
- 4. Johannes Gehrke
- 5. Dan Suciu
Incoming Citations (Sorted by Pagerank)
Showing 7 of 7 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,753 | Complaint-driven Training Data Debugging for Query 2.0 | 2020 | SIGMOD | 8.1724339e-05 |
| 2,923 | Explaining Black-Box Algorithms Using Probabilistic Contrastive Counterfactuals | 2021 | SIGMOD | 7.8953538e-05 |
| 3,299 | SCODED: Statistical Constraint Oriented Data Error Detection | 2020 | SIGMOD | 7.2546659e-05 |
| 4,872 | Explainable AI: Foundations, Applications, Opportunities for Data Management Research | 2022 | SIGMOD | 5.8609352e-05 |
| 6,247 | Optimizing In-memory Database Engine for AI-powered On-line Decision Augmentation Using Persistent Memory | 2021 | VLDB | 5.1389201e-05 |
| 10,740 | Finding Convincing Views to Endorse a Claim | 2025 | VLDB | 4.1945683e-05 |
| 10,954 | Counterfactual Explanation at Will, with Zero Privacy Leakage | 2024 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,132 | Towards Sustainable Insights or why polygamy is bad for you | 2017 | CIDR | 9.4770432e-05 |
| 2,810 | Bias in OLAP Queries: Detection, Explanation, and Removal (Or Think Twice About Your AVG-Query) | 2018 | SIGMOD | 8.0810163e-05 |
| 8,420 | ZaliQL: Causal Inference from Observational Data at Scale | 2017 | VLDB | 4.5173249e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,959 | Supporting OLAP Operations over Imperfectly Integrated Taxonomies | 2008 | SIGMOD | 4.8857059e-05 |
| 2,067 | HippogriffDB: Balancing I/O and GPU Bandwidth in Big Data Analytics | 2016 | VLDB | 9.6392739e-05 |
| 767 | Explaining differences in multidimensional aggregates | 1999 | VLDB | 0.00016981309 |
| 5,674 | Efficient Allocation Algorithms for OLAP over Imprecise Data | 2006 | VLDB | 5.377195e-05 |
| 11,427 | Accelerating Complex Analytics using Speculation | 2021 | CIDR | 4.1945683e-05 |
| 2,530 | HyPer-sonic Combined Transaction AND Query Processing | 2011 | VLDB | 8.5938865e-05 |
| 2,268 | OLAP Over Uncertain and Imprecise Data | 2005 | VLDB | 9.1497575e-05 |
| 3,372 | OLAP over Imprecise Data with Domain Constraints | 2007 | VLDB | 7.1683982e-05 |
| 5,923 | HyBench: A New Benchmark for HTAP Databases | 2024 | VLDB | 5.2721765e-05 |
| 2,810 | Bias in OLAP Queries: Detection, Explanation, and Removal (Or Think Twice About Your AVG-Query) | 2018 | SIGMOD | 8.0810163e-05 |