Reptile: Aggregation-level Explanations for Hierarchical Data
Summary: Iterative, human-in-the-loop system that explains and cleans hierarchical data by learning group-level statistics and guiding drill-downs to fix distributive aggregation errors. Introduces factorised learning for aggregation-join queries with hierarchical optimisations, delivering >6× speedups and real-world deployments on Covid-19 and African farmer surveys used for policy-relevant data cleaning. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Zezhou Huang
- 2. Eugene Wu
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,875 | SDEcho: Efficient Explanation of Aggregated Sequence Difference | 2025 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 27 of 27 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,658 | ExplainIt! - A Declarative Root-cause Analysis Engine for Time Series Data | 2019 | SIGMOD | 6.0183783e-05 |
| 5,445 | QFix: Diagnosing Errors through Query Histories | 2017 | SIGMOD | 5.5020909e-05 |
| 11,111 | Rock: Cleaning Data with both ML and Logic Rules | 2024 | VLDB | 4.1945683e-05 |
| 4,972 | Verifying Text Summaries of Relational Data Sets | 2019 | SIGMOD | 5.7931494e-05 |
| 214 | Scorpion: Explaining Away Outliers in Aggregate Queries | 2013 | VLDB | 0.0003363692 |
| 9,278 | Interactive and Deterministic Data Cleaning: A Tossed Stone Raises a Thousand Ripples | 2016 | SIGMOD | 4.3639892e-05 |
| 10,821 | Demonstrating Matelda for Multi-Table Error Detection | 2025 | VLDB | 4.1945683e-05 |
| 7,731 | AggChecker: A Fact-Checking System for Text Summaries of Relational Data Sets | 2019 | VLDB | 4.6658615e-05 |
| 7,450 | SystemER: A Human-in-the-loop System for Explainable Entity Resolution | 2019 | VLDB | 4.7265276e-05 |
| 9,683 | Hierarchical Entity Resolution using an Oracle | 2022 | SIGMOD | 4.3047774e-05 |