Erebus: Explaining the Outputs of Data Streaming Queries
Summary: Erebus is the first system for explaining missing answers in data streams, extending beyond why-provenance by letting users declare runtime expectations and detecting divergence under streaming constraints (limited storage, unbounded input). It synthesizes compact explanations for absent results with low overheads on both low- and high-end devices, validated on real workloads. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,024 | LPStream: Fine-grained Lazy Provenance for Stream Processing | 2026 | SIGMOD | 4.1945683e-05 |
| 10,546 | Evaluating Continuous Queries with Inconsistency Annotations | 2025 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 7 of 7 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 487 | Why Not? | 2009 | SIGMOD | 0.00022050218 |
| 538 | The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing | 2015 | VLDB | 0.00020678804 |
| 600 | Linear Road: A Stream Data Management Benchmark | 2004 | VLDB | 0.0001938744 |
| 3,095 | Answering Why-not Questions on Reverse Top-k Queries | 2015 | VLDB | 7.5692859e-05 |
| 7,678 | To Not Miss the Forest for the Trees - A Holistic Approach for Explaining Missing Answers over Nested Data | 2021 | SIGMOD | 4.6813062e-05 |
| 7,710 | Ananke: A Streaming Framework for Live Forward Provenance | 2021 | VLDB | 4.6719822e-05 |
| 8,149 | Why Not Match: On Explanations of Event Pattern Queries | 2021 | SIGMOD | 4.5752863e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,384 | A Demonstration of DBWipes: Clean as You Query | 2012 | VLDB | 5.0880333e-05 |
| 12,109 | Declarative Error Management for Robust Data-Intensive Applications | 2012 | SIGMOD | 4.1945683e-05 |
| 1,222 | Querying and Mining Data Streams: You Only Get One Look | 2002 | SIGMOD | 0.00013213129 |
| 7,660 | Scalable Delivery of Stream Query Result | 2009 | VLDB | 4.6862657e-05 |
| 1,699 | Sensitivity Analysis and Explanations for Robust Query Evaluation in Probabilistic Databases | 2011 | SIGMOD | 0.00010858983 |
| 652 | On the Provenance of Non-Answers to Queries over Extracted Data | 2008 | VLDB | 0.00018634477 |
| 4,326 | Fast Queries Over Heterogeneous Data Through Engine Customization | 2016 | VLDB | 6.288323e-05 |
| 2,562 | Explaining Missing Answers to SPJUA Queries | 2010 | VLDB | 8.5386194e-05 |
| 9,144 | EIRES: Efficient Integration of Remote Data in Event Stream Processing | 2021 | SIGMOD | 4.3850401e-05 |
| 1,125 | How to ConQueR Why-Not Questions | 2010 | SIGMOD | 0.00013845652 |