CM-Explorer: Dissecting Data Ingestion Problems
Summary: CM-Explorer: system for disentangling correlated violated conditional unit tests over conditional metrics (CMs) to surface minimal subrelations (tuples) that explain ingestion errors. Uses a graph explorer (correlation visualization), relation explorer (tuple browsing) and history explorer (temporal diagnostics) to help stewards find the most relevant tests. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
- 1. Niels Bylois
- 2. Frank Neven
- 3. Stijn Vansummeren
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,482 | Automating Large-Scale Data Quality Verification | 2018 | VLDB | 0.00011725533 |
| 3,491 | TensorFlow Data Validation: Data Analysis and Validation in Continuous ML Pipelines | 2020 | SIGMOD | 7.0451276e-05 |
| 5,257 | Probabilistic Demand Forecasting at Scale | 2017 | VLDB | 5.6003925e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,821 | Demonstrating Matelda for Multi-Table Error Detection | 2025 | VLDB | 4.1945683e-05 |
| 11,462 | INCA: Inconsistency-Aware Data Profiling and Querying | 2021 | SIGMOD | 4.1945683e-05 |
| 5,445 | QFix: Diagnosing Errors through Query Histories | 2017 | SIGMOD | 5.5020909e-05 |
| 3,976 | UGuide – User-Guided Discovery of FD-Detectable Errors | 2017 | SIGMOD | 6.5736462e-05 |
| 7,556 | Interactive Query Explanations Using Fine Grained Provenance | 2022 | SIGMOD | 4.7117814e-05 |
| 5,660 | Descriptive and Prescriptive Data Cleaning | 2014 | SIGMOD | 5.3847321e-05 |
| 1,482 | Automating Large-Scale Data Quality Verification | 2018 | VLDB | 0.00011725533 |
| 11,874 | Graph-based Exploration of Non-graph Datasets | 2016 | VLDB | 4.1945683e-05 |
| 732 | Discovering Data Quality Rules | 2008 | VLDB | 0.00017465093 |
| 4,929 | Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux | 2010 | VLDB | 5.8217296e-05 |