Database Paper Browser

Back to papers

CM-Explorer: Dissecting Data Ingestion Problems

Summary: CM-Explorer: system for disentangling correlated violated conditional unit tests over conditional metrics (CMs) to surface minimal subrelations (tuples) that explain ingestion errors. Uses a graph explorer (correlation visualization), relation explorer (tuple browsing) and history explorer (temporal diagnostics) to help stewards find the most relevant tests. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13239
Venue
VLDB
Year
2023
Pagerank
4.1945683e-05
Overall Rank
11,280 | 21.53%
DOI
10.14778/3611540.3611595

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,482 Automating Large-Scale Data Quality Verification 2018 VLDB 0.00011725533
3,491 TensorFlow Data Validation: Data Analysis and Validation in Continuous ML Pipelines 2020 SIGMOD 7.0451276e-05
5,257 Probabilistic Demand Forecasting at Scale 2017 VLDB 5.6003925e-05
Previous Page 1 / 1 Next

Semantically Similar Papers

Overall Rank Paper Year Venue Pagerank
10,821 Demonstrating Matelda for Multi-Table Error Detection 2025 VLDB 4.1945683e-05
11,462 INCA: Inconsistency-Aware Data Profiling and Querying 2021 SIGMOD 4.1945683e-05
5,445 QFix: Diagnosing Errors through Query Histories 2017 SIGMOD 5.5020909e-05
3,976 UGuide – User-Guided Discovery of FD-Detectable Errors 2017 SIGMOD 6.5736462e-05
7,556 Interactive Query Explanations Using Fine Grained Provenance 2022 SIGMOD 4.7117814e-05
5,660 Descriptive and Prescriptive Data Cleaning 2014 SIGMOD 5.3847321e-05
1,482 Automating Large-Scale Data Quality Verification 2018 VLDB 0.00011725533
11,874 Graph-based Exploration of Non-graph Datasets 2016 VLDB 4.1945683e-05
732 Discovering Data Quality Rules 2008 VLDB 0.00017465093
4,929 Data Auditor: Exploring Data Quality and Semantics using Pattern Tableaux 2010 VLDB 5.8217296e-05