Database Paper Browser

Back to papers

MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines

Summary: MLINSPECT: a data distribution debugger for ML pipelines, using lightweight lineage-based inspection to pinpoint distribution bugs in preprocessing. It operates on declarative estimator/transformer abstractions, handles relational and matrix data, and requires no manual code instrumentation. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
6041
Venue
SIGMOD
Year
2021
Pagerank
5.9615384e-05
Overall Rank
4,734 | 67.07%
DOI
10.1145/3448016.3452759

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 9 of 9 citing papers.

Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 2 of 2 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
1,404 Responsible Data Management 2020 VLDB 0.00012174977
6,291 Lightweight Inspection of Data Preprocessing in Native Machine Learning Pipelines 2021 CIDR 5.1269764e-05
Previous Page 1 / 1 Next

Semantically Similar Papers