MISTIQUE: A System to Store and Query Model Intermediates for Model Diagnosis
Summary: MISTIQUE stores and queries model intermediates for ML diagnosis, spanning traditional pipelines and deep nets. It chooses re-run vs. reuse per query and uses quantization, summarization, dedup to cut storage up to 110× and speed queries up to 390× (ML) / 210× (DL). (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 28 of 28 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 21 | C-Store: A Column-oriented DBMS | 2005 | VLDB | 0.00086087497 |
| 101 | ULDBs: Databases with Uncertainty and Lineage | 2006 | VLDB | 0.0004955674 |
| 734 | The TileDB Array Data Storage Manager | 2017 | VLDB | 0.00017455248 |
| 1,413 | VisTrails: Visualization meets Data Management | 2006 | SIGMOD | 0.00012121257 |
| 1,565 | Principles of Dataset Versioning: Exploring the Recreation/Storage Tradeoff | 2015 | VLDB | 0.00011345567 |
| 1,967 | Compressed Linear Algebra for Large-Scale Machine Learning | 2016 | VLDB | 9.9131712e-05 |
| 2,027 | Titian: Data Provenance Support in Spark | 2016 | VLDB | 9.7437067e-05 |
| 2,037 | OrpheusDB: Bolt-on Versioning for Relational Databases | 2017 | VLDB | 9.7120139e-05 |
| 2,430 | Decibel: The Relational Dataset Branching System | 2016 | VLDB | 8.8330417e-05 |
| 3,347 | Collaborative Data Analytics with DataHub | 2015 | VLDB | 7.1921364e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,095 | NeurStore: Efficient In-database Deep Learning Model Management System | 2026 | SIGMOD | 4.1945683e-05 |
| 10,816 | mlidea: Interactively Improving ML Data Preparation Code via "Shadow Pipelines" | 2025 | VLDB | 4.1945683e-05 |
| 10,499 | Privacy and Accuracy-Aware AI/ML Model Deduplication | 2025 | SIGMOD | 4.1945683e-05 |
| 11,000 | MisDetect: Iterative Mislabel Detection using Early Loss | 2024 | VLDB | 4.1945683e-05 |
| 706 | MYSTIQ: A system for finding more answers by using probabilities | 2005 | SIGMOD | 0.00017845469 |
| 4,734 | MLINSPECT: A Data Distribution Debugger for Machine Learning Pipelines | 2021 | SIGMOD | 5.9615384e-05 |
| 9,231 | Modyn: Data-Centric Machine Learning Pipeline Orchestration | 2025 | SIGMOD | 4.3690661e-05 |
| 10,183 | Mixtera: A Data Plane for Foundation Model Training | 2026 | SIGMOD | 4.1945683e-05 |
| 11,008 | MetaStore: Analyzing Deep Learning Meta-Data at Scale | 2024 | VLDB | 4.1945683e-05 |
| 11,147 | Reconstructing and Querying ML Pipeline Intermediates | 2023 | CIDR | 4.1945683e-05 |