Dependency-Driven Analytics: a Compass for Uncharted Data Oceans
Summary: Proposes DDA: extract a compact dependency graph from raw telemetry to index and constrain analyses, reducing cognitive and compute costs for petabyte-scale log analytics. Deployed at Microsoft on job/file lineage using off-the-shelf Big Data+graph DBs. (summarized by gpt-5-mini on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,174 | Computation Reuse in Analytics Job Service at Microsoft | 2018 | SIGMOD | 6.3856219e-05 |
| 5,086 | Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture | 2020 | VLDB | 5.7078462e-05 |
| 8,729 | OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs | 2023 | VLDB | 4.4582221e-05 |
| 11,667 | Peering through the Dark: An Owl's View of Inter-job Dependencies and Jobs' Impact in Shared Clusters | 2019 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next