Back to papers
Unified Lineage System: Tracking Data Provenance at Scale
Summary: ULS is an end-to-end lineage aggregator tracking data flows across heterogeneous assets with a general cross-asset model. It stitches traces, enables multi-granularity navigation, and offers tunable precision/recall, scaling to billions of nodes/edges in Meta for 100+ use cases including privacy.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 7121
- Venue
- SIGMOD
- Year
- 2025
- Pagerank
- 4.1945683e-05
- Overall Rank
- 10,419 | 27.52%
- DOI
-
10.1145/3722212.3724458
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
Outgoing Citations (Sorted by Pagerank)
Showing 22 of 22 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank |
Cited Paper |
Year |
Venue |
Pagerank |
| 396 |
One Trillion Edges: Graph Processing at Facebook-Scale |
2015 |
VLDB |
0.00024424102 |
| 611 |
Lineage Tracing for General Data Warehouse Transformations |
2001 |
VLDB |
0.00019231115 |
| 1,613 |
Realtime Data Processing at Facebook |
2016 |
SIGMOD |
0.00011140777 |
| 1,765 |
Efficient Lineage Tracking For Scientific Workflows |
2008 |
SIGMOD |
0.00010630348 |
| 1,861 |
Efficient Provenance Storage |
2008 |
SIGMOD |
0.00010287053 |
| 1,970 |
Approximate Lineage for Probabilistic Databases |
2008 |
VLDB |
9.896375e-05 |
| 2,027 |
Titian: Data Provenance Support in Spark |
2016 |
VLDB |
9.7437067e-05 |
| 2,028 |
Putting Lipstick on Pig: Enabling Database-style Workflow Provenance |
2012 |
VLDB |
9.7433981e-05 |
| 2,280 |
SMOKE: Fine-grained Lineage at Interactive Speed |
2018 |
VLDB |
9.1111033e-05 |
| 2,524 |
Provenance Management in Curated Databases |
2006 |
SIGMOD |
8.6017899e-05 |
| 2,658 |
Data Warehousing and Analytics Infrastructure at Facebook |
2010 |
SIGMOD |
8.3607429e-05 |
| 3,149 |
Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems |
2019 |
VLDB |
7.4741595e-05 |
| 4,774 |
LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems |
2021 |
SIGMOD |
5.9316087e-05 |
| 5,086 |
Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture |
2020 |
VLDB |
5.7078462e-05 |
| 5,708 |
Lineage-driven Fault Injection |
2015 |
SIGMOD |
5.3603939e-05 |
| 6,696 |
Approximate Summaries for Why and Why-not Provenance |
2020 |
VLDB |
4.9581958e-05 |
| 7,710 |
Ananke: A Streaming Framework for Live Forward Provenance |
2021 |
VLDB |
4.6719822e-05 |
| 8,163 |
Capturing and Querying Fine-grained Provenance of Preprocessing Pipelines in Data Science |
2021 |
VLDB |
4.5723431e-05 |
| 8,729 |
OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs |
2023 |
VLDB |
4.4582221e-05 |
| 8,960 |
Computing How-Provenance for SPARQL Queries via Query Rewriting |
2021 |
VLDB |
4.4206222e-05 |
| 9,202 |
Compact, Tamper-Resistant Archival of Fine-Grained Provenance |
2021 |
VLDB |
4.3742967e-05 |
| 9,753 |
Microsoft Purview: A System for Central Governance of Data |
2023 |
VLDB |
4.2897489e-05 |
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 5,086 |
Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture |
2020 |
VLDB |
5.7078462e-05 |
| 8,729 |
OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs |
2023 |
VLDB |
4.4582221e-05 |
| 9,202 |
Compact, Tamper-Resistant Archival of Fine-Grained Provenance |
2021 |
VLDB |
4.3742967e-05 |
| 2,892 |
Data Provenance at Internet Scale: Architecture, Experiences, and the Road Ahead |
2017 |
CIDR |
7.9480559e-05 |
| 5,843 |
Tracing Lineage Beyond Relational Operators |
2007 |
VLDB |
5.3032967e-05 |
| 4,783 |
Ibis: A Provenance Manager for Multi-Layer Systems |
2011 |
CIDR |
5.9253575e-05 |
| 611 |
Lineage Tracing for General Data Warehouse Transformations |
2001 |
VLDB |
0.00019231115 |
| 3,149 |
Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems |
2019 |
VLDB |
7.4741595e-05 |
| 11,665 |
Ursprung: Provenance for Large-Scale Analytics Environments |
2019 |
SIGMOD |
4.1945683e-05 |
| 1,765 |
Efficient Lineage Tracking For Scientific Workflows |
2008 |
SIGMOD |
0.00010630348 |