Database Paper Browser

Back to papers

Unified Lineage System: Tracking Data Provenance at Scale

Summary: ULS is an end-to-end lineage aggregator tracking data flows across heterogeneous assets with a general cross-asset model. It stitches traces, enables multi-granularity navigation, and offers tunable precision/recall, scaling to billions of nodes/edges in Meta for 100+ use cases including privacy. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
7121
Venue
SIGMOD
Year
2025
Pagerank
4.1945683e-05
Overall Rank
10,419 | 27.52%
DOI
10.1145/3722212.3724458

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.

Rank Citing Paper Year Venue Pagerank
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 22 of 22 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
396 One Trillion Edges: Graph Processing at Facebook-Scale 2015 VLDB 0.00024424102
611 Lineage Tracing for General Data Warehouse Transformations 2001 VLDB 0.00019231115
1,613 Realtime Data Processing at Facebook 2016 SIGMOD 0.00011140777
1,765 Efficient Lineage Tracking For Scientific Workflows 2008 SIGMOD 0.00010630348
1,861 Efficient Provenance Storage 2008 SIGMOD 0.00010287053
1,970 Approximate Lineage for Probabilistic Databases 2008 VLDB 9.896375e-05
2,027 Titian: Data Provenance Support in Spark 2016 VLDB 9.7437067e-05
2,028 Putting Lipstick on Pig: Enabling Database-style Workflow Provenance 2012 VLDB 9.7433981e-05
2,280 SMOKE: Fine-grained Lineage at Interactive Speed 2018 VLDB 9.1111033e-05
2,524 Provenance Management in Curated Databases 2006 SIGMOD 8.6017899e-05
2,658 Data Warehousing and Analytics Infrastructure at Facebook 2010 SIGMOD 8.3607429e-05
3,149 Fine-Grained, Secure and Efficient Data Provenance on Blockchain Systems 2019 VLDB 7.4741595e-05
4,774 LIMA: Fine-grained Lineage Tracing and Reuse in Machine Learning Systems 2021 SIGMOD 5.9316087e-05
5,086 Improving Reproducibility of Data Science Pipelines through Transparent Provenance Capture 2020 VLDB 5.7078462e-05
5,708 Lineage-driven Fault Injection 2015 SIGMOD 5.3603939e-05
6,696 Approximate Summaries for Why and Why-not Provenance 2020 VLDB 4.9581958e-05
7,710 Ananke: A Streaming Framework for Live Forward Provenance 2021 VLDB 4.6719822e-05
8,163 Capturing and Querying Fine-grained Provenance of Preprocessing Pipelines in Data Science 2021 VLDB 4.5723431e-05
8,729 OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs 2023 VLDB 4.4582221e-05
8,960 Computing How-Provenance for SPARQL Queries via Query Rewriting 2021 VLDB 4.4206222e-05
9,202 Compact, Tamper-Resistant Archival of Fine-Grained Provenance 2021 VLDB 4.3742967e-05
9,753 Microsoft Purview: A System for Central Governance of Data 2023 VLDB 4.2897489e-05
Previous Page 1 / 1 Next

Semantically Similar Papers