SMOKE: Fine-grained Lineage at Interactive Speed
Summary: SMOKE is an in-memory DB engine that tightly integrates lineage capture into physical operators to minimize overhead and accelerate lineage queries. It uses compact lineage representations and upfront-query-aware optimizations to deliver interactive latency (sub-150 ms) and multi-order-of-magnitude improvements over prior systems on real workloads. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Fotis Psallidas
- 2. Eugene Wu
Incoming Citations (Sorted by Pagerank)
Showing 25 of 25 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 28 of 28 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,047 | Thrifty Query Execution via Incrementability | 2020 | SIGMOD | 4.5983505e-05 |
| 2,173 | Querying Data Provenance | 2010 | SIGMOD | 9.3676609e-05 |
| 7,556 | Interactive Query Explanations Using Fine Grained Provenance | 2022 | SIGMOD | 4.7117814e-05 |
| 8,729 | OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance From Database Query Event Logs | 2023 | VLDB | 4.4582221e-05 |
| 10,419 | Unified Lineage System: Tracking Data Provenance at Scale | 2025 | SIGMOD | 4.1945683e-05 |
| 7,754 | Lineage Processing over Correlated Probabilistic Databases | 2010 | SIGMOD | 4.6600967e-05 |
| 5,843 | Tracing Lineage Beyond Relational Operators | 2007 | VLDB | 5.3032967e-05 |
| 11,353 | Lineage Resource Manager | 2022 | SIGMOD | 4.1945683e-05 |
| 1,765 | Efficient Lineage Tracking For Scientific Workflows | 2008 | SIGMOD | 0.00010630348 |
| 11,710 | Demonstration of Smoke: A Deep Breath of Data-Intensive Lineage Applications | 2018 | SIGMOD | 4.1945683e-05 |