Distributed implementations of dependency discovery algorithms
Summary: This paper studies distributed dependency discovery on big data and the tradeoffs between computation and communication in shared-nothing environments. It identifies six primitives in dependency discovery pipelines, uses them to explore design-space tradeoffs, and validates communication-aware implementations on real data. (summarized by gpt-5-nano on Feb 09 2026)
Authors
1. Hemant Saxena
2. Lukasz Golab
3. Ihab F. Ilyas
Incoming Citations (Sorted by Pagerank)
Showing 9 of 9 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,756 | Fast Incremental Discovery of Pointwise Order Dependencies | 2020 | VLDB | 4.9379361e-05 |
| 8,836 | Fast Approximate Denial Constraint Discovery | 2023 | VLDB | 4.4393184e-05 |
| 9,355 | Discovering Top-k Rules using Subjective and Objective Criteria | 2023 | SIGMOD | 4.3514328e-05 |
| 9,646 | Discovering Functional Dependencies through Hitting Set Enumeration | 2024 | SIGMOD | 4.3109001e-05 |
| 9,749 | Efficient Differential Dependency Discovery | 2024 | VLDB | 4.2897489e-05 |
| 9,963 | Parallel Rule Discovery from Large Datasets by Sampling | 2022 | SIGMOD | 4.2294678e-05 |
| 10,540 | Discovering Approximate Inclusion Dependencies | 2025 | VLDB | 4.1945683e-05 |
| 10,587 | Efficient Discovery of Relaxed Functional Dependencies | 2025 | VLDB | 4.1945683e-05 |
| 11,010 | Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates | 2024 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 6 of 6 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 555 | Discovering Denial Constraints | 2013 | VLDB | 0.00020254908 |
| 894 | A Hybrid Approach to Functional Dependency Discovery | 2016 | SIGMOD | 0.00015556428 |
| 1,047 | Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms | 2015 | VLDB | 0.00014459715 |
| 2,253 | Efficient Denial Constraint Discovery with Hydra | 2018 | VLDB | 9.1937209e-05 |
| 3,528 | Distributed Data Deduplication | 2016 | VLDB | 7.0066139e-05 |
| 4,744 | Effective and Complete Discovery of Order Dependencies via Set-based Axiomatization | 2017 | VLDB | 5.957936e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,749 | Efficient Differential Dependency Discovery | 2024 | VLDB | 4.2897489e-05 |
| 8,462 | Topology-aware Parallel Data Processing: Models, Algorithms and Systems at Scale | 2020 | CIDR | 4.5056381e-05 |
| 13,305 | Formal Approaches to Querying Big Data in Shared-Nothing Systems | 2019 | SIGMOD | - |
| 2,848 | Exploiting Matrix Dependency for Efficient Distributed Matrix Computation | 2015 | SIGMOD | 8.0208832e-05 |
| 1,953 | Distributed Evaluation of Subgraph Queries Using Worst-case Optimal Low-Memory Dataflows | 2018 | VLDB | 9.9665955e-05 |
| 3,528 | Distributed Data Deduplication | 2016 | VLDB | 7.0066139e-05 |
| 7,833 | Dependency-Driven Analytics: a Compass for Uncharted Data Oceans | 2017 | CIDR | 4.6382648e-05 |
| 7,366 | Discovery Algorithms for Embedded Functional Dependencies | 2020 | SIGMOD | 4.7515248e-05 |
| 1,047 | Functional Dependency Discovery: An Experimental Evaluation of Seven Algorithms | 2015 | VLDB | 0.00014459715 |
| 13,817 | Communication-Efficient Distributed Mining of Association Rules | 2001 | SIGMOD | - |