RDFind: Scalable Conditional Inclusion Dependency Discovery in RDF Datasets
Summary: Introduces pertinent conditional inclusion dependencies (cinds) and RDFind, a scalable distributed system for discovering them in RDF data. Lazy pruning and aggressive parallelization drastically shrink the search space, delivering up to 419x speedup and enabling billions-of-triples RDF graphs. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 2 of 2 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,976 | UGuide – User-Guided Discovery of FD-Detectable Errors | 2017 | SIGMOD | 6.5736462e-05 |
| 8,289 | Knowledge Graph Exploration Systems: are we lost? | 2022 | CIDR | 4.5435639e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 8 of 8 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 36 | Fast Algorithms for Mining Association Rules | 1994 | VLDB | 0.00076161096 |
| 224 | CORDS: Automatic Discovery of Correlations and Soft Functional Dependencies | 2004 | SIGMOD | 0.00032746205 |
| 560 | Dependencies Revisited for Improving Data Quality | 2008 | PODS | 0.00020141923 |
| 872 | An Efficient SQL-based RDF Querying Scheme | 2005 | VLDB | 0.00015759968 |
| 1,401 | Extending Dependencies with Conditions | 2007 | VLDB | 0.00012187775 |
| 1,664 | On Multi-Column Foreign Key Discovery | 2010 | VLDB | 0.00010976887 |
| 1,959 | Building an Efficient RDF Store Over a Relational Database | 2013 | SIGMOD | 9.9563798e-05 |
| 4,784 | Divide & Conquer-based Inclusion Dependency Discovery | 2015 | VLDB | 5.9240851e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,263 | H2 RDF+: An Efficient Data Management System for Big RDF Graphs | 2014 | SIGMOD | 4.7851876e-05 |
| 6,401 | Scaling Queries over Big RDF Graphs with Semantic Hash Partitioning | 2013 | VLDB | 5.0801167e-05 |
| 1,675 | A Distributed Graph Engine for Web Scale RDF Data | 2013 | VLDB | 0.00010947606 |
| 11,701 | Efficient Exploration of Linked Data | 2018 | SIGMOD | 4.1945683e-05 |
| 9,175 | Efficient Exploration of Interesting Aggregates in RDF Graphs | 2021 | SIGMOD | 4.383548e-05 |
| 872 | An Efficient SQL-based RDF Querying Scheme | 2005 | VLDB | 0.00015759968 |
| 582 | Scalable SPARQL Querying of Large RDF Graphs | 2011 | VLDB | 0.00019723083 |
| 10,540 | Discovering Approximate Inclusion Dependencies | 2025 | VLDB | 4.1945683e-05 |
| 2,410 | Scalable Join Processing on Very Large RDF Graphs | 2009 | SIGMOD | 8.8773796e-05 |
| 4,784 | Divide & Conquer-based Inclusion Dependency Discovery | 2015 | VLDB | 5.9240851e-05 |