Rule-Based Graph Cleaning with GPUs on a Single Machine
Summary: Rule-based graph cleaning on a single machine with GPU acceleration. MiniClean embeds ML predicates for discovery and correction. It reduces the memory footprint via bundling and compression, and uses a CPU-GPU-I/O pipeline with SIMD and parallelism to boost CPU-GPU synergy.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID: 7228
- Venue: SIGMOD
- Year: 2025
- Pagerank: 4.1945683e-05
- Overall Rank: 10,486 | 27.06%
- DOI: 10.1145/3725303
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Incoming Citations (Sorted by Pagerank)
Showing 0 of 0 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
Outgoing Citations (Sorted by Pagerank)
Showing 30 of 30 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4 | Pregel: A System for Large-Scale Graph Processing | 2010 | SIGMOD | 0.0019005923 |
| 49 | Consistent Query Answers in Inconsistent Databases | 1999 | PODS | 0.00067660624 |
| 109 | Dremel: Interactive Analysis of Web-Scale Datasets | 2010 | VLDB | 0.00048186983 |
| 221 | Deep Entity Matching with Pre-Trained Language Models | 2021 | VLDB | 0.00033121824 |
| 297 | Complexity of Answering Queries Using Materialized Views | 1998 | PODS | 0.00028596715 |
| 300 | Deep Learning for Entity Matching: A Design Space Exploration | 2018 | SIGMOD | 0.00028441466 |
| 444 | Parallelizing Sequential Graph Computations | 2017 | SIGMOD | 0.00022987918 |
| 754 | Distributed Representations of Tuples for Entity Resolution | 2018 | VLDB | 0.00017117211 |
| 2,450 | Functional Dependencies for Graphs | 2016 | SIGMOD | 8.7882979e-05 |
| 2,527 | Dependencies for Graphs | 2017 | PODS | 8.5954406e-05 |
| 3,287 | GraphScope: A Unified Engine For Big Graph Processing | 2021 | VLDB | 7.2739447e-05 |
| 3,418 | CoroGraph: Bridging Cache Efficiency and Work Efficiency for Graph Algorithm Execution | 2024 | VLDB | 7.1188618e-05 |
| 3,462 | Efficient and Provable Multi-Query Optimization | 2017 | PODS | 7.0703696e-05 |
| 3,525 | Single Machine Graph Analytics on Massive Datasets Using Intel Optane DC Persistent Memory | 2020 | VLDB | 7.0080401e-05 |
| 3,641 | GPU-Accelerated Subgraph Enumeration on Partitioned Graphs | 2020 | SIGMOD | 6.8884895e-05 |
| 3,646 | G-CARE: A Framework for Performance Benchmarking of Cardinality Estimation Techniques for Subgraph Matching | 2020 | SIGMOD | 6.8853079e-05 |
| 3,694 | Keys for Graphs | 2015 | VLDB | 6.8345712e-05 |
| 4,494 | Multi-Query Optimization for Subgraph Isomorphism Search | 2017 | VLDB | 6.1414196e-05 |
| 4,968 | Efficient GPU-Accelerated Subgraph Matching | 2023 | SIGMOD | 5.7956205e-05 |
| 6,658 | Scalable Querying of Nested Data | 2021 | VLDB | 4.9711629e-05 |
| 6,674 | Exploiting Common Patterns for Tree-Structured Data | 2017 | SIGMOD | 4.9663344e-05 |
| 6,703 | Discovering Graph Functional Dependencies | 2018 | SIGMOD | 4.9555163e-05 |
| 7,287 | Discovering Association Rules from Big Graphs | 2022 | VLDB | 4.7762276e-05 |
| 8,133 | Towards Event Prediction in Temporal Graphs | 2022 | VLDB | 4.5784634e-05 |
| 8,146 | MiniGraph: Querying Big Graphs with a Single Machine | 2023 | VLDB | 4.5755031e-05 |
| 8,211 | Capturing Associations in Graphs | 2020 | VLDB | 4.5581054e-05 |
| 8,422 | Deducing Certain Fixes to Graphs | 2019 | VLDB | 4.5167705e-05 |
| 9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05 |
| 9,564 | Catching Numeric Inconsistencies in Graphs | 2018 | SIGMOD | 4.3254416e-05 |
| 9,846 | HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs | 2025 | VLDB | 4.2721228e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,287 | Discovering Association Rules from Big Graphs | 2022 | VLDB | 4.7762276e-05 |
| 7,225 | Self-adaptive Graph Traversal on GPUs | 2021 | SIGMOD | 4.7956162e-05 |
| 4,577 | Accelerating Dynamic Graph Analytics on GPUs | 2018 | VLDB | 6.0709631e-05 |
| 4,522 | GPU-based Graph Traversal on Compressed Graphs | 2019 | SIGMOD | 6.1146374e-05 |
| 3,641 | GPU-Accelerated Subgraph Enumeration on Partitioned Graphs | 2020 | SIGMOD | 6.8884895e-05 |
| 6,985 | CompressGraph: Efficient Parallel Graph Analytics with Rule-Based Compression | 2023 | SIGMOD | 4.8729387e-05 |
| 4,254 | Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining | 2011 | VLDB | 6.3213177e-05 |
| 9,487 | Making It Tractable to Catch Duplicates and Conflicts in Graphs | 2023 | SIGMOD | 4.3341665e-05 |
| 8,146 | MiniGraph: Querying Big Graphs with a Single Machine | 2023 | VLDB | 4.5755031e-05 |
| 10,446 | MiniClean: A Single-Machine System for Cleaning Big Graphs | 2025 | SIGMOD | 4.1945683e-05 |