Database Paper Browser

Back to papers

HyperBlocker: Accelerating Rule-based Blocking in Entity Resolution using GPUs

Summary: GPU-accelerated rule-based blocking for ER using a pipelined CPU–GPU architecture that overlaps data transfer and kernel execution and a data- and rule-aware CPU planner to schedule evaluations. Hardware-aware optimizations enable massive GPU parallelism, yielding 6.8x–9.1x speedups vs prior CPU/GPU systems and cutting end-to-end ER time by ≥30% with comparable accuracy. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13944
Venue
VLDB
Year
2025
Pagerank
4.2721228e-05
Overall Rank
9,846 | 31.51%
DOI
10.14778/3705829.3705847

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
10,486 Rule-Based Graph Cleaning with GPUs on a Single Machine 2025 SIGMOD 4.1945683e-05
10,852 CloudGlide: Deconstructing the Landscape of Cloud-Based Analytics 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 22 of 22 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
102 The Case for Learned Index Structures 2018 SIGMOD 0.00049545203
179 Efficient and Extensible Algorithms for Multi Query Optimization 2000 SIGMOD 0.00037672155
221 Deep Entity Matching with Pre-Trained Language Models 2021 VLDB 0.00033121824
300 Deep Learning for Entity Matching: A Design Space Exploration 2018 SIGMOD 0.00028441466
333 Neo: A Learned Query Optimizer 2019 VLDB 0.00027206884
754 Distributed Representations of Tuples for Entity Resolution 2018 VLDB 0.00017117211
1,882 Tuplex: Data Science in Python at Native Code Speed 2021 SIGMOD 0.0001021625
2,175 Falcon: Scaling Up Hands-Off Crowdsourced Entity Matching to Build Cloud Services 2017 SIGMOD 9.3644117e-05
2,231 Dedoop: Efficient Deduplication with Hadoop 2012 VLDB 9.2304499e-05
2,611 Opening the Black Boxes in Data Flow Optimization 2012 VLDB 8.4536967e-05
3,013 Cardinality Estimation Using Sample Views with Quality Assurance 2007 SIGMOD 7.7137441e-05
3,528 Distributed Data Deduplication 2016 VLDB 7.0066139e-05
3,640 Deep Learning for Blocking in Entity Matching: A Design Space Exploration 2021 VLDB 6.8891671e-05
3,645 Large-Scale Collective Entity Matching 2011 VLDB 6.8853274e-05
3,977 BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution 2016 VLDB 6.5736268e-05
4,464 Magellan: Toward Building Entity Matching Management Systems over Data Science Stacks 2016 VLDB 6.1606042e-05
6,690 Parallel Discrepancy Detection and Incremental Detection 2021 VLDB 4.9621556e-05
7,427 Selection Pushdown in Column Stores using Bit Manipulation Instructions 2023 SIGMOD 4.7327406e-05
8,583 Efficient Execution of User-Defined Functions in SQL Queries 2023 VLDB 4.4919445e-05
9,355 Discovering Top-k Rules using Subjective and Objective Criteria 2023 SIGMOD 4.3514328e-05
9,434 Rock: Cleaning Data by Embedding ML in Logic Rules 2024 SIGMOD 4.3430376e-05
9,963 Parallel Rule Discovery from Large Datasets by Sampling 2022 SIGMOD 4.2294678e-05
Previous Page 1 / 1 Next

Semantically Similar Papers