Lowering the Latency of Data Processing Pipelines Through FPGA based Hardware Acceleration
Summary: FPGA-based acceleration lowers pipeline latency by reducing data movement and by speeding up decision-tree ensemble scoring. The compact FPGA engine raises throughput, integrates with earlier pipeline stages, and delivers a speedup of two orders of magnitude over a CPU baseline on real Amazon F1 hardware. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,327 | Pump Up the Volume: Processing Large Data on GPUs with Fast Interconnects | 2020 | SIGMOD | 7.2205738e-05 |
| 5,088 | TCUDB: Accelerating Database with Tensor Processors | 2022 | SIGMOD | 5.7072189e-05 |
| 6,282 | Cheetah: Accelerating Database Queries with Switch Pruning | 2020 | SIGMOD | 5.128797e-05 |
| 10,507 | SwiftSpatial: Spatial Joins on Modern Hardware | 2025 | SIGMOD | 4.1945683e-05 |
| 11,589 | Making Search Engines Faster by Lowering the Cost of Querying Business Rules Through FPGAs | 2020 | SIGMOD | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,455 | RainForest - A Framework for Fast Decision Tree Construction of Large Datasets | 1998 | VLDB | 0.00011899821 |
| 2,372 | Predictable Performance for Unpredictable Workloads | 2009 | VLDB | 8.947963e-05 |
| 2,630 | PLANET: Massively Parallel Learning of Tree Ensembles with MapReduce | 2009 | VLDB | 8.4128091e-05 |
| 2,687 | BOAT—Optimistic Decision Tree Construction | 1999 | SIGMOD | 8.3050259e-05 |
| 4,033 | In-RDBMS Hardware Acceleration of Advanced Analytics | 2018 | VLDB | 6.5113267e-05 |
| 5,123 | Accelerating Generalized Linear Models with MLWeaving: A One-Size-Fits-All System for Any-Precision Learning | 2019 | VLDB | 5.6796998e-05 |
| 5,178 | FPGA-based Data Partitioning | 2017 | SIGMOD | 5.6438393e-05 |
| 6,404 | ColumnML: Column-Store Machine Learning with On-The-Fly Data Transformation | 2019 | VLDB | 5.0786954e-05 |
| 8,202 | Accelerating Pattern Matching Queries in Hybrid CPU-FPGA Architectures | 2017 | SIGMOD | 4.5598793e-05 |
| 8,423 | doppioDB: A Hardware Accelerated Database | 2017 | SIGMOD | 4.5163448e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 12,462 | Optimization of Frequent Itemset Mining on Multiple-Core Processor | 2007 | VLDB | 4.1945683e-05 |
| 381 | FAST: Fast Architecture Sensitive Tree Search on Modern CPUs and GPUs | 2010 | SIGMOD | 0.00024873637 |
| 9,785 | Is FPGA Useful for Hash Joins? Exploring Hash Joins on Coupled CPU-FPGA Architecture | 2020 | CIDR | 4.284797e-05 |
| 5,721 | FPGA-based Multithreading for In-Memory Hash Joins | 2015 | CIDR | 5.3525009e-05 |
| 5,178 | FPGA-based Data Partitioning | 2017 | SIGMOD | 5.6438393e-05 |
| 4,385 | Flexible Query Processor on FPGAs | 2013 | VLDB | 6.2331718e-05 |
| 9,882 | DASH: Asynchronous Hardware Data Processing Services | 2023 | CIDR | 4.2643674e-05 |
| 950 | Data Processing on FPGAs | 2009 | VLDB | 0.00015108484 |
| 10,703 | Fast Graph Vector Search via Hardware Acceleration and Delayed-Synchronization Traversal | 2025 | VLDB | 4.1945683e-05 |
| 11,589 | Making Search Engines Faster by Lowering the Cost of Querying Business Rules Through FPGAs | 2020 | SIGMOD | 4.1945683e-05 |