GenBase: A Complex Analytics Genomics Benchmark
Summary: GenBase presents a mixed-workload benchmark that combines DBMS tasks (joins, filters) with genomics analytics (regression, SVD). It evaluates row stores, column stores, Hadoop, and array DBMSs in single-node and multi-node configurations, exposing scalability issues; the data and tools are released. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 11 of 11 citing papers.
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10 | Benchmarking Database Systems: A Systematic Approach | 1983 | VLDB | 0.0012103754 |
| 140 | The MADlib Analytics Library or MAD Skills, the SQL | 2012 | VLDB | 0.00042270404 |
| 600 | Linear Road: A Stream Data Management Benchmark | 2004 | VLDB | 0.0001938744 |
| 3,979 | The BUCKY Object-Relational Benchmark | 1997 | SIGMOD | 6.5681058e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 10,366 | An Adaptive Benchmark for Modeling User Exploration of Large Datasets | 2025 | SIGMOD | 4.1945683e-05 |
| 11,894 | Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis | 2015 | CIDR | 4.1945683e-05 |
| 7,902 | Building Highly-Optimized, Low-Latency Pipelines for Genomic Data Analysis | 2015 | CIDR | 4.6215911e-05 |
| 4,517 | Generating Databases for Query Workloads | 2010 | VLDB | 6.1178732e-05 |
| 7,892 | M2Bench: A Database Benchmark for Multi-Model Analytic Workloads | 2023 | VLDB | 4.6245179e-05 |
| 1,727 | BigBench: Towards an Industry Standard Benchmark for Big Data Analytics | 2013 | SIGMOD | 0.00010740936 |
| 340 | OLTP-Bench: An Extensible Testbed for Benchmarking Relational Databases | 2014 | VLDB | 0.00026841628 |
| 42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298 |
| 6,413 | Managing Data from High-Throughput Genomic Processing: A Case Study | 2004 | VLDB | 5.0735389e-05 |
| 12,289 | Data Management for High-Throughput Genomics | 2009 | CIDR | 4.1945683e-05 |