| 66 |
Spark SQL: Relational Data Processing in Spark |
2015 |
SIGMOD |
0.00061639801 |
| 1,435 |
Simba: Efficient In-Memory Spatial Analytics |
2016 |
SIGMOD |
0.00012004456 |
| 1,477 |
Fine-grained Partitioning for Aggressive Data Skipping |
2014 |
SIGMOD |
0.00011770865 |
| 1,487 |
Scuba: Diving into Data at Facebook |
2013 |
VLDB |
0.00011701099 |
| 1,814 |
Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing |
2014 |
VLDB |
0.00010458107 |
| 1,874 |
Knowing When You’re Wrong: Building Fast and Reliable Approximate Query Processing Systems |
2014 |
SIGMOD |
0.00010244443 |
| 1,939 |
From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System |
2015 |
SIGMOD |
0.00010025655 |
| 2,127 |
SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures |
2014 |
VLDB |
9.4863172e-05 |
| 2,212 |
Skew in Parallel Query Processing |
2014 |
PODS |
9.2771827e-05 |
| 2,412 |
WideTable: An Accelerator for Analytical Data Processing |
2014 |
VLDB |
8.8726508e-05 |
| 2,772 |
Quickstep: A Data Platform Based on the Scaling-Up Approach |
2018 |
VLDB |
8.1401661e-05 |
| 2,844 |
Towards Scalable Real-time Analytics: An Architecture for Scale-out of OLxP Workloads |
2015 |
VLDB |
8.0243849e-05 |
| 2,928 |
WANalytics: Analytics for a Geo-Distributed Data-Intensive World |
2015 |
CIDR |
7.8812874e-05 |
| 2,946 |
BigDansing: A System for Big Data Cleansing |
2015 |
SIGMOD |
7.8372441e-05 |
| 3,066 |
HAWQ: A Massively Parallel Processing SQL Engine in Hadoop |
2014 |
SIGMOD |
7.6221974e-05 |
| 3,608 |
Column Sketches: A Scan Accelerator for Rapid and Robust Predicate Evaluation |
2018 |
SIGMOD |
6.924272e-05 |
| 3,821 |
Locality-aware Partitioning in Parallel Database Systems |
2015 |
SIGMOD |
6.7281515e-05 |
| 4,033 |
In-RDBMS Hardware Acceleration of Advanced Analytics |
2018 |
VLDB |
6.5113267e-05 |
| 4,046 |
WANalytics: Geo-Distributed Analytics for a Data Intensive World |
2015 |
SIGMOD |
6.4979392e-05 |
| 4,161 |
Access Path Selection in Main-Memory Optimized Data Systems: Should I Scan or Should I Probe? |
2017 |
SIGMOD |
6.3938006e-05 |
| 5,014 |
Dynamically Optimizing Queries over Large Scale Data Platforms |
2014 |
SIGMOD |
5.7586174e-05 |
| 5,119 |
Design Tradeoffs of Data Access Methods |
2016 |
SIGMOD |
5.6807904e-05 |
| 5,368 |
Fine-Grained Modeling and Optimization for Intelligent Resource Management in Big Data Processing |
2022 |
VLDB |
5.5457532e-05 |
| 5,829 |
A Performance Study of Big Data on Small Nodes |
2015 |
VLDB |
5.3113542e-05 |
| 6,075 |
Opportunistic Physical Design for Big Data Analytics |
2014 |
SIGMOD |
5.223901e-05 |
| 6,304 |
Elastic Pipelining in an In-Memory Database Cluster |
2016 |
SIGMOD |
5.1210182e-05 |
| 6,784 |
SparkR: Scaling R Programs with Spark |
2016 |
SIGMOD |
4.9265155e-05 |
| 6,802 |
Understanding Insights into the Basic Structure and Essential Issues of Table Placement Methods in Clusters |
2013 |
VLDB |
4.9226626e-05 |
| 6,809 |
Adaptive Data Skipping in Main-Memory Systems |
2016 |
SIGMOD |
4.9206606e-05 |
| 6,856 |
Liquid: Unifying Nearline and Offline Big Data Integration |
2015 |
CIDR |
4.9060615e-05 |
| 6,871 |
Towards General and Efficient Online Tuning for Spark |
2023 |
VLDB |
4.8997004e-05 |
| 6,895 |
Decentralized Actor Scheduling and Reference-based Storage in Xorbits: a Native Scalable Data Science Engine |
2025 |
VLDB |
4.8925595e-05 |
| 7,059 |
Adaptive and Robust Query Execution for Lakehouses at Scale |
2024 |
VLDB |
4.8477825e-05 |
| 7,067 |
JetScope: Reliable and Interactive Analytics at Cloud Scale |
2015 |
VLDB |
4.8440936e-05 |
| 7,207 |
Kodiak: Leveraging Materialized Views For Very Low-Latency Analytics Over High-Dimensional Web-Scale Data |
2016 |
VLDB |
4.800763e-05 |
| 7,369 |
Using VDMS to Index and Search 100M Images |
2021 |
VLDB |
4.750437e-05 |
| 7,387 |
Bubble Execution: Resource-aware Reliable Analytics at Cloud Scale |
2018 |
VLDB |
4.7438193e-05 |
| 7,599 |
Quill: Efficient, Transferable, and Rich Analytics at Scale |
2016 |
VLDB |
4.7003593e-05 |
| 7,920 |
JoinBoost: Grow Trees Over Normalized Data Using Only SQL |
2023 |
VLDB |
4.6163888e-05 |
| 8,197 |
SparkCruise: Workload Optimization in Managed Spark Clusters at Microsoft |
2021 |
VLDB |
4.5607121e-05 |
| 8,215 |
Parallel-Correctness and Transferability for Conjunctive Queries |
2015 |
PODS |
4.5577562e-05 |
| 8,464 |
Piranha: Optimizing Short Jobs in Hadoop |
2013 |
VLDB |
4.5052127e-05 |
| 8,617 |
A Spark Optimizer for Adaptive, Fine-Grained Parameter Tuning |
2024 |
VLDB |
4.4846425e-05 |
| 8,924 |
QMapper for Smart Grid: Migrating SQL-based Application to Hive |
2015 |
SIGMOD |
4.427232e-05 |
| 9,448 |
Cost-based Fault-tolerance for Parallel Data Processing |
2015 |
SIGMOD |
4.3401906e-05 |
| 9,584 |
Introduction to Spark 2.0 for Database Researchers |
2016 |
SIGMOD |
4.3218691e-05 |
| 10,404 |
Dynamic Pruning for Recursive Joins |
2025 |
SIGMOD |
4.1945683e-05 |
| 11,690 |
Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology |
2019 |
VLDB |
4.1945683e-05 |
| 11,831 |
Logical Aspects of Massively Parallel and Distributed Systems |
2016 |
PODS |
4.1945683e-05 |
| 11,882 |
Parallel Evaluation of Multi-Semi-Joins |
2016 |
VLDB |
4.1945683e-05 |