HAWQ: A Massively Parallel Processing SQL Engine in Hadoop
Summary: MPP SQL on HDFS combining DBMS-style parallelism with Hadoop; standard SQL with full ACID transactions. UDP interconnect, fault tolerance, read-optimized storage, and extensible data-store support for Hadoop formats; ~40x Stinger, ~35–45x Hive. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Lei Chang
- 2. Zhanwei Wang
- 3. Tao Ma
- 4. Lirong Jian
- 5. Lili Ma
- 6. Alon Goldshuv
- 7. Luke Lonergan
- 8. Jeffrey Cohen
- 9. Caleb Welton
- 10. Gavin Sherry
- 11. Milind Bhandarkar
Incoming Citations (Sorted by Pagerank)
Showing 10 of 10 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,127 | SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures | 2014 | VLDB | 9.4863172e-05 |
| 4,368 | Evolving Databases for New-Gen Big Data Applications | 2017 | CIDR | 6.2491345e-05 |
| 5,441 | Using Cloud Functions as Accelerator for Elastic Data Analytics | 2023 | SIGMOD | 5.5028093e-05 |
| 5,535 | Lightweight Cardinality Estimation in LSM-based Systems | 2018 | SIGMOD | 5.4539235e-05 |
| 6,264 | VectorH: Taking SQL-on-Hadoop to the Next Level | 2016 | SIGMOD | 5.1348427e-05 |
| 6,407 | Just-In-Time Data Virtualization: Lightweight Data Management with ViDa | 2015 | CIDR | 5.076547e-05 |
| 8,483 | Optimization of Common Table Expressions in MPP Database Systems | 2015 | VLDB | 4.5008949e-05 |
| 11,690 | Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology | 2019 | VLDB | 4.1945683e-05 |
| 11,845 | Datometry Hyper-Q: Bridging the Gap Between Real-Time and Historical Analytics | 2016 | SIGMOD | 4.1945683e-05 |
| 11,948 | Tutorial: SQL-on-Hadoop Systems | 2015 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 20 of 20 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,998 | Major Technical Advancements in Apache Hive | 2014 | SIGMOD | 7.753765e-05 |
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 476 | Impala: A Modern, Open-Source SQL Engine for Hadoop | 2015 | CIDR | 0.00022226941 |
| 8,924 | QMapper for Smart Grid: Migrating SQL-based Application to Hive | 2015 | SIGMOD | 4.427232e-05 |
| 1,261 | Hadoop-GIS: A High Performance Spatial Data Warehousing System over MapReduce | 2013 | VLDB | 0.00012989236 |
| 3,973 | Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing | 2019 | SIGMOD | 6.5758017e-05 |
| 70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166 |
| 6,264 | VectorH: Taking SQL-on-Hadoop to the Next Level | 2016 | SIGMOD | 5.1348427e-05 |
| 2,337 | Efficient Processing of Data Warehousing Queries in a Split Execution Environment | 2011 | SIGMOD | 9.0098186e-05 |
| 2,127 | SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures | 2014 | VLDB | 9.4863172e-05 |