Tutorial: SQL-on-Hadoop Systems
Summary: Tutorial surveying SQL-on-Hadoop systems for declarative SQL over HDFS/NoSQL data. Contrasts Hadoop ecosystems with traditional warehouses, covers architectures, formats, UDFs, and runtimes; compares Impala, Presto, Big SQL, Hive/Tez, SparkSQL; notes open problems in performance, flexibility, and integration. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
No non-self incoming citations found for this paper in this database.
Authors
1. Daniel Abadi
2. Shivnath Babu
3. Fatma Özcan
4. Ippokratis Pandis
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,792 | Hybrid Transactional/Analytical Processing: A Survey | 2017 | SIGMOD | 0.00010537893 |
| 4,368 | Evolving Databases for New-Gen Big Data Applications | 2017 | CIDR | 6.2491345e-05 |
| 13,225 | Reflections On My Data Management Research Journey (VLDB Women in Database Research Award Talk) | 2022 | VLDB | - |
Outgoing Citations (Sorted by Pagerank)
Showing 9 of 9 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359 |
| 476 | Impala: A Modern, Open-Source SQL Engine for Hadoop | 2015 | CIDR | 0.00022226941 |
| 542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648 |
| 1,869 | WinMagic: Subquery Elimination Using Window Aggregation | 2003 | SIGMOD | 0.00010265836 |
| 2,001 | Sinew: A SQL System for Multi-Structured Data | 2014 | SIGMOD | 9.8186417e-05 |
| 2,337 | Efficient Processing of Data Warehousing Queries in a Split Execution Environment | 2011 | SIGMOD | 9.0098186e-05 |
| 3,066 | HAWQ: A Massively Parallel Processing SQL Engine in Hadoop | 2014 | SIGMOD | 7.6221974e-05 |
| 4,188 | Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications | 2015 | SIGMOD | 6.3753681e-05 |