Database Paper Browser

Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology

Summary: Hybrid SQL analytics unites MapReduce/Hadoop with parallel DBMSs (Greenplum/Vertica); the HadoopDB prototype shows strong SQL performance, scalability, and fault tolerance. Tracing a decade from research to Hadapt and Teradata, the paper surveys the open-source ecosystems that sustain the integrated data-processing and DBMS paradigm. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 11949
Venue: VLDB
Year: 2019
Pagerank: 4.1945683e-05
Overall Rank: 11,690 | 18.68%
DOI: 10.14778/3352063.3352145

Incoming Non-self Citations Over Time

No non-self incoming citations found for this paper in this database.

Authors

Incoming Citations (Sorted by Pagerank)

Showing 0 of 0 citing papers.


Outgoing Citations (Sorted by Pagerank)

Showing 20 of 20 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
3 | Pig Latin: A Not-So-Foreign Language for Data Processing | 2008 | SIGMOD | 0.0024183614
22 | SCOPE: Easy and Efficient Parallel Processing of Massive Data Sets | 2008 | VLDB | 0.0008456613
42 | A Comparison of Approaches to Large-Scale Data Analysis | 2009 | SIGMOD | 0.00073498298
66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801
70 | Hive - A Warehousing Solution Over a Map-Reduce Framework | 2009 | VLDB | 0.00059533166
80 | Weaving Relations for Cache Performance | 2001 | VLDB | 0.00055721729
109 | Dremel: Interactive Analysis of Web-Scale Datasets | 2010 | VLDB | 0.00048186983
157 | HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads | 2009 | VLDB | 0.00040397359
476 | Impala: A Modern, Open-Source SQL Engine for Hadoop | 2015 | CIDR | 0.00022226941
542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648
544 | Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources | 2018 | SIGMOD | 0.00020521965
582 | Scalable SPARQL Querying of Large RDF Graphs | 2011 | VLDB | 0.00019723083
2,001 | Sinew: A SQL System for Multi-Structured Data | 2014 | SIGMOD | 9.8186417e-05
2,269 | Ground: A Data Context Service | 2017 | CIDR | 9.147379e-05
2,337 | Efficient Processing of Data Warehousing Queries in a Split Execution Environment | 2011 | SIGMOD | 9.0098186e-05
2,595 | LEOPARD: Lightweight Edge-Oriented Partitioning and Replication for Dynamic Graphs | 2016 | VLDB | 8.4735292e-05
3,066 | HAWQ: A Massively Parallel Processing SQL Engine in Hadoop | 2014 | SIGMOD | 7.6221974e-05
3,973 | Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing | 2019 | SIGMOD | 6.5758017e-05
4,188 | Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications | 2015 | SIGMOD | 6.3753681e-05
4,489 | Automatic Generation of Normalized Relational Schemas from Nested Key-Value Data | 2016 | SIGMOD | 6.1434237e-05

Semantically Similar Papers