Database Paper Browser

Back to papers

Dremel: Interactive Analysis of Web-Scale Datasets

Summary: Dremel: scalable interactive query system for nested data, using multilevel execution trees and columnar layout for aggregations. Nested-columnar storage and tree-based execution enable petabyte-scale analysis on thousands of CPUs, complementing MapReduce. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10088
Venue
VLDB
Year
2010
Pagerank
0.00048186983
Overall Rank
109 | 99.25%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 128 citing papers.

Rank Citing Paper Year Venue Pagerank
66 Spark SQL: Relational Data Processing in Spark 2015 SIGMOD 0.00061639801
167 The Snowflake Elastic Data Warehouse 2016 SIGMOD 0.00039180521
476 Impala: A Modern, Open-Source SQL Engine for Hadoop 2015 CIDR 0.00022226941
542 Shark: SQL and Rich Analytics at Scale 2013 SIGMOD 0.00020595648
544 Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources 2018 SIGMOD 0.00020521965
610 Goods: Organizing Google's Datasets 2016 SIGMOD 0.00019232674
913 Tenzing A SQL Implementation On The MapReduce Framework 2011 VLDB 0.00015408131
979 Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads 2012 VLDB 0.0001488055
1,015 Spanner: Becoming a SQL System 2017 SIGMOD 0.00014638696
1,110 Parallel Evaluation of Conjunctive Queries 2011 PODS 0.00013968198
1,323 Quickr: Lazily Approximating Complex AdHoc Queries in BigData Clusters 2016 SIGMOD 0.00012601997
1,411 Communication Steps for Parallel Query Processing 2013 PODS 0.0001212565
1,470 Processing a Trillion Cells per Mouse Click 2012 VLDB 0.00011833779
1,477 Fine-grained Partitioning for Aggressive Data Skipping 2014 SIGMOD 0.00011770865
1,487 Scuba: Diving into Data at Facebook 2013 VLDB 0.00011701099
1,574 Approximate Query Processing: No Silver Bullet 2017 SIGMOD 0.00011287495
1,588 Druid: A Real-time Analytical Data Store 2014 SIGMOD 0.00011239313
1,814 Mesa: Geo-Replicated, Near Real-Time, Scalable Data Warehousing 2014 VLDB 0.00010458107
1,939 From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System 2015 SIGMOD 0.00010025655
1,943 Procella: Unifying serving and analytical data at YouTube 2019 VLDB 0.00010012569
2,001 Sinew: A SQL System for Multi-Structured Data 2014 SIGMOD 9.8186417e-05
2,062 Dremel: A Decade of Interactive SQL Analysis at Web Scale 2020 VLDB 9.6481955e-05
2,099 Axiomatic Foundations and Algorithms for Deciding Semantic Equivalences of SQL Queries 2018 VLDB 9.5479391e-05
2,127 SQL-on-Hadoop: Full Circle Back to Shared-Nothing Database Architectures 2014 VLDB 9.4863172e-05
2,154 DIFF: A Relational Interface for Large-Scale Data Explanation 2019 VLDB 9.4208667e-05
2,249 Orca: A Modular Query Optimizer Architecture for Big Data 2014 SIGMOD 9.2034693e-05
2,262 Manu: A Cloud Native Vector Database Management System 2022 VLDB 9.1624446e-05
2,545 POLARIS: The Distributed SQL Engine in Azure Synapse 2020 VLDB 8.5725413e-05
2,674 Minimal MapReduce Algorithms 2013 SIGMOD 8.3328645e-05
2,716 Davos: A System for Interactive Data-Driven Decision Making 2021 VLDB 8.2429172e-05
2,819 Mison: A Fast JSON Parser for Data Analytics 2017 VLDB 8.0651326e-05
2,998 Major Technical Advancements in Apache Hive 2014 SIGMOD 7.753765e-05
3,058 Rethinking Data-Intensive Science Using Scalable Analytics Systems 2015 SIGMOD 7.6410159e-05
3,066 HAWQ: A Massively Parallel Processing SQL Engine in Hadoop 2014 SIGMOD 7.6221974e-05
3,115 Llama: Leveraging Columnar Storage for Scalable Join Processing in the MapReduce Framework 2011 SIGMOD 7.543505e-05
3,152 AnalyticDB: Real-time OLAP Database System at Alibaba Cloud 2019 VLDB 7.4711766e-05
3,208 Column-Oriented Storage Techniques for MapReduce 2011 VLDB 7.3781897e-05
3,355 F1 Query: Declarative Querying at Scale 2018 VLDB 7.1829142e-05
3,437 Speculative Distributed CSV Data Parsing for Big Data Analytics 2019 SIGMOD 7.0942161e-05
3,548 Adaptive Query Processing on RAW Data 2014 VLDB 6.9859242e-05
3,644 BtrBlocks: Efficient Columnar Compression for Data Lakes 2023 SIGMOD 6.8854928e-05
3,737 Skipping-oriented Partitioning for Columnar Layouts 2017 VLDB 6.8033227e-05
3,763 Flexible Rule-Based Decomposition and Metadata Independence in Modin: A Parallel Dataframe System 2022 VLDB 6.7801795e-05
3,768 F1 Lightning: HTAP as a Service 2020 VLDB 6.7782774e-05
3,789 DIAMetrics: Benchmarking Query Engines at Scale 2020 VLDB 6.7644737e-05
3,891 Slalom: Coasting Through Raw Data via Adaptive Partitioning and Indexing 2017 VLDB 6.659442e-05
3,944 AQP++: Connecting Approximate Query Processing With Aggregate Precomputation for Interactive Analytics 2018 SIGMOD 6.6078243e-05
3,973 Apache Hive: From MapReduce to Enterprise-grade Big Data Warehousing 2019 SIGMOD 6.5758017e-05
4,061 Advanced Partitioning Techniques for Massively Distributed Computation 2012 SIGMOD 6.483587e-05
4,188 Apache Tez: A Unifying Framework for Modeling and Building Data Processing Applications 2015 SIGMOD 6.3753681e-05
Previous Page 1 / 3 Next

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers