Database Paper Browser


Dremel: Interactive Analysis of Web-Scale Datasets

Summary: Dremel is a scalable, interactive query system for nested data that uses multilevel execution trees and a columnar layout to run aggregation queries. Its nested-columnar storage and tree-based execution enable petabyte-scale analysis on thousands of CPUs, complementing MapReduce. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10088
Venue: VLDB
Year: 2010
Pagerank: 0.00048186983
Overall Rank: 109 | 99.25%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 128 citing papers.

Rank Citing Paper Year Venue Pagerank
4,326 Fast Queries Over Heterogeneous Data Through Engine Customization 2016 VLDB 6.288323e-05
4,451 CLAMShell: Speeding up Crowds for Low-latency Data Labeling 2016 VLDB 6.1738675e-05
4,514 An Empirical Evaluation of Columnar Storage Formats 2024 VLDB 6.1204636e-05
4,530 Big Metadata: When Metadata is Big Data 2021 VLDB 6.1075429e-05
4,549 Database-Agnostic Workload Management 2019 CIDR 6.0926728e-05
4,641 VIVA: An End-to-End System for Interactive Video Analytics 2022 CIDR 6.027004e-05
4,667 FlexPushdownDB: Hybrid Pushdown and Caching in a Cloud DBMS 2021 VLDB 6.0116919e-05
4,670 Napa: Powering Scalable Data Warehousing with Robust Query Performance at Google 2021 VLDB 6.0104466e-05
4,689 Algorithmic Aspects of Parallel Query Processing 2018 SIGMOD 5.9980099e-05
4,704 JSON Tiles: Fast Analytics on Semi-Structured Data 2021 SIGMOD 5.9853687e-05
4,767 Pinot: Realtime OLAP for 530 Million Users 2018 SIGMOD 5.9364731e-05
4,870 Exploiting Cloud Object Storage for High-Performance Analytics 2023 VLDB 5.8613885e-05
4,905 Randomized Error Removal for Online Spread Estimation in Data Streaming 2021 VLDB 5.8398332e-05
5,120 Fast Database Restarts at Facebook 2014 SIGMOD 5.6803959e-05
5,297 Continuous Cloud-Scale Query Optimization and Processing 2013 VLDB 5.5801669e-05
5,301 ReCache: Reactive Caching for Fast Analytics over Heterogeneous Data 2018 VLDB 5.5790928e-05
5,441 Using Cloud Functions as Accelerator for Elastic Data Analytics 2023 SIGMOD 5.5028093e-05
5,453 Semistructured Models, Queries and Algebras in the Big Data Era 2016 SIGMOD 5.4989459e-05
5,531 Presto: A Decade of SQL Analytics at Meta 2023 SIGMOD 5.4549499e-05
5,532 A Padded Encoding Scheme to Accelerate Scans by Leveraging Skew 2015 SIGMOD 5.4548897e-05
5,562 A Deep Dive into Common Open Formats for Analytical DBMSs 2023 VLDB 5.4331334e-05
5,727 Enabling Incremental Query Re-Optimization 2016 SIGMOD 5.3510544e-05
5,790 AQWA: Adaptive Query-Workload-Aware Partitioning of Big Spatial Data 2015 VLDB 5.3269734e-05
6,136 Scalable Progressive Analytics on Big Data in the Cloud 2013 VLDB 5.1928748e-05
6,231 An LSM-based Tuple Compaction Framework for Apache AsterixDB 2020 VLDB 5.1457863e-05
6,264 VectorH: Taking SQL-on-Hadoop to the Next Level 2016 SIGMOD 5.1348427e-05
6,298 Hillview: A trillion-cell spreadsheet for big data 2019 VLDB 5.1226987e-05
6,302 Diva: Making MVCC Systems HTAP-Friendly 2022 SIGMOD 5.1215989e-05
6,340 Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine 2024 SIGMOD 5.1051018e-05
6,402 BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse 2024 SIGMOD 5.079818e-05
6,407 Just-In-Time Data Virtualization: Lightweight Data Management with ViDa 2015 CIDR 5.076547e-05
6,658 Scalable Querying of Nested Data 2021 VLDB 4.9711629e-05
6,674 Exploiting Common Patterns for Tree-Structured Data 2017 SIGMOD 4.9663344e-05
6,870 Stat! - An Interactive Analytics Environment for Big Data 2013 SIGMOD 4.9004414e-05
7,007 Closing the Functional and Performance Gap between SQL and NoSQL 2016 SIGMOD 4.8653116e-05
7,067 JetScope: Reliable and Interactive Analytics at Cloud Scale 2015 VLDB 4.8440936e-05
7,171 Leveraging Compression in the Tableau Data Engine 2014 SIGMOD 4.8117476e-05
7,296 Multi-Tenant Cloud Data Services: State-of-the-Art, Challenges and Opportunities 2022 SIGMOD 4.7723197e-05
7,350 STEED: An Analytical Database System for TrEE-structured Data 2017 VLDB 4.754748e-05
7,387 Bubble Execution: Resource-aware Reliable Analytics at Cloud Scale 2018 VLDB 4.7438193e-05
7,427 Selection Pushdown in Column Stores using Bit Manipulation Instructions 2023 SIGMOD 4.7327406e-05
7,429 CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases 2022 SIGMOD 4.7320139e-05
7,534 Enabling Efficient and General Subpopulation Analytics in Multidimensional Data Streams 2022 VLDB 4.7180004e-05
7,554 Storing and Querying Tree-Structured Records in Dremel 2014 VLDB 4.712434e-05
8,088 PIDS: Attribute Decomposition for Improved Compression and Query Performance in Columnar Storage 2020 VLDB 4.5897316e-05
8,173 Sigma Workbook: A Spreadsheet for Cloud Data Warehouses 2022 VLDB 4.568186e-05
8,464 Piranha: Optimizing Short Jobs in Hadoop 2013 VLDB 4.5052127e-05
8,519 Extending Polaris to Support Transactions 2024 SIGMOD 4.494088e-05
8,599 Bias-Aware Sketches 2017 VLDB 4.4879268e-05
8,719 Native JSON Datatype Support: Maturing SQL and NoSQL convergence in Oracle Database 2020 VLDB 4.4612589e-05

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers