Database Paper Browser

Dremel: Interactive Analysis of Web-Scale Datasets

Summary: Dremel is a scalable, interactive query system for nested data that combines multilevel execution trees with a columnar layout to run aggregation queries. Its nested-columnar storage and tree-based execution enable petabyte-scale analysis on thousands of CPUs, complementing MapReduce. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10088
Venue: VLDB
Year: 2010
Pagerank: 0.00048186983
Overall Rank: 109 | 99.25%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 28 of 128 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
8,731 | Columnar Formats for Schemaless LSM-based Document Stores | 2022 | VLDB | 4.4577278e-05
8,884 | Workload Insights From The Snowflake Data Cloud: What Do Production Analytic Queries Really Look Like? | 2025 | VLDB | 4.4283999e-05
8,997 | Chasing Similarity: Distribution-aware Aggregation Scheduling | 2019 | VLDB | 4.4120041e-05
9,111 | Meta's Next-generation Realtime Monitoring and Analytics Platform | 2022 | VLDB | 4.3942367e-05
9,128 | Apache TsFile: An IoT-native Time Series File Format | 2024 | VLDB | 4.3909921e-05
9,138 | Management of Flexible Schema Data in RDBMSs - Opportunities and Limitations for NoSQL | 2015 | CIDR | 4.3869509e-05
9,201 | F3: The Open-Source Data File Format for the Future | 2026 | SIGMOD | 4.3743539e-05
9,387 | Overview of Turn Data Management Platform for Digital Advertising | 2013 | VLDB | 4.3443757e-05
9,401 | Vortex: A Stream-oriented Storage Engine For Big Data Analytics | 2024 | SIGMOD | 4.3441378e-05
9,699 | The Story of AWS Glue | 2023 | VLDB | 4.3018844e-05
9,702 | Evaluating Query Languages and Systems for High-Energy Physics Data | 2022 | VLDB | 4.3008468e-05
9,881 | VStream: A Distributed Streaming Vector Search System | 2025 | VLDB | 4.2643674e-05
9,894 | OceanRT: Real-Time Analytics over Large Temporal Data | 2014 | SIGMOD | 4.2602616e-05
9,901 | AnyBlox: A Framework for Self-Decoding Datasets | 2025 | VLDB | 4.258022e-05
9,944 | Out-of-order Execution of Database Queries | 2020 | VLDB | 4.2446672e-05
10,403 | CockroachDB Serverless: Sub-second Scaling from Zero with Multi-region Cluster Virtualization | 2025 | SIGMOD | 4.1945683e-05
10,486 | Rule-Based Graph Cleaning with GPUs on a Single Machine | 2025 | SIGMOD | 4.1945683e-05
10,494 | Nested Parquet Is Flat, Why Not Use It? How To Scan Nested Data With On-the-Fly Key Generation and Joins | 2025 | SIGMOD | 4.1945683e-05
10,763 | TuskFlow: An Efficient Graph Database for Long-Running Transactions | 2025 | VLDB | 4.1945683e-05
11,007 | Breathing New Life into An Old Tree: Resolving Logging Dilemma of B+-tree on Modern Computational Storage Drives | 2024 | VLDB | 4.1945683e-05
11,055 | Enhancing Accuracy for Super Spreader Identification in High-Speed Data Streams | 2024 | VLDB | 4.1945683e-05
11,067 | Partition, Don’t Sort! Compression Boosters for Cloud Data Ingestion Pipelines | 2024 | VLDB | 4.1945683e-05
11,150 | Zed: Leveraging Data Types to Process Eclectic Data | 2023 | CIDR | 4.1945683e-05
11,690 | Integration of Large-Scale Data Processing Systems and Traditional Parallel Database Technology | 2019 | VLDB | 4.1945683e-05
11,831 | Logical Aspects of Massively Parallel and Distributed Systems | 2016 | PODS | 4.1945683e-05
11,901 | Compact Summaries over Large Datasets | 2015 | PODS | 4.1945683e-05
12,005 | Design and Implementation of a Real-Time Interactive Analytics System for Large Spatio-Temporal Data | 2014 | VLDB | 4.1945683e-05
12,203 | Resiliency-Aware Data Management | 2011 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 7 of 7 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers