Database Paper Browser

NoDB: Efficient Query Execution on Raw Data Files

Summary: NoDB queries raw data files in situ, avoiding the up-front load step by making raw files first-class citizens of the query engine. Adaptive indexing over the raw files, combined with caching, cuts repeated parsing and type-conversion costs. The PostgresRaw prototype answers queries without loading while remaining competitive with PostgreSQL. (summarized by gpt-5-nano on Feb 09 2026)
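
The idea in the summary can be illustrated with a small sketch. This is not PostgresRaw code: the class `RawCsvTable`, its methods, and the sample data are hypothetical, and the positional map here only records row boundaries, which is a simplification. It assumes a delimiter-separated text file with no quoted fields. The first query on a column pays the tokenizing and conversion cost; later queries on that column are served from the cache.

```python
class RawCsvTable:
    """Toy in-situ reader (hypothetical): no load step, state is built as queries arrive."""

    def __init__(self, raw_text, delimiter=","):
        self.raw = raw_text
        self.delimiter = delimiter
        self.row_offsets = None   # simplified positional map: (start, end) of each row
        self.column_cache = {}    # (column index, converter) -> converted values
        self.parse_count = 0      # number of fields tokenized and converted so far

    def _build_row_offsets(self):
        # One pass over the raw text to remember where each row starts and ends.
        offsets, pos = [], 0
        for line in self.raw.splitlines(keepends=True):
            offsets.append((pos, pos + len(line.rstrip("\r\n"))))
            pos += len(line)
        self.row_offsets = offsets

    def column(self, index, convert=int):
        # Serve repeated requests from the cache instead of re-parsing.
        key = (index, convert)
        if key in self.column_cache:
            return self.column_cache[key]
        if self.row_offsets is None:
            self._build_row_offsets()
        values = []
        for start, end in self.row_offsets:
            field = self.raw[start:end].split(self.delimiter)[index]
            values.append(convert(field))
            self.parse_count += 1
        self.column_cache[key] = values
        return values


table = RawCsvTable("1,10,100\n2,20,200\n3,30,300\n")
total = sum(table.column(1))        # first query parses column 1 from the raw text
total_again = sum(table.column(1))  # second query hits the cache, no parsing
```

Only the columns a query touches are ever converted, so work is driven by the workload rather than by a bulk load.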

Paper ID: 4527
Venue: SIGMOD
Year: 2012
Pagerank: 0.00012482538
Overall Rank: 1,343 | 90.66%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 43 of 43 citing papers.

Rank Citing Paper Year Venue Pagerank
1,552 Overview of Data Exploration Techniques 2015 SIGMOD 0.00011408814
1,750 Weld: A Common Runtime for High Performance Data Analytics 2017 CIDR 0.00010683647
1,807 H2O: A Hands-free Adaptive Store 2014 SIGMOD 0.00010487796
1,840 dbTouch: Analytics at your Fingertips 2013 CIDR 0.0001034905
2,001 Sinew: A SQL System for Multi-Structured Data 2014 SIGMOD 9.8186417e-05
2,062 Dremel: A Decade of Interactive SQL Analysis at Web Scale 2020 VLDB 9.6481955e-05
2,122 SystemDS: A Declarative Machine Learning System for the End-to-End Data Science Lifecycle 2020 CIDR 9.4989076e-05
2,322 Instant Loading for Main Memory Databases 2013 VLDB 9.034874e-05
2,700 Filter Before You Parse: Faster Analytics on Raw Data with Sparser 2018 VLDB 8.2728509e-05
2,757 Parallel Data Analysis Directly on Scientific File Formats 2014 SIGMOD 8.1679384e-05
2,819 Mison: A Fast JSON Parser for Data Analytics 2017 VLDB 8.0651326e-05
2,973 Parallel In-Situ Data Processing with Speculative Loading 2014 SIGMOD 7.7902322e-05
3,343 Comparative Evaluation of Big-Data Systems on Scientific Image Analytics Workloads 2017 VLDB 7.1967343e-05
3,437 Speculative Distributed CSV Data Parsing for Big Data Analytics 2019 SIGMOD 7.0942161e-05
3,548 Adaptive Query Processing on RAW Data 2014 VLDB 6.9859242e-05
3,891 Slalom: Coasting Through Raw Data via Adaptive Partitioning and Indexing 2017 VLDB 6.659442e-05
3,940 NoDB in Action: Adaptive Query Processing on Raw Data 2012 VLDB 6.6153423e-05
4,326 Fast Queries Over Heterogeneous Data Through Engine Customization 2016 VLDB 6.288323e-05
4,602 Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture 2019 VLDB 6.0567387e-05
4,704 JSON Tiles: Fast Analytics on Semi-Structured Data 2021 SIGMOD 5.9853687e-05
5,301 ReCache: Reactive Caching for Fast Analytics over Heterogeneous Data 2018 VLDB 5.5790928e-05
5,376 Holistic Indexing in Main-memory Column-stores 2015 SIGMOD 5.5417421e-05
5,924 HMAB: Self-Driving Hierarchy of Bandits for Integrated Physical Database Design Tuning 2023 VLDB 5.2719183e-05
6,407 Just-In-Time Data Virtualization: Lightweight Data Management with ViDa 2015 CIDR 5.076547e-05
6,977 FishStore: Fast Ingestion and Indexing of Raw Data 2019 VLDB 4.8761802e-05
6,981 Dataset Relationship Management 2019 CIDR 4.8743957e-05
6,988 CrocodileDB: Efficient Database Execution through Intelligent Deferment 2020 CIDR 4.8718019e-05
7,237 CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning 2017 VLDB 4.7928651e-05
7,254 DEX: Query Execution in a Delta-based Storage System 2017 SIGMOD 4.7885915e-05
7,360 ParPaRaw: Massively Parallel Parsing of Delimiter-Separated Raw Data 2020 VLDB 4.7525925e-05
7,704 ExDRa: Exploratory Data Science on Federated Raw Data 2021 SIGMOD 4.6733838e-05
7,830 Scalable Structural Index Construction for JSON Analytics 2021 VLDB 4.6388763e-05
8,271 Rumble: Data Independence for Large Messy Data Sets 2021 VLDB 4.5453618e-05
8,788 FishStore: Faster Ingestion with Subset Hashing 2019 SIGMOD 4.451039e-05
9,052 RawVis: A System for Efficient In-situ Visual Analytics 2021 SIGMOD 4.4039656e-05
9,083 AT-GIS: Highly Parallel Spatial Query Processing with Associative Transducers 2016 SIGMOD 4.399861e-05
9,379 GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by Example 2023 SIGMOD 4.3462787e-05
9,918 Shared Load(ing): Efficient Bulk Loading into Optimized Storage 2020 CIDR 4.2561557e-05
10,482 Fast and Scalable Data Transfer Across Data Systems 2025 SIGMOD 4.1945683e-05
11,021 SeLeP: Learning Based Semantic Prefetching for Exploratory Database Workloads 2024 VLDB 4.1945683e-05
11,784 Alpine: Efficient In situ Data Exploration in the Presence of Updates 2017 SIGMOD 4.1945683e-05
11,850 Vectorizing an In Situ Query Engine 2016 SIGMOD 4.1945683e-05
12,073 Lazy ETL in Action: ETL Technology Dates Scientific Data 2013 VLDB 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 20 of 20 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
158 Automated Selection of Materialized Views and Indexes for SQL Databases 2000 VLDB 0.00040071492
168 MAD Skills: New Analysis Practices for Big Data 2009 VLDB 0.00038946305
237 An Efficient, Cost-Driven Index Selection Tool for Microsoft SQL Server 1997 VLDB 0.00031726304
257 Making Database Systems Usable 2007 SIGMOD 0.00030223397
258 DB2 Design Advisor: Integrated Automatic Physical Database Design 2004 VLDB 0.0003022091
286 Integrating Vertical and Horizontal Partitioning into Automated Physical Database Design 2004 SIGMOD 0.00028990057
346 Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources 1997 VLDB 0.00026656272
408 Database Cracking 2007 CIDR 0.00023953844
496 Automatic SQL Tuning in Oracle 10g 2004 VLDB 0.00021728655
661 Database Tuning Advisor for Microsoft SQL Server 2005 2004 VLDB 0.00018481174
928 Requirements for Science Data Bases and SciDB 2009 CIDR 0.00015247726
1,017 Automatic Physical Database Tuning: A Relaxation-based Approach 2005 SIGMOD 0.00014634307
1,907 Guided Interaction: Rethinking the Query-Result Paradigm 2011 VLDB 0.00010136636
2,017 The Researcher’s Guide to the Data Deluge: Querying a Scientific Database in Just a Few Seconds 2011 VLDB 9.7810458e-05
2,229 Self-organizing Tuple Reconstruction in Column-stores 2009 SIGMOD 9.2350274e-05
2,363 Merging What’s Cracked, Cracking What’s Merged: Adaptive Indexing in Main-Memory Column-Stores 2011 VLDB 8.9580928e-05
2,367 Here are my Data Files. Here are my Queries. Where are my Results? 2011 CIDR 8.9511058e-05
2,470 CoPhy: A Scalable, Portable, and Interactive Index Advisor for Large Workloads 2011 VLDB 8.7333019e-05
2,787 To Tune or not to Tune? A Lightweight Physical Design Alerter 2006 VLDB 8.1263608e-05
3,896 Updating a Cracked Database 2007 SIGMOD 6.6575888e-05

Semantically Similar Papers