Back to papers
Adaptive Query Processing on RAW Data
Summary: Adapts the engine to raw data formats instead of forcing loading. RAW maps Just-In-Time access paths and column shreds to heterogeneous raw data, enabling transparent querying with reduced conversion costs and 100x speedup vs handcrafted solutions.
(summarized by gpt-5-nano on Feb 09 2026)
- Paper ID
- 10772
- Venue
- VLDB
- Year
- 2014
- Pagerank
- 6.9859242e-05
- Overall Rank
- 3,548 | 75.32%
- DOI
-
-
Incoming Non-self Citations Over Time
Incoming Citations (Sorted by Pagerank)
Showing 23 of 23 citing papers.
| Rank |
Citing Paper |
Year |
Venue |
Pagerank |
| 2,390 |
ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout |
2015 |
SIGMOD |
8.9084657e-05 |
| 2,700 |
Filter Before You Parse: Faster Analytics on Raw Data with Sparser |
2018 |
VLDB |
8.2728509e-05 |
| 2,838 |
How to Architect a Query Compiler, Revisited |
2018 |
SIGMOD |
8.0408472e-05 |
| 3,437 |
Speculative Distributed CSV Data Parsing for Big Data Analytics |
2019 |
SIGMOD |
7.0942161e-05 |
| 3,891 |
Slalom: Coasting Through Raw Data via Adaptive Partitioning and Indexing |
2017 |
VLDB |
6.659442e-05 |
| 4,326 |
Fast Queries Over Heterogeneous Data Through Engine Customization |
2016 |
VLDB |
6.288323e-05 |
| 4,602 |
Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture |
2019 |
VLDB |
6.0567387e-05 |
| 4,770 |
The Case For Heterogeneous HTAP |
2017 |
CIDR |
5.9338845e-05 |
| 5,301 |
ReCache: Reactive Caching for Fast Analytics over Heterogeneous Data |
2018 |
VLDB |
5.5790928e-05 |
| 6,407 |
Just-In-Time Data Virtualization: Lightweight Data Management with ViDa |
2015 |
CIDR |
5.076547e-05 |
| 7,237 |
CleanM: An Optimizable Query Language for Unified Scale-Out Data Cleaning |
2017 |
VLDB |
4.7928651e-05 |
| 7,360 |
ParPaRaw: Massively Parallel Parsing of Delimiter-Separated Raw Data |
2020 |
VLDB |
4.7525925e-05 |
| 7,691 |
Bringing Compiling Databases to RISC Architectures |
2023 |
VLDB |
4.6762283e-05 |
| 7,830 |
Scalable Structural Index Construction for JSON Analytics |
2021 |
VLDB |
4.6388763e-05 |
| 9,052 |
RawVis: A System for Efficient In-situ Visual Analytics |
2021 |
SIGMOD |
4.4039656e-05 |
| 9,187 |
POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance |
2024 |
VLDB |
4.3780059e-05 |
| 9,379 |
GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by Example |
2023 |
SIGMOD |
4.3462787e-05 |
| 9,655 |
FlashView: An Interactive Visual Explorer for Raw Data |
2017 |
VLDB |
4.3109001e-05 |
| 9,702 |
Evaluating Query Languages and Systems for High-Energy Physics Data |
2022 |
VLDB |
4.3008468e-05 |
| 10,482 |
Fast and Scalable Data Transfer Across Data Systems |
2025 |
SIGMOD |
4.1945683e-05 |
| 11,784 |
Alpine: Efficient In situ Data Exploration in the Presence of Updates |
2017 |
SIGMOD |
4.1945683e-05 |
| 11,850 |
Vectorizing an In Situ Query Engine |
2016 |
SIGMOD |
4.1945683e-05 |
| 11,950 |
Databases and Hardware: The Beginning and Sequel of a Beautiful Friendship |
2015 |
VLDB |
4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 14 of 14 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Semantically Similar Papers
| Overall Rank |
Paper |
Year |
Venue |
Pagerank |
| 13,040 |
Query Processing on Personal Computers: A Pragmatic Approach (Extended Abstract) |
1984 |
VLDB |
4.1945683e-05 |
| 4,326 |
Fast Queries Over Heterogeneous Data Through Engine Customization |
2016 |
VLDB |
6.288323e-05 |
| 3,891 |
Slalom: Coasting Through Raw Data via Adaptive Partitioning and Indexing |
2017 |
VLDB |
6.659442e-05 |
| 5,825 |
Adaptive Query Processing: Why, How, When, What Next |
2006 |
SIGMOD |
5.3126934e-05 |
| 3,330 |
Adapting to Source Properties in Processing Data Integration Queries |
2004 |
SIGMOD |
7.2150831e-05 |
| 3,940 |
NoDB in Action: Adaptive Query Processing on Raw Data |
2012 |
VLDB |
6.6153423e-05 |
| 11,850 |
Vectorizing an In Situ Query Engine |
2016 |
SIGMOD |
4.1945683e-05 |
| 2,367 |
Here are my Data Files. Here are my Queries. Where are my Results? |
2011 |
CIDR |
8.9511058e-05 |
| 2,973 |
Parallel In-Situ Data Processing with Speculative Loading |
2014 |
SIGMOD |
7.7902322e-05 |
| 1,343 |
NoDB: Efficient Query Execution on Raw Data Files |
2012 |
SIGMOD |
0.00012482538 |