Database Paper Browser

Building Efficient Query Engines in a High-Level Language

Summary: LegoBase is a query engine written in Scala that emits specialized C code via generative programming, combining a high-level design with low-level performance. By avoiding brittle template expansion, it enables cross-operator optimizations and runtime adaptivity. On TPC-H it outperforms both an in-memory database system and a query compiler, using only a few hundred lines of high-level code. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10963
Venue: VLDB
Year: 2014
Pagerank: 0.00017900583
Overall Rank: 704 | 95.11%
DOI: -

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 45 of 45 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801
1,552 | Overview of Data Exploration Techniques | 2015 | SIGMOD | 0.00011408814
1,750 | Weld: A Common Runtime for High Performance Data Analytics | 2017 | CIDR | 0.00010683647
1,864 | Relaxed Operator Fusion for In-Memory Databases: Making Compilation, Vectorization, and Prefetching Work Together At Last | 2018 | VLDB | 0.00010280966
1,873 | An Architecture for Compiling UDF-centric Workflows | 2015 | VLDB | 0.00010253002
1,882 | Tuplex: Data Science in Python at Native Code Speed | 2021 | SIGMOD | 0.0001021625
2,014 | Voodoo - A Vector Algebra for Portable Database Performance on Modern Hardware | 2016 | VLDB | 9.7904029e-05
2,126 | MacroBase: Prioritizing Attention in Fast Data | 2017 | SIGMOD | 9.4887794e-05
2,157 | The Data Calculator*: Data Structure Design and Cost Synthesis from First Principles and Learned Cost Models | 2018 | SIGMOD | 9.416022e-05
2,287 | Pipelined Query Processing in Coprocessor Environments | 2018 | SIGMOD | 9.0972606e-05
2,390 | ByteSlice: Pushing the Envelop of Main Memory Data Processing with a New Storage Layout | 2015 | SIGMOD | 8.9084657e-05
2,418 | Tupleware: "Big" Data, Big Analytics, Small Clusters | 2015 | CIDR | 8.8556595e-05
2,838 | How to Architect a Query Compiler, Revisited | 2018 | SIGMOD | 8.0408472e-05
2,896 | Evaluating End-to-End Optimization for Data Analytics Applications in Weld | 2018 | VLDB | 7.9452051e-05
3,704 | How to Win a Hot Dog Eating Contest: Distributed Incremental View Maintenance with Batch Updates | 2016 | SIGMOD | 6.827494e-05
3,918 | On Optimizing Operator Fusion Plans for Large-Scale Machine Learning in SystemML | 2018 | VLDB | 6.6315176e-05
4,326 | Fast Queries Over Heterogeneous Data Through Engine Customization | 2016 | VLDB | 6.288323e-05
4,505 | SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning | 2017 | CIDR | 6.1327108e-05
4,642 | Fast Scans on Key-Value Stores | 2017 | VLDB | 6.0267293e-05
4,677 | Automatically Leveraging MapReduce Frameworks for Data-Intensive Applications | 2018 | SIGMOD | 6.0047822e-05
4,770 | The Case For Heterogeneous HTAP | 2017 | CIDR | 5.9338845e-05
4,948 | Designing an Open Framework for Query Optimization and Compilation | 2022 | VLDB | 5.8116879e-05
5,530 | Permutable Compiled Queries: Dynamically Adapting Compiled Queries without Recompiling | 2021 | VLDB | 5.4554282e-05
5,731 | Babelfish: Efficient Execution of Polyglot Queries | 2022 | VLDB | 5.3502065e-05
6,245 | Iterative Query Processing based on Unified Optimization Techniques | 2019 | SIGMOD | 5.1391202e-05
6,287 | What Makes a Good Physical Plan? Experiencing Hardware-Conscious Query Optimization with Candomble | 2016 | SIGMOD | 5.127646e-05
6,407 | Just-In-Time Data Virtualization: Lightweight Data Management with ViDa | 2015 | CIDR | 5.076547e-05
6,456 | From Auto-tuning One Size Fits All to Self-designed and Learned Data-intensive Systems | 2019 | SIGMOD | 5.0564619e-05
6,525 | Database Technology for the Masses: Sub-Operators as First-Class Entities | 2021 | VLDB | 5.027205e-05
6,648 | Grizzly: Efficient Stream Processing Through Adaptive Query Compilation | 2020 | SIGMOD | 4.9771723e-05
6,673 | Incorporating Super-Operators in Big-Data Query Optimizers | 2020 | VLDB | 4.966799e-05
6,863 | Declarative Sub-Operators for Universal Data Processing | 2023 | VLDB | 4.905092e-05
7,476 | Lachesis: Automatic Partitioning for UDF-Centric Analytics | 2021 | VLDB | 4.7188928e-05
7,925 | Architecting a Query Compiler for Spatial Workloads | 2020 | SIGMOD | 4.6153403e-05
8,094 | Modularis: Modular Relational Analytics over Heterogeneous Distributed Platforms | 2021 | VLDB | 4.5867812e-05
8,626 | Adaptive Code Generation for Data-Intensive Analytics | 2021 | VLDB | 4.4829152e-05
8,880 | Live Programming in the LogicBlox System: A MetaLogiQL Approach | 2015 | VLDB | 4.4301403e-05
9,029 | Provenance for SQL through Abstract Interpretation: Value-less, but Worthwhile | 2015 | VLDB | 4.4040532e-05
9,289 | In-Browser Interactive SQL Analytics with Afterburner | 2017 | SIGMOD | 4.362197e-05
9,332 | PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development | 2018 | SIGMOD | 4.3556432e-05
9,823 | Thriving in the No Man’s Land between Compilers and Databases | 2019 | CIDR | 4.2754485e-05
10,897 | Welding Natural Language Queries to Analytics IRs with LLMs | 2024 | CIDR | 4.1945683e-05
10,969 | Query Compilation Without Regrets | 2024 | SIGMOD | 4.1945683e-05
11,148 | mutable: A Modern DBMS for Research and Fast Prototyping | 2023 | CIDR | 4.1945683e-05
11,950 | Databases and Hardware: The Beginning and Sequel of a Beautiful Friendship | 2015 | VLDB | 4.1945683e-05

Outgoing Citations (Sorted by Pagerank)

Showing 10 of 10 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Semantically Similar Papers