Database Paper Browser

Back to papers

Jaql: A Scripting Language for Large Scale Semistructured Data Analysis

Summary: Jaql: declarative scripting for large-scale semistructured data on Hadoop MapReduce. JSON-like flexible model with schema-on-read; higher-order functions and reusable modules; varying abstraction with compiler rewrites for parallel execution. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
10190
Venue
VLDB
Year
2011
Pagerank
0.00012947629
Overall Rank
1,265 | 91.21%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 27 of 27 citing papers.

Rank Citing Paper Year Venue Pagerank
1,402 Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML 2014 VLDB 0.00012180605
2,001 Sinew: A SQL System for Multi-Structured Data 2014 SIGMOD 9.8186417e-05
2,205 ReStore: Reusing Results of MapReduce Jobs 2012 VLDB 9.2920002e-05
2,611 Opening the Black Boxes in Data Flow Optimization 2012 VLDB 8.4536967e-05
2,674 Minimal MapReduce Algorithms 2013 SIGMOD 8.3328645e-05
2,773 JSON Data Management – Supporting Schema-less Development in RDBMS 2014 SIGMOD 8.1386587e-05
2,818 Implicit Parallelism through Deep Language Embedding 2015 SIGMOD 8.0665558e-05
2,819 Mison: A Fast JSON Parser for Data Analytics 2017 VLDB 8.0651326e-05
3,504 M3R: Increased Performance for In-Memory Hadoop Jobs 2012 VLDB 7.0347515e-05
3,710 Optimizing Analytic Data Flows for Multiple Execution Engines 2012 SIGMOD 6.8238962e-05
4,061 Advanced Partitioning Techniques for Massively Distributed Computation 2012 SIGMOD 6.483587e-05
4,326 Fast Queries Over Heterogeneous Data Through Engine Customization 2016 VLDB 6.288323e-05
5,014 Dynamically Optimizing Queries over Large Scale Data Platforms 2014 SIGMOD 5.7586174e-05
5,297 Continuous Cloud-Scale Query Optimization and Processing 2013 VLDB 5.5801669e-05
5,453 Semistructured Models, Queries and Algebras in the Big Data Era 2016 SIGMOD 5.4989459e-05
5,595 Schemas and Types for JSON Data: from Theory to Practice 2019 SIGMOD 5.4191724e-05
8,271 Rumble: Data Independence for Large Messy Data Sets 2021 VLDB 4.5453618e-05
8,429 Handling Environments in a Nested Relational Algebra with Combinators and an Implementation in a Verified Query Compiler 2017 SIGMOD 4.5156925e-05
8,924 QMapper for Smart Grid: Migrating SQL-based Application to Hive 2015 SIGMOD 4.427232e-05
9,376 Versatile Optimization of UDF-heavy Data Flows with Sofa 2014 SIGMOD 4.347376e-05
9,379 GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by Example 2023 SIGMOD 4.3462787e-05
9,535 Graph Data Models, Query Languages and Programming Paradigms 2018 VLDB 4.3265843e-05
9,750 ReCG: Bottom-Up JSON Schema Discovery Using a Repetitive Cluster-and-Generalize Framework 2024 VLDB 4.2897489e-05
11,189 dsJSON: A Distributed SQL JSON Processor 2023 SIGMOD 4.1945683e-05
12,062 Next Generation Data Analytics at IBM Research 2013 VLDB 4.1945683e-05
12,109 Declarative Error Management for Robust Data-Intensive Applications 2012 SIGMOD 4.1945683e-05
12,121 Surfacing Time-critical Insights from Social Media 2012 SIGMOD 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 11 of 11 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers