Rumble: Data Independence for Large Messy Data Sets
Summary: Rumble delivers data independence for large, nested JSON on Spark by compiling JSONiq into an iterator tree that switches between local and distributed execution. Bridging JSON nesting with Spark primitives, it overcomes impedance mismatch, scales to terabytes, and demonstrates Codd-like independence for heterogeneous data. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Ingo Müller
- 2. Ghislain Fourny
- 3. Stefan Irimescu
- 4. Can Berker Cikis
- 5. Gustavo Alonso
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 9,379 | GIO: Generating Efficient Matrix and Frame Readers for Custom Data Formats by Example | 2023 | SIGMOD | 4.3462787e-05 |
| 9,702 | Evaluating Query Languages and Systems for High-Energy Physics Data | 2022 | VLDB | 4.3008468e-05 |
| 11,513 | TraNCE: Transforming Nested Collections Efficiently | 2021 | VLDB | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 5 of 5 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,265 | Jaql: A Scripting Language for Large Scale Semistructured Data Analysis | 2011 | VLDB | 0.00012947629 |
| 1,343 | NoDB: Efficient Query Execution on Raw Data Files | 2012 | SIGMOD | 0.00012482538 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 2,280 | SMOKE: Fine-grained Lineage at Interactive Speed | 2018 | VLDB | 9.1111033e-05 |
| 7,794 | Large-scale Complex Analytics on Semi-structured Datasets using AsterixDB and Spark | 2016 | VLDB | 4.6482977e-05 |
Previous
Page 1 / 1
Next