Large-scale Complex Analytics on Semi-structured Datasets using AsterixDB and Spark
Summary: Parallel Spark–AsterixDB integration enables large-scale complex analytics on semi-structured data. It fuses AsterixDB’s fast ingestion, indexing, geo-spatial and fuzzy-text querying with Spark ML and graph libraries for end-to-end analytics at scale. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Wail Y. Alkowaileet
- 2. Sattam Alsubaiee
- 3. Michael J. Carey
- 4. Till Westmann
- 5. Yingyi Bu
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,271 | Rumble: Data Independence for Large Messy Data Sets | 2021 | VLDB | 4.5453618e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 413 | HaLoop: Efficient Iterative Data Processing on Large Clusters | 2010 | VLDB | 0.00023904409 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 6,231 | An LSM-based Tuple Compaction Framework for Apache AsterixDB | 2020 | VLDB | 5.1457863e-05 |
| 9,516 | [Demo] Low-latency Spark Queries on Updatable Data | 2019 | SIGMOD | 4.3335877e-05 |
| 11,931 | StarDB: A Large-Scale DBMS for Strings | 2015 | VLDB | 4.1945683e-05 |
| 7,905 | S2RDF: RDF Querying with SPARQL on Spark | 2016 | VLDB | 4.6211706e-05 |
| 4,773 | PolyFrame: A Retargetable Query-based Approach to Scaling Dataframes | 2021 | VLDB | 5.9320139e-05 |
| 4,493 | ASTERIX: An Open Source System for "Big Data" Management and Analysis (Demo) | 2012 | VLDB | 6.141595e-05 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 3,200 | Big Data Analytics with Datalog Queries on Spark | 2016 | SIGMOD | 7.3912411e-05 |
| 9,361 | An IDEA: An Ingestion Framework for Data Enrichment in AsterixDB | 2019 | VLDB | 4.3506168e-05 |