An IDEA: An Ingestion Framework for Data Enrichment in AsterixDB
Summary: Proposes an ingestion framework pushing enrichment into the pipeline, supporting compiled code, declarative queries, or ML models. Built on Apache AsterixDB, it adapts to changing reference data to preserve currency and correctness, with evaluation at scale. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Xikui Wang
- 2. Michael J. Carey
Incoming Citations (Sorted by Pagerank)
Showing 4 of 4 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,914 | On Performance Stability in LSM-based Storage Systems | 2020 | VLDB | 5.8315684e-05 |
| 5,918 | Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems | 2021 | VLDB | 5.2737135e-05 |
| 7,399 | SmartBench: A Benchmark For Data Management In Smart Spaces | 2020 | VLDB | 4.7410149e-05 |
| 7,643 | Cross Modal Data Discovery over Structured and Unstructured Data Lakes | 2023 | VLDB | 4.6901105e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 10 of 10 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 139 | Predicate Migration: Optimizing Queries with Expensive Predicates | 1993 | SIGMOD | 0.00042299329 |
| 241 | DB2 with BLU Acceleration: So Much More than Just a Column Store | 2013 | VLDB | 0.00031420034 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 1,792 | Hybrid Transactional/Analytical Processing: A Survey | 2017 | SIGMOD | 0.00010537893 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
| 2,724 | Design and Implementation of an Extensible Database Management System Supporting User Defined Data Types and Functions | 1988 | VLDB | 8.2303311e-05 |
| 4,624 | Wildfire: Concurrent Blazing Data Ingest and Analytics | 2016 | SIGMOD | 6.0411906e-05 |
| 6,123 | Data Ingestion for the Connected World | 2017 | CIDR | 5.1991194e-05 |
| 8,467 | Creation and Interaction with Large-scale Domain-Specific Knowledge Bases | 2017 | VLDB | 4.504802e-05 |
| 9,910 | A BAD Demonstration: Towards Big Active Data | 2017 | VLDB | 4.2576157e-05 |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 3,200 | Big Data Analytics with Datalog Queries on Spark | 2016 | SIGMOD | 7.3912411e-05 |
| 7,907 | Petabyte-Scale Row-Level Operations in Data Lakehouses | 2024 | VLDB | 4.6205839e-05 |
| 6,123 | Data Ingestion for the Connected World | 2017 | CIDR | 5.1991194e-05 |
| 4,773 | PolyFrame: A Retargetable Query-based Approach to Scaling Dataframes | 2021 | VLDB | 5.9320139e-05 |
| 1,438 | AsterixDB: A Scalable, Open Source BDMS | 2014 | VLDB | 0.00011973592 |
| 5,535 | Lightweight Cardinality Estimation in LSM-based Systems | 2018 | SIGMOD | 5.4539235e-05 |
| 7,743 | Efficient Data Ingestion and Query Processing for LSM-Based Storage Systems | 2019 | VLDB | 4.6626575e-05 |
| 6,231 | An LSM-based Tuple Compaction Framework for Apache AsterixDB | 2020 | VLDB | 5.1457863e-05 |
| 2,021 | Storage Management in AsterixDB | 2014 | VLDB | 9.7601304e-05 |
| 7,794 | Large-scale Complex Analytics on Semi-structured Datasets using AsterixDB and Spark | 2016 | VLDB | 4.6482977e-05 |