Database Paper Browser

Back to papers

The Story of AWS Glue

Summary: Describes AWS Glue: a serverless ETL platform with fast-start, auto-scaling Spark/Python (custom resource manager), DynamicFrames for schema-free semi-structured data, and a shuffle plugin offloading shuffles to object storage. Includes a Hive-compatible Data Catalog with crawlers and Glue Studio visual ETL to simplify scalable, extensible data-lake ingestion. (summarized by gpt-5-mini on Feb 09 2026)

Paper ID
13186
Venue
VLDB
Year
2023
Pagerank
4.3018844e-05
Overall Rank
9,699 | 32.53%
DOI
10.14778/3611540.3611547

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 2 of 2 citing papers.

Rank Citing Paper Year Venue Pagerank
10,767 The HANA Native Query Engine for Lakehouse Systems 2025 VLDB 4.1945683e-05
10,772 veDB-HTAP: a Highly Integrated, Efficient and Adaptive HTAP System 2025 VLDB 4.1945683e-05
Previous Page 1 / 1 Next

Outgoing Citations (Sorted by Pagerank)

Showing 5 of 5 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Previous Page 1 / 1 Next

Semantically Similar Papers