Lifetime-Based Memory Management for Distributed Data Processing Systems

Summary: Proposes a lifetime-based memory manager that analyzes user data and types to predict object lifetimes, then allocates and releases memory according to those lifetimes, reducing garbage collection (GC) pressure in distributed data processing. Deca, the implementation on Spark, groups objects with the same lifetime into byte arrays and frees each group when its lifetime ends, delivering up to a 99.9% reduction in GC time, speedups of up to 41.6x, and memory savings of about 46%. (summarized by gpt-5-nano on Feb 09 2026)
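
Illustrative sketch (not taken from the paper): the grouping idea in the summary can be pictured as packing records that share a lifetime into one byte buffer and dropping that buffer in a single step. The class name LifetimeRegion, the fixed (int64 key, float64 value) record layout, and the capacity parameter are all assumptions made for this example. Deca itself targets the JVM and Spark, so this Python version only mirrors the general technique.

```python
import struct

# Assumed fixed-width record layout for illustration: (int64 key, float64 value).
RECORD = struct.Struct("<qd")


class LifetimeRegion:
    """Hypothetical container: records with one shared lifetime in one buffer."""

    def __init__(self, capacity):
        # One contiguous allocation instead of one heap object per record.
        self._buf = bytearray(capacity * RECORD.size)
        self._capacity = capacity
        self._count = 0

    def append(self, key, value):
        if self._buf is None:
            raise RuntimeError("region already released")
        if self._count == self._capacity:
            raise MemoryError("region full")
        RECORD.pack_into(self._buf, self._count * RECORD.size, key, value)
        self._count += 1

    def get(self, index):
        if self._buf is None:
            raise RuntimeError("region already released")
        if not 0 <= index < self._count:
            raise IndexError(index)
        return RECORD.unpack_from(self._buf, index * RECORD.size)

    def __len__(self):
        return self._count

    def release(self):
        # End of lifetime: drop the single buffer, freeing every record at once.
        self._buf = None
        self._count = 0


# Usage: fill a region, read it back, then release the whole group together.
region = LifetimeRegion(capacity=3)
region.append(1, 0.5)
region.append(2, 1.5)
total = sum(region.get(i)[1] for i in range(len(region)))
region.release()
```

Because the records live inside one bytearray, the runtime tracks a single object for the group instead of one per record, which is the kind of GC-pressure reduction the summary describes.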

Paper ID: 11381
Venue: VLDB
Year: 2016
Pagerank: 6.3011982e-05
Overall Rank: 5,400 | 62.48%
DOI: -

Incoming Citations (Sorted by Pagerank)

Showing 5 of 5 citing papers.

Rank	Citing Paper	Year	Venue	Pagerank
7,738	Concurrent Log-Structured Memory for Many-Core Key-Value Stores	2018	VLDB	5.6141923e-05
7,879	Pangea: Monolithic Distributed Storage for Data Analytics	2019	VLDB	5.5941095e-05
9,342	PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development	2018	SIGMOD	5.3449422e-05
9,923	Chukonu: A Fully-Featured High-Performance Big Data Framework that Integrates a Native Compute Engine into Spark	2022	VLDB	5.2412669e-05
11,699	An Experimental Evaluation of Garbage Collectors on Big Data Applications	2019	VLDB	5.1725247e-05

Outgoing Citations (Sorted by Pagerank)

Showing 4 of 4 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank	Cited Paper	Year	Venue	Pagerank
25	Spark SQL: Relational Data Processing in Spark	2015	SIGMOD	0.00055280049
2,239	A Platform for Scalable One-Pass Analytics using MapReduce	2011	SIGMOD	8.9526726e-05
3,660	M3R: Increased Performance for In-Memory Hadoop Jobs	2012	VLDB	7.282074e-05
4,143	Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics	2015	VLDB	6.9380007e-05

Semantically Similar Papers

Overall Rank	Paper	Year	Venue	Pagerank
4,415	LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data	2016	VLDB	6.7783491e-05
1,636	Compaction management in distributed key-value datastores	2015	VLDB	0.0001021954
8,333	Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance	2024	VLDB	5.5057547e-05
4,529	Resource Elasticity for Large-Scale Machine Learning	2015	SIGMOD	6.7150509e-05
2,567	Maintaining Time-Decaying Stream Aggregates	2003	PODS	8.4752829e-05
11,805	Runtime Optimization of Join Location in Parallel Data Management Systems	2017	VLDB	5.1725247e-05
9,510	[Demo] Low-latency Spark Queries on Updatable Data	2019	SIGMOD	5.32339e-05
2,635	Exploiting Matrix Dependency for Efficient Distributed Matrix Computation	2015	SIGMOD	8.3840343e-05
7,003	Efficient In-memory Data Management: An Analysis	2014	VLDB	5.7853501e-05
11,699	An Experimental Evaluation of Garbage Collectors on Big Data Applications	2019	VLDB	5.1725247e-05