Lifetime-Based Memory Management for Distributed Data Processing Systems
Summary: Proposes a lifetime-based memory manager that analyzes user data/types to predict lifetimes and allocate/release memory, reducing GC pressure in distributed processing. Deca on Spark groups same-lifetime objects into byte arrays and frees them at end-of-life, delivering up to 99.9% GC reduction, up to 41.6x speedups, and ~46% memory savings. (summarized by gpt-5-nano on Feb 09 2026)
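The grouping idea in the summary can be pictured with a minimal sketch. This is not Deca's implementation (Deca runs on Spark on the JVM). The `LifetimeRegion` class, its method names, and the fixed `"<qd"` record layout are hypothetical and chosen only for illustration. The sketch assumes every record in a group has the same fixed layout and the same lifetime, so the records can be packed into one byte array and dropped together instead of being tracked as individual objects.

```python
import struct


class LifetimeRegion:
    """Hypothetical container for records that share one lifetime.

    Records are packed into a single bytearray rather than kept as
    separate objects, so a collector sees one buffer instead of many
    small objects, and the whole group is released in one step.
    """

    def __init__(self, fmt: str):
        # fmt is a struct format string describing one fixed-size record.
        self._struct = struct.Struct(fmt)
        self._buf = bytearray()
        self._count = 0

    def append(self, *fields) -> int:
        """Pack one record at the end of the buffer and return its index."""
        self._buf += self._struct.pack(*fields)
        self._count += 1
        return self._count - 1

    def get(self, index: int) -> tuple:
        """Decode the record stored at the given index."""
        if not 0 <= index < self._count:
            raise IndexError(index)
        return self._struct.unpack_from(self._buf, index * self._struct.size)

    def __len__(self) -> int:
        return self._count

    def release(self) -> None:
        """End of life for the whole group: drop the buffer at once."""
        self._buf = bytearray()
        self._count = 0


# Usage: three (int64, float64) records that live and die together.
region = LifetimeRegion("<qd")
for i in range(3):
    region.append(i, i * 0.5)
first = region.get(0)
region.release()
```

The design choice illustrated here is that lifetime is tracked per region, not per record, which is what lets the release be a single operation.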
Authors
1. Lu Lu
2. Xuanhua Shi
3. Yongluan Zhou
4. Xiong Zhang
5. Hai Jin
6. Cheng Pei
7. Ligang He
8. Yuanzhen Geng
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,686 | Concurrent Log-Structured Memory for Many-Core Key-Value Stores | 2018 | VLDB | 4.6786758e-05 |
| 8,002 | Pangea: Monolithic Distributed Storage for Data Analytics | 2019 | VLDB | 4.6088289e-05 |
| 9,332 | PlinyCompute: A Platform for High-Performance, Distributed, Data-Intensive Tool Development | 2018 | SIGMOD | 4.3556432e-05 |
| 9,913 | Chukonu: A Fully-Featured High-Performance Big Data Framework that Integrates a Native Compute Engine into Spark | 2022 | VLDB | 4.2565279e-05 |
| 11,694 | An Experimental Evaluation of Garbage Collectors on Big Data Applications | 2019 | VLDB | 4.1945683e-05 |
Outgoing Citations (Sorted by Pagerank)
Showing 4 of 4 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 3,504 | M3R: Increased Performance for In-Memory Hadoop Jobs | 2012 | VLDB | 7.0347515e-05 |
| 4,437 | Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics | 2015 | VLDB | 6.1907793e-05 |
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,650 | LocationSpark: A Distributed In-Memory Data Management System for Big Spatial Data | 2016 | VLDB | 6.0234336e-05 |
| 1,960 | Compaction management in distributed key-value datastores | 2015 | VLDB | 9.9521444e-05 |
| 9,155 | Towards Resource Efficiency: Practical Insights into Large-Scale Spark Workloads at ByteDance | 2024 | VLDB | 4.3849295e-05 |
| 4,802 | Resource Elasticity for Large-Scale Machine Learning | 2015 | SIGMOD | 5.9114415e-05 |
| 2,090 | Maintaining Time-Decaying Stream Aggregates | 2003 | PODS | 9.5647927e-05 |
| 11,797 | Runtime Optimization of Join Location in Parallel Data Management Systems | 2017 | VLDB | 4.1945683e-05 |
| 9,516 | [Demo] Low-latency Spark Queries on Updatable Data | 2019 | SIGMOD | 4.3335877e-05 |
| 2,848 | Exploiting Matrix Dependency for Efficient Distributed Matrix Computation | 2015 | SIGMOD | 8.0208832e-05 |
| 6,560 | Efficient In-memory Data Management: An Analysis | 2014 | VLDB | 5.010074e-05 |
| 11,694 | An Experimental Evaluation of Garbage Collectors on Big Data Applications | 2019 | VLDB | 4.1945683e-05 |