Database Paper Browser


Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud

Summary: Distributed GraphLab extends graph-parallel machine learning and data mining (ML/DM) to the cloud while preserving strong data consistency. It introduces graph-based pipelined locking and data versioning to reduce the impact of network latency, adds fault tolerance based on Chandy-Lamport snapshots, and demonstrates speedups of 1–2 orders of magnitude over Hadoop on EC2. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID: 10514
Venue: VLDB
Year: 2012
Pagerank: 0.0007522744
Overall Rank: 37 | 99.75%
DOI: -

[Figure: Incoming Non-self Citations Over Time]

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 123 citing papers.

Rank | Citing Paper | Year | Venue | Pagerank
66 | Spark SQL: Relational Data Processing in Spark | 2015 | SIGMOD | 0.00061639801
444 | Parallelizing Sequential Graph Computations | 2017 | SIGMOD | 0.00022987918
542 | Shark: SQL and Rich Analytics at Scale | 2013 | SIGMOD | 0.00020595648
557 | SystemML: Declarative Machine Learning on Spark | 2016 | VLDB | 0.00020197988
574 | From "Think Like a Vertex" to "Think Like a Graph" | 2014 | VLDB | 0.00019883211
1,044 | DimmWitted: A Study of Main-Memory Statistical Analytics | 2014 | VLDB | 0.00014475229
1,150 | K-Core Decomposition of Large Networks on a Single PC | 2016 | VLDB | 0.00013657353
1,171 | Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs | 2014 | VLDB | 0.00013511313
1,402 | Hybrid Parallelization Strategies for Large-Scale Machine Learning in SystemML | 2014 | VLDB | 0.00012180605
1,408 | An Experimental Comparison of Pregel-like Graph Processing Systems | 2014 | VLDB | 0.00012133511
1,452 | Asynchronous Large-Scale Graph Processing Made Easy | 2013 | CIDR | 0.00011919499
1,532 | Data Management in Machine Learning: Challenges, Techniques, and Systems | 2017 | SIGMOD | 0.00011472681
1,665 | The More the Merrier: Efficient Multi-Source Graph Traversal | 2015 | VLDB | 0.00010967716
1,666 | HELIX: Holistic Optimization for Accelerating Iterative Machine Learning | 2019 | VLDB | 0.0001096361
1,685 | Fast Iterative Graph Computation with Block Updates | 2013 | VLDB | 0.0001091808
1,692 | Mostly-Optimistic Concurrency Control for Highly Contended Dynamic Workloads on a Thousand Cores | 2017 | VLDB | 0.00010901611
1,750 | Weld: A Common Runtime for High Performance Data Analytics | 2017 | CIDR | 0.00010683647
1,821 | Computing Personalized PageRank Quickly by Exploiting Graph Structures | 2014 | VLDB | 0.00010423565
1,877 | Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation | 2015 | VLDB | 0.00010236803
1,942 | Heterogeneity-aware Distributed Parameter Servers | 2017 | SIGMOD | 0.00010012691
1,968 | An Experimental Comparison of Partitioning Strategies in Distributed Graph Processing | 2017 | VLDB | 9.9071968e-05
2,033 | NOMAD: Non-locking, stOchastic Multi-machine algorithm for Asynchronous and Decentralized matrix completion | 2014 | VLDB | 9.7172731e-05
2,163 | Elastic Machine Learning Algorithms in Amazon SageMaker | 2020 | SIGMOD | 9.3949234e-05
2,440 | FlexPS: Flexible Parallelism Control in Parameter Server Architecture | 2018 | VLDB | 8.8119143e-05
2,458 | REX: Recursive, Delta-Based Data-Centric Computation | 2012 | VLDB | 8.7683462e-05
2,487 | Large-Scale Graph Analytics in Aster 6: Bringing Context to Big Data Discovery | 2014 | VLDB | 8.6710828e-05
2,529 | Pregelix: Big(ger) Graph Analytics on A Dataflow Engine | 2015 | VLDB | 8.5940768e-05
2,607 | Graph Stream Summarization: From Big Bang to Big Crunch | 2016 | SIGMOD | 8.4630211e-05
2,677 | HET: Scaling out Huge Embedding Model Training via Cache-enabled Distributed Framework | 2022 | VLDB | 8.3268401e-05
2,709 | Vertexica: Your Relational Friend for Graph Analytics! | 2014 | VLDB | 8.2530203e-05
2,754 | Giraph Unchained: Barrierless Asynchronous Parallel Execution in Pregel-like Graph Processing Systems | 2015 | VLDB | 8.169411e-05
2,919 | RaSQL: Greater Power and Performance for Big Data Analytics with Recursive-aggregate-SQL on Spark | 2019 | SIGMOD | 7.9047279e-05
2,927 | Pregel Algorithms for Graph Connectivity Problems with Performance Guarantees | 2014 | VLDB | 7.8823626e-05
3,081 | Knowledge Expansion over Probabilistic Knowledge Bases | 2014 | SIGMOD | 7.6031501e-05
3,099 | DB4ML – An In-Memory Database Kernel with Machine Learning Support | 2020 | SIGMOD | 7.5642871e-05
3,105 | Data X-Ray: A Diagnostic Tool for Data Errors | 2015 | SIGMOD | 7.5568954e-05
3,143 | Extracting and Analyzing Hidden Graphs from Relational Databases | 2017 | SIGMOD | 7.4804326e-05
3,200 | Big Data Analytics with Datalog Queries on Spark | 2016 | SIGMOD | 7.3912411e-05
3,642 | Real-Time Multi-Criteria Social Graph Partitioning: A Game Theoretic Approach | 2015 | SIGMOD | 6.8876257e-05
3,670 | A Distributed Multi-GPU System for Fast Graph Processing | 2018 | VLDB | 6.8567044e-05
3,694 | Keys for Graphs | 2015 | VLDB | 6.8345712e-05
3,834 | GTS: A Fast and Scalable Graph Processing Method based on Streaming Topology to GPUs | 2016 | SIGMOD | 6.7173094e-05
3,982 | The Myria Big Data Management and Analytics System and Cloud Service | 2017 | CIDR | 6.5651188e-05
4,020 | TopoX: Topology Refactorization for Efficient Graph Partitioning and Processing | 2019 | VLDB | 6.5237459e-05
4,211 | Querying Big Graphs within Bounded Resources | 2014 | SIGMOD | 6.3563454e-05
4,234 | Distributed Edge Partitioning for Trillion-edge Graphs | 2019 | VLDB | 6.3355073e-05
4,473 | LogGP: A Log-based Dynamic Graph Partitioning Method | 2014 | VLDB | 6.1542362e-05
4,497 | Multi-Dimensional Balanced Graph Partitioning via Projected Gradient Descent | 2019 | VLDB | 6.1387773e-05
4,556 | Distributed Subgraph Matching on Timely Dataflow | 2019 | VLDB | 6.0883757e-05
4,581 | Beyond Macrobenchmarks: Microbenchmark-based Graph Database Evaluation | 2019 | VLDB | 6.0703328e-05

Outgoing Citations (Sorted by Pagerank)

Showing 3 of 3 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank | Cited Paper | Year | Venue | Pagerank
4 | Pregel: A System for Large-Scale Graph Processing | 2010 | SIGMOD | 0.0019005923
328 | An Architecture for Parallel Topic Models | 2010 | VLDB | 0.0002728514
660 | Large Graph Processing in the Cloud | 2010 | SIGMOD | 0.00018493984

Semantically Similar Papers