Database Paper Browser

Back to papers

Pregel: A System for Large-Scale Graph Processing

Summary: Vertex-centric computation model with supersteps: vertices send/receive messages, update state, and mutate edges or topology. Scales to billions of edges on commodity clusters; fault-tolerant, synchronous execution; distribution details hidden behind an API, enabling expressive, easy-to-program graph analytics. (summarized by gpt-5-nano on Feb 09 2026)

Paper ID
4239
Venue
SIGMOD
Year
2010
Pagerank
0.0019005923
Overall Rank
4 | 99.98%
DOI
-

Incoming Non-self Citations Over Time

Authors

Incoming Citations (Sorted by Pagerank)

Showing 50 of 238 citing papers.

Rank Citing Paper Year Venue Pagerank
37 Distributed GraphLab: A Framework for Machine Learning and Data Mining in the Cloud 2012 VLDB 0.0007522744
140 The MADlib Analytics Library or MAD Skills, the SQL 2012 VLDB 0.00042270404
260 Fast Exact Shortest-Path Distance Queries on Large Networks by Pruned Landmark Labeling 2013 SIGMOD 0.00030040036
331 The Ubiquity of Large Graphs and Surprising Challenges of Graph Processing 2018 VLDB 0.00027214222
342 EmptyHeaded: A Relational Engine for Graph Processing 2016 SIGMOD 0.00026795977
396 One Trillion Edges: Graph Processing at Facebook-Scale 2015 VLDB 0.00024424102
413 HaLoop: Efficient Iterative Data Processing on Large Clusters 2010 VLDB 0.00023904409
444 Parallelizing Sequential Graph Computations 2017 SIGMOD 0.00022987918
522 Differential dataflow 2013 CIDR 0.00021099241
542 Shark: SQL and Rich Analytics at Scale 2013 SIGMOD 0.00020595648
558 Trinity: A Distributed Graph Engine on a Memory Cloud 2013 SIGMOD 0.00020168032
574 From "Think Like a Vertex" to "Think Like a Graph" 2014 VLDB 0.00019883211
582 Scalable SPARQL Querying of Large RDF Graphs 2011 VLDB 0.00019723083
651 Efficient Subgraph Matching on Billion Node Graphs 2012 VLDB 0.00018648572
819 Persistent B+-Trees in Non-Volatile Main Memory 2015 VLDB 0.00016298164
851 The case against specialized graph analytics engines 2015 CIDR 0.0001594441
1,150 K-Core Decomposition of Large Networks on a Single PC 2016 VLDB 0.00013657353
1,160 Sancus: Staleness-Aware Communication-Avoiding Full-Graph Decentralized Training in Large-Scale Graph Neural Networks 2022 VLDB 0.00013586221
1,171 Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs 2014 VLDB 0.00013511313
1,265 Jaql: A Scripting Language for Large Scale Semistructured Data Analysis 2011 VLDB 0.00012947629
1,294 Distributed SociaLite: A Datalog-Based Language for Large-Scale Graph Analysis 2013 VLDB 0.00012779484
1,394 Real-time Constrained Cycle Detection in Large Dynamic Graphs 2018 VLDB 0.0001221552
1,408 An Experimental Comparison of Pregel-like Graph Processing Systems 2014 VLDB 0.00012133511
1,438 AsterixDB: A Scalable, Open Source BDMS 2014 VLDB 0.00011973592
1,452 Asynchronous Large-Scale Graph Processing Made Easy 2013 CIDR 0.00011919499
1,500 Parallel Subgraph Listing in a Large-Scale Graph 2014 SIGMOD 0.00011674394
1,665 The More the Merrier: Efficient Multi-Source Graph Traversal 2015 VLDB 0.00010967716
1,675 A Distributed Graph Engine for Web Scale RDF Data 2013 VLDB 0.00010947606
1,685 Fast Iterative Graph Computation with Block Updates 2013 VLDB 0.0001091808
1,800 epiC: an Extensible and Scalable System for Processing Big Data 2014 VLDB 0.00010512649
1,821 Computing Personalized PageRank Quickly by Exploiting Graph Structures 2014 VLDB 0.00010423565
1,877 Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation 2015 VLDB 0.00010236803
1,922 Selecting Subexpressions to Materialize at Datacenter Scale 2018 VLDB 0.00010082599
1,942 Heterogeneity-aware Distributed Parameter Servers 2017 SIGMOD 0.00010012691
1,953 Distributed Evaluation of Subgraph Queries Using Worst-case Optimal Low-Memory Dataflows 2018 VLDB 9.9665955e-05
1,968 An Experimental Comparison of Partitioning Strategies in Distributed Graph Processing 2017 VLDB 9.9071968e-05
1,976 Towards Effective Partition Management for Large Graphs 2012 SIGMOD 9.8844201e-05
2,006 PALM: Parallel Architecture-Friendly Latch-Free Modifications to B+ Trees on Many-Core Processors 2011 VLDB 9.8101551e-05
2,172 Spinning Fast Iterative Data Flows 2012 VLDB 9.3706587e-05
2,336 Optimizing Graph Algorithms on Pregel-like Systems 2014 VLDB 9.0109891e-05
2,337 Efficient Processing of Data Warehousing Queries in a Split Execution Environment 2011 SIGMOD 9.0098186e-05
2,400 ByteGNN: Efficient Graph Neural Network Training at Large Scale 2022 VLDB 8.8955105e-05
2,458 REX: Recursive, Delta-Based Data-Centric Computation 2012 VLDB 8.7683462e-05
2,487 Large-Scale Graph Analytics in Aster 6: Bringing Context to Big Data Discovery 2014 VLDB 8.6710828e-05
2,494 Streaming Graph Partitioning: An Experimental Study 2018 VLDB 8.6508229e-05
2,529 Pregelix: Big(ger) Graph Analytics on A Dataflow Engine 2015 VLDB 8.5940768e-05
2,551 NeMa: Fast Graph Search with Label Similarity 2013 VLDB 8.5572574e-05
2,595 LEOPARD: Lightweight Edge-Oriented Partitioning and Replication for Dynamic Graphs 2016 VLDB 8.4735292e-05
2,607 Graph Stream Summarization: From Big Bang to Big Crunch 2016 SIGMOD 8.4630211e-05
2,635 NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data 2017 VLDB 8.4045788e-05
Previous Page 1 / 5 Next

Outgoing Citations (Sorted by Pagerank)

Showing 1 of 1 cited papers.

Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.

Rank Cited Paper Year Venue Pagerank
3 Pig Latin: A Not-So-Foreign Language for Data Processing 2008 SIGMOD 0.0024183614
Previous Page 1 / 1 Next

Semantically Similar Papers