Link Spam Detection Based on Mass Estimation
Summary: Introduces spam mass, a metric of the impact of link spam on a page's ranking. Estimates of spam mass enable identifying pages that disproportionately benefit from link spamming; Yahoo! web graph experiments reveal tens of thousands of heavy-weight spam instances. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Zoltan Gyongyi
- 2. Pavel Berkhin
- 3. Hector Garcia-Molina
- 4. Jan Pedersen
Incoming Citations (Sorted by Pagerank)
Showing 5 of 5 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 4,562 | Massively Parallel Algorithms for Personalized PageRank | 2021 | VLDB | 6.0846728e-05 |
| 5,655 | Personalized PageRank on Evolving Graphs with an Incremental Index-Update Scheme | 2023 | SIGMOD | 5.387631e-05 |
| 6,309 | Efficient Algorithms for Finding Approximate Heavy Hitters in Personalized PageRanks | 2018 | SIGMOD | 5.1167347e-05 |
| 12,472 | P2P Authority Analysis for Social Communities | 2007 | VLDB | 4.1945683e-05 |
| 13,671 | Link Spam Alliances | 2005 | VLDB | - |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 3 of 3 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 57 | Discovering Large Dense Subgraphs in Massive Graphs | 2005 | VLDB | 0.00065491112 |
| 2,564 | Combating Web Spam with TrustRank | 2004 | VLDB | 8.5277793e-05 |
| 13,671 | Link Spam Alliances | 2005 | VLDB | - |
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,802 | SocialSpamGuard: A Data Mining-Based Spam Detection System for Social Media Networks | 2011 | VLDB | 4.6469803e-05 |
| 13,338 | Resisting Tag Spam by Leveraging Implicit User Behaviors | 2017 | VLDB | - |
| 917 | Simrank++: Query Rewriting through Link Analysis of the Click Graph | 2008 | VLDB | 0.00015370124 |
| 6,309 | Efficient Algorithms for Finding Approximate Heavy Hitters in Personalized PageRanks | 2018 | SIGMOD | 5.1167347e-05 |
| 5,442 | RankMass Crawler: A Crawler with High Personalized PageRank Coverage Guarantee | 2007 | VLDB | 5.5026403e-05 |
| 13,808 | A Method of Re-ranking Web Search Results Using their Hidden Hyperlink Structure | 2002 | VLDB | - |
| 4,995 | On Link-based Similarity Join | 2011 | VLDB | 5.7787414e-05 |
| 57 | Discovering Large Dense Subgraphs in Massive Graphs | 2005 | VLDB | 0.00065491112 |
| 2,564 | Combating Web Spam with TrustRank | 2004 | VLDB | 8.5277793e-05 |
| 13,671 | Link Spam Alliances | 2005 | VLDB | - |