Extreme Data Mining
Summary: End-to-end extreme data mining at Google across hardware, networking, storage abstractions, and distributed numerical methods for high-throughput training and serving. Batch vs online training: infrastructure and algorithms optimized for planet-scale, 24/7 data processing. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
Incoming Citations (Sorted by Pagerank)
Showing 1 of 1 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 8,040 | Distributed Threshold Querying of General Functions by a Difference of Monotonic Representation | 2011 | VLDB | 4.600049e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 0 of 0 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
| Rank | Cited Paper | Year | Venue | Pagerank |
|---|
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 13,923 | Of Crawlers, Portals, Mice, and Men: Is there more to Mining the Web? | 1999 | SIGMOD | - |
| 13,806 | Query Processing Concepts and Techniques to Support Business Intelligence Applications | 2002 | VLDB | - |
| 13,265 | The Power of Summarization in Graph Mining and Learning: Smaller Data, Faster Methods, More Interpretability | 2021 | VLDB | - |
| 6,465 | Data mining, Hypergraph Transversals, and Machine Learning | 1997 | PODS | 5.0530117e-05 |
| 13,244 | Deep Data Integration | 2021 | SIGMOD | - |
| 13,526 | Data Management and Mining in Internet Ad Systems | 2010 | VLDB | - |
| 13,513 | Database Systems Research on Data Mining | 2010 | SIGMOD | - |
| 9,863 | Large Scale Graph Mining with G-Miner | 2019 | SIGMOD | 4.2682525e-05 |
| 13,555 | Extreme Streaming: Business Optimization Driving Algorithmic Challenges | 2008 | SIGMOD | - |
| 12,003 | Datacenters as Computers: Google Engineering & Database Research Perspectives | 2014 | VLDB | 4.1945683e-05 |