Fast Data in the Era of Big Data: Twitter's Real-Time Related Query Suggestion Architecture
Summary: Twitter’s real-time related query suggestion and spelling correction, delivering results within minutes after breaking news. From Hadoop batch analytics to a custom in-memory engine, enabling low-latency results and shaping data-platform design. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Gilad Mishne
- 2. Jeff Dalton
- 3. Zhenghua Li
- 4. Aneesh Sharma
- 5. Jimmy Lin
Incoming Citations (Sorted by Pagerank)
Showing 6 of 6 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 1,794 | Summingbird: A Framework for Integrating Batch and Online MapReduce Computations | 2014 | VLDB | 0.00010532024 |
| 6,170 | PolarDB-IMCI: A Cloud-Native HTAP Database System at Alibaba | 2023 | SIGMOD | 5.171601e-05 |
| 8,658 | Modernization of Databases in the Cloud Era: Building Databases that Run like Legos | 2023 | VLDB | 4.4729338e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |
| 11,802 | Query-able Kafka: An agile data analytics pipeline for mobile wireless networks | 2017 | VLDB | 4.1945683e-05 |
| 11,848 | Microblogs Data Management Systems: Querying, Analysis, and Visualization | 2016 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 12 of 12 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 7,999 | TI: An Efficient Indexing Mechanism for Real-Time Search on Tweets | 2011 | SIGMOD | 4.6100392e-05 |
| 7,866 | Operational Analytics Data Management Systems | 2016 | VLDB | 4.6321795e-05 |
| 3,601 | Large-Scale Machine Learning at Twitter | 2012 | SIGMOD | 6.9315087e-05 |
| 2,476 | A Platform for Scalable One-Pass Analytics using MapReduce | 2011 | SIGMOD | 8.6960139e-05 |
| 4,885 | GraphJet: Real-Time Content Recommendations at Twitter | 2016 | VLDB | 5.8534354e-05 |
| 824 | Twitter Heron: Stream Processing at Scale | 2015 | SIGMOD | 0.0001623129 |
| 2,228 | Real-Time Twitter Recommendation: Online Motif Detection in Large Dynamic Graphs | 2014 | VLDB | 9.2385241e-05 |
| 2,870 | Streaming Similarity Search over one Billion Tweets using Parallel Locality-Sensitive Hashing | 2013 | VLDB | 7.9799783e-05 |
| 4,572 | The Unified Logging Infrastructure for Data Analytics at Twitter | 2012 | VLDB | 6.0760183e-05 |
| 9,504 | Supporting Scalable Analytics with Latency Constraints | 2015 | VLDB | 4.3341665e-05 |