Mixer: Efficiently Understanding and Retrieving Visual Content at Web-scale
Summary: Mixer: class-based features with separate production/execution layers for images and videos. Two retrieval layers enable aggregation; on Baidu, model production time halved and throughput 9.14x, with 95% precision and 97% recall for video retrieval. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. An Qin
- 2. Mengbai Xiao
- 3. Yongwei Wu
- 4. Xinjie Huang
- 5. Xiaodong Zhang
Incoming Citations (Sorted by Pagerank)
Showing 3 of 3 citing papers.
| Rank | Citing Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 2,320 | High-Throughput Vector Similarity Search in Knowledge Graphs | 2023 | SIGMOD | 9.0366225e-05 |
| 4,641 | VIVA: An End-to-End System for Interactive Video Analytics | 2022 | CIDR | 6.027004e-05 |
| 10,409 | MicroNN: An On-device Disk-resident Updatable Vector Database | 2025 | SIGMOD | 4.1945683e-05 |
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 11 of 11 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next