Photon: A Fast Query Engine for Lakehouse Systems
Summary: Photon is a vectorized Lakehouse query engine, delivering fast queries on raw data lakes and Parquet via Spark API. Design choices (vectorization vs. code generation), memory manager, and SQL/Spark integration enable 10x gains and a 100TB TPC-DS record. (summarized by gpt-5-nano on Feb 09 2026)
Incoming Non-self Citations Over Time
Authors
- 1. Alexander Behm
- 2. Shoumik Palkar
- 3. Utkarsh Agarwal
- 4. Timothy Armstrong
- 5. David Cashman
- 6. Ankur Dave
- 7. Todd Greenstein
- 8. Shant Hovsepian
- 9. Ryan Johnson
- 10. Arvind Sai Krishnan
- 11. Paul Leventis
- 12. Ala Luszczak
- 13. Prashanth Menon
- 14. Mostafa Mokhtar
- 15. Gene Pang
- 16. Sameer Paranjpye
- 17. Greg Rahn
- 18. Bart Samwel
- 19. Tom van Bussel
- 20. Herman van Hovell
- 21. Maryann Xue
- 22. Reynold Xin
- 23. Matei Zaharia
Incoming Citations (Sorted by Pagerank)
Showing 28 of 28 citing papers.
Previous
Page 1 / 1
Next
Outgoing Citations (Sorted by Pagerank)
Showing 19 of 19 cited papers.
Citations counted here include only citations to other VLDB/SIGMOD/CIDR/PODS papers in this database.
Previous
Page 1 / 1
Next
Semantically Similar Papers
| Overall Rank | Paper | Year | Venue | Pagerank |
|---|---|---|---|---|
| 746 | Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores | 2020 | VLDB | 0.00017326979 |
| 10,196 | PTO: A Workload-driven Predictive Table Optimizer for Lakehouse Systems | 2026 | SIGMOD | 4.1945683e-05 |
| 6,402 | BigLake: BigQuery’s Evolution toward a Multi-Cloud Lakehouse | 2024 | SIGMOD | 5.079818e-05 |
| 4,495 | ClickHouse - Lightning Fast Analytics for Everyone | 2024 | VLDB | 6.1410277e-05 |
| 1,377 | Lakehouse: A New Generation of Open Platforms that Unify Data Warehousing and Advanced Analytics | 2021 | CIDR | 0.00012296941 |
| 10,767 | The HANA Native Query Engine for Lakehouse Systems | 2025 | VLDB | 4.1945683e-05 |
| 9,857 | Towards Unifying Query Interpretation and Compilation | 2023 | CIDR | 4.269353e-05 |
| 7,907 | Petabyte-Scale Row-Level Operations in Data Lakehouses | 2024 | VLDB | 4.6205839e-05 |
| 7,059 | Adaptive and Robust Query Execution for Lakehouses at Scale | 2024 | VLDB | 4.8477825e-05 |
| 9,808 | Photon: A High-Performance Query Engine for the Lakehouse | 2022 | CIDR | 4.2794025e-05 |