talk-data.com
Google Cloud Next
session
2025-04-09 at 20:30
Inference at scale with Google Cloud’s AI Hypercomputer
Event:
Google Cloud Next '25
Speakers
Topics
Description
Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.