talk-data.com

Google Cloud Next session 2025-04-09

Inference at scale with Google Cloud’s AI Hypercomputer (Recap)

Event: Google Cloud Next '25

Speakers

Reena Lee

Group Product Manager · Google Cloud

Juan Acevedo

Software Engineer · Google Cloud

Kirat Pandya

CEO · Osmos

Aditya Bindal

Chief Product Officer · Contextual AI

Topics

AI/ML Cloud Computing GCP

Description

Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.