Aditya Bindal

Activities

2

talks

Chief Product Officer Contextual AI

Frequent Collaborators

Juan Acevedo Google Cloud 2 Kirat Pandya Osmos 2 Reena Lee Google Cloud 2

Filter by Event / Source

Google Cloud Next '25 2

Talks & appearances

2 activities · Newest first

Search activities →

Inference at scale with Google Cloud’s AI Hypercomputer

2025-04-09 · Google Cloud Next '25

session

with Aditya Bindal (Contextual AI) , Reena Lee (Google Cloud) , Kirat Pandya (Osmos) , Juan Acevedo (Google Cloud)

AI/ML Cloud Computing

Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.

Inference at scale with Google Cloud’s AI Hypercomputer (Recap)

· Google Cloud Next '25

session

with Aditya Bindal (Contextual AI) , Reena Lee (Google Cloud) , Kirat Pandya (Osmos) , Juan Acevedo (Google Cloud)

AI/ML

Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.