Scale your AI training and achieve peak performance with AI Hypercomputer. Gain actionable insights into optimizing your AI workloads for maximum goodput. Learn how to leverage our robust infrastructure for diverse models, including dense, Mixture of Experts, and diffusion. Discover how to customize your workflows with custom kernels and developer tools, facilitating seamless interactive development. You'll learn firsthand how Pathways, developed by Google Deepmind, enables large scale training resiliency, flexibility to express architecture.
talk-data.com
K
Speaker
Kirat Pandya
3
talks
CEO
Osmos
Frequent Collaborators
Filtering by:
Google Cloud Next '25
×
Filter by Event / Source
Talks & appearances
Showing 3 of 4 activities
with
Aditya Bindal
(Contextual AI)
,
Reena Lee
(Google Cloud)
,
Kirat Pandya
(Osmos)
,
Juan Acevedo
(Google Cloud)
Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.
with
Aditya Bindal
(Contextual AI)
,
Reena Lee
(Google Cloud)
,
Kirat Pandya
(Osmos)
,
Juan Acevedo
(Google Cloud)
Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.