talk-data.com

Topic

architecture

Activities

tagged

Activity Trend

95 peak/qtr

2020-Q1 2026-Q2

Top Events

Google Cloud Next '25 130 Global Azure France 2025 à Paris - #GlobalAzure 5 Mac Management with Microsoft Intune 1 x2x Community Meetup: Microservices rules by Chris Richardson & Vaughn Vernon 1

Top Speakers

Moontae Lee (LG AI Research) 4 Kshetrajna Radhaven (Shopify) 4 Newfel Harrat (Google Cloud) 4 Cesar Naranjo (Moloco) 4 Nathan Beach (Google Cloud) 3 Kirat Pandya (Osmos) 3 Leah Rivers (Google) 3 Chelsie Czop (Google Cloud) 3 Kasper Piskorski, PhD (Technology Innovation Institute) 3 Scott Dietzen (Augment Code) 2 Deepak Patil (Google Cloud) 2 Adarsh Seetharam (Google Cloud) 2

Activities

Showing filtered results

All Video Podcast Book

Filtering by: Kirat Pandya ×

AI Hypercomputer: Performance, scale, and the power of Pathways

2025-04-11 · Google Cloud Next '25

session

by Vaibhav Singh (Google Cloud) , Shaurya Gupta (Google Cloud) , Kirat Pandya (Osmos)

AI/ML

Scale your AI training and achieve peak performance with AI Hypercomputer. Gain actionable insights into optimizing your AI workloads for maximum goodput. Learn how to leverage our robust infrastructure for diverse models, including dense, Mixture of Experts, and diffusion. Discover how to customize your workflows with custom kernels and developer tools, facilitating seamless interactive development. You'll learn firsthand how Pathways, developed by Google Deepmind, enables large scale training resiliency, flexibility to express architecture.

Inference at scale with Google Cloud’s AI Hypercomputer

2025-04-09 · Google Cloud Next '25

session

by Aditya Bindal (Contextual AI) , Reena Lee (Google Cloud) , Kirat Pandya (Osmos) , Juan Acevedo (Google Cloud)

AI/ML Cloud Computing GCP

Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.

Inference at scale with Google Cloud’s AI Hypercomputer (Recap)

· Google Cloud Next '25

session

by Aditya Bindal (Contextual AI) , Reena Lee (Google Cloud) , Kirat Pandya (Osmos) , Juan Acevedo (Google Cloud)

AI/ML Cloud Computing GCP

Learn how to run high-throughput and low-latency inference on Google Cloud to maximize price-performance on TPUs and GPUs, leveraging JetStream and vLLM.