Maximize your Gen AI inference performance on GKE. This session dives into the latest Kubernetes and GKE advancements, revealing how to achieve significant cost savings, reduced latency, and increased throughput. Discover new inference features on GKE for optimizing load balancing, scaling, accelerator selection, and overall usability. Plus, hear directly from Snap Inc. about their journey re-architecting their inference platform for the demands of Gen AI.
talk-data.com
Speaker
Shub Shrivastava
2 talks
Customer Engineer, Application Modernization
Google Cloud
Talks & appearances
with Drew Bradstock (Google Cloud), Pere Kyle (Snap), James Brown (Google Cloud), Shub Shrivastava (Google Cloud)
This session features a panel discussion with Snap Inc. about its journey from being born on Google App Engine to growing to serve 400M+ DAU, powered by Google Kubernetes Engine. Learn about the business decisions behind this evolution as we dive into the strategic approach taken by Snap’s leadership throughout the company’s history as a digital-born customer.