Google Kubernetes Engine (GKE) provides cost efficiency and high performance to run AI inference on Google tensor processing units (TPUs) and NVIDIA graphics processing units. Join us to learn how Anthropic runs its inference workload for Claude on GKE, and how Anthropic achieved better price-perf on TPU v5e on GKE. We’ll also learn how GKE advanced management capabilities simplify Day-2 maintenance, and how Google Cloud Customer Support makes the entire experience a blast.
Click the blue “Learn more” button above to tap into special offers designed to help you implement what you are learning at Google Cloud Next 25.
talk-data.com
N
Speaker
Ning Liao
1
talks
Senior Engineering Manager
Google Cloud
Filter by Event / Source
Talks & appearances
1 activities · Newest first