talk-data.com talk-data.com

J

Speaker

James Maffey

1

talks

Product Manager Google Cloud
Filtering by: Google Cloud Next '25 ×

Filter by Event / Source

Talks & appearances

Showing 1 of 2 activities

Search activities →

Struggling to monitor the performance and health of your large language model (LLM) deployments on Google Kubernetes Engine (GKE)? This session unveils how the Google Cloud Observability suite provides a comprehensive solution for monitoring leading AI model servers like Ray, NVIDIA Triton, vLLM, TGI, and others. Learn how our one-click setup automatically configures dashboards, alerts, and critical metrics โ€“ including GPU and TPU utilization, latency, throughput, and error analysis โ€“ to enable faster troubleshooting and optimized performance. Discover how to gain complete visibility into your LLM infrastructure.