talk-data.com talk-data.com

Event

Learn Live: Bring Your Own AI Models to Intelligent Apps on AKS with Kaito

2024-02-28 โ€“ 2024-02-28 Meetup Visit website โ†—

Activities tracked

0

Join us to learn how to run open-source Large Language Models (LLMs) with HTTP-based inference endpoints inside your AKS cluster using the Kubernetes AI Toolchain Operator (KAITO). Weโ€™ll walk through the setup and deployment of containerized LLMs on GPU node pools and see how KAITO can help reduce operational burden of provisioning GPU nodes and tuning model deployment parameters to fit GPU profiles.

Presenters:

Paul Yu \| Senior Cloud Advocate\, Microsoft \| LinkedIn Ishaan Sehgal \| Software Engineer\, Microsoft \| LinkedIn

Learning objectives

  • Learn how to use Prometheus-style metrics with Azure Monitor.
  • Learn how to visualize application and infrastructure state with Azure Managed Grafana.
  • Learn how to use the AKS Cost Analysis add-on to monitor the different aspects of your AKS environment.

๐Ÿ“ŒAll About This Series

๐Ÿ“ŒMore About Learn Live

Sessions & talks

Showing 1โ€“0 of 0 ยท Newest first

Search within this event →

No individual activities are attached to this event yet.