talk-data.com talk-data.com

YouTube 2025-12-06 at 22:09

AWS re:Invent 2025 - Scale AI agents with custom models using Amazon SageMaker AI & SGLang (AIM387)

Description

Are you looking to customize foundation models and deploy AI agents at scale? Learn how to leverage Amazon SageMaker AI to build performant agentic workflows with customized open-weight models. This session covers the end-to-end journey: use Amazon SageMaker AI for model customization, track experiments with managed MLflow, establish governance with Amazon SageMaker AI Model Registry, and deploy optimized models using SGLang on Amazon SageMaker AI Inference for low-latency agent applications. Discover how to create repeatable, auditable workflows with Amazon SageMaker AI Pipelines while maintaining comprehensive observability and control for production AI systems. Includes a live demonstration of customizing and deploying an open-weight model, plus insights from SGLang's team on inference for agents.

Learn more: More AWS events: https://go.aws/3kss9CP

Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4

ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

AWSreInvent #AWSreInvent2025 #AWS