talk-data.com
AWS re:Invent 2025 - Build, fine-tune & deploy AI models with SageMaker HyperPod CLI & SDK (AIM371)
Description
Amazon SageMaker HyperPod helps build, train and deploy AI models at scale. You can spin-up preferred IDE on HyperPod to develop their models and run experiments, then scale to thousands of accelerators to run distributed training and finally deploy models for real-time or batch inference. With HyperPod task governance capabilities, you can optimize the utilization of the cluster resources. In this talk, we will show how interaction with HyperPod clusters is simplified via the SageMaker HyperPod CLI and SDK, which streamlined end-to-end model development, from training, fine-tuning to deployment. We'll show HyperPod CLI/SDK capabilities with live coding and troubleshooting.
Learn more: More AWS events: https://go.aws/3kss9CP
Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4
ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.