talk-data.com talk-data.com

V

Speaker

Venkatesh Krishnan

1

talks

Senior Manager of Product Management AWS

Filter by Event / Source

Talks & appearances

1 activities · Newest first

Search activities →
AWS re:Invent 2024 - Reduce FM deployment costs and latency with Amazon SageMaker (AIM307)

Organizations need robust, scalable, and cost-effective solutions to deploy and serve foundation models (FMs). This session explores how to use Amazon SageMaker to deploy FMs to make predictions at the best price performance for any use case. Get a detailed overview of deployment strategies to support large-scale generative AI inferencing, and learn how to architect solutions that optimize performance and cost.

Learn more: AWS re:Invent: https://go.aws/reinvent. More AWS events: https://go.aws/3kss9CP

Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4

About AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

AWSreInvent #AWSreInvent2024