talk-data.com

YouTube 2025-12-05 at 03:54

AWS re:Invent 2025 - Balance cost, performance & reliability for AI at enterprise scale (AIM3304)

Description

Deploying generative AI at enterprise scale requires balancing performance, cost, and reliability across diverse business use cases. Amazon Bedrock offers a complete portfolio of inference options: on-demand cross-region inference for elastic scaling; on-demand service tiers for balancing performance and cost, along with optimization options such as prompt caching that improve latency while significantly reducing cost; and batch inference for cost-effective bulk processing. This interactive session covers the tools and approaches needed to architect hybrid inference strategies that enable enterprises to maximize price-performance as AI workloads scale.

Learn more:
More AWS events: https://go.aws/3kss9CP

Subscribe:
More AWS videos: http://bit.ly/2O3zS75
More AWS events videos: http://bit.ly/316g9t4

ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

#AWSreInvent #AWSreInvent2025 #AWS