talk-data.com
AWS re:Invent 2025 - Sustainable and cost-efficient generative AI with agentic workflows (AIM333)
Description
Building sustainable, cost-effective generative AI on AWS requires integrating agentic AI, efficient architecture, and cloud-native optimization. Agentic systems using Amazon Bedrock AgentCore employ contextual memory, asynchronous execution, and on-demand tool invocation to minimize compute waste. MCP enables secure connections between AI agents, AWS services, and custom tools. Efficiency increases through AWS's Trainium and Inferentia2 silicon (50% better performance per watt), Amazon SageMaker for scalable development, and optimization techniques like quantization and speculative decoding. Auto-scaling, batch processing, and spot instances prevent over-provisioning. Combined with CloudWatch and Cost Explorer monitoring, this approach delivers high-performance, low-carbon generative AI solutions.
Learn more: More AWS events: https://go.aws/3kss9CP
Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4
ABOUT AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.