Generative AI has introduced new query patterns as customers seek to leverage their data to customize customer experiences. With new vector search query patterns, vector data types and indexing, there's now a new frontier to consider for performance and cost optimization. In this session, learn how you can use Amazon MemoryDB to reduce latencies to single-digit milliseconds from single-digit seconds in generative AI workloads using durable semantic caching, while also reducing the cost incurred from your foundation models.

Learn more: AWS re:Invent: https://go.aws/reinvent. More AWS events: https://go.aws/3kss9CP

Subscribe: More AWS videos: http://bit.ly/2O3zS75 More AWS events videos: http://bit.ly/316g9t4

About AWS: Amazon Web Services (AWS) hosts events, both online and in-person, bringing the cloud computing community together to connect, collaborate, and learn from AWS experts. AWS is the world's most comprehensive and broadly adopted cloud platform, offering over 200 fully featured services from data centers globally. Millions of customers—including the fastest-growing startups, largest enterprises, and leading government agencies—are using AWS to lower costs, become more agile, and innovate faster.

AWSreInvent #AWSreInvent2024

talk-data.com

AWS re:Invent 2024 - Optimize gen AI apps with durable semantic caching in Amazon MemoryDB (DAT329)

Description

AWSreInvent #AWSreInvent2024