An interactive workshop exploring the infrastructure and service architecture you need to scale AI applications in production, including infrastructure as code basics, provisioning AWS resources (ECS clusters, networking, messaging queues, and Amazon RDS Postgres), and managing Pinecone indexes.
talk-data.com
Speaker
Zach Proser
3
talks
Staff Developer Advocate at Pinecone.
Bio from: Live Workshop: Scaling AI Apps
Filter by Event / Source
Talks & appearances
3 activities · Newest first
What does it take to go from an idea in a notebook to an application handling real-world traffic? The Pinecone and Pulumi teams will explore the infrastructure and service architecture you need in order to scale AI apps in production. We will delve into deploying high-volume AI systems through scalable microservices, efficient data processing, and seamless synchronization between user interfaces and databases. We will examine the nuances of containerization for enhanced portability and Infrastructure as Code (IaC) for streamlined cloud deployments. The workshop will also discuss industry best practices in scalability and security for production-grade AI systems in a cloud-native landscape. This workshop is designed to help developers and engineers gain valuable insights and practical strategies for evolving AI applications into resilient and efficient cloud-native solutions.