talk-data.com talk-data.com

Meetup talk 2024-09-19 at 16:00

Tech Talk: Simply (auto)-scaling high-performance LLMs with serverless deployments

Description

Learn how to automatically scale LLMs in production, optimize your resource usage, and improve performance for your AI-driven applications.