talk-data.com talk-data.com

Google Cloud Next session 2025-04-11 at 19:30

Hybrid LLMs for edge AI applications

Description

Unleash the full potential of large language models (LLMs) on your edge devices, even when there’s spotty internet. This session explores a hybrid approach that combines the power of cloud-based LLMs with the efficiency of on-device models. Learn how to intelligently route queries, enabling laptops and mobile phones to perform complex tasks while maintaining snappy performance. View demos of efficient task routing that optimizes for quality and cost to ensure your apps run smoothly, even during network disruptions.