talk-data.com

Meetup talk 2025-09-12 at 16:30

LLM Inference Under the Hood: From Edge Devices to Production Beasts

Description

We will dive deeper into LLM inference, from Apple Silicon to huge production clusters, and cover tricks for making it faster.