talk-data.com talk-data.com

Meetup talk 2025-12-09 at 18:15

NEO: Unlocking Scalable LLM Inference with Smart CPU Offloading

Topics

Description

Date: 2025-12-09. NEO: Unlocking Scalable LLM Inference with Smart CPU Offloading.