talk-data.com talk-data.com

S

Speaker

Szymon Ożóg

1

talks

Senior AI Inference Engineer Aleph Alpha

Szymon is a Senior AI Inference Engineer at Aleph Alpha with extensive experience in optimizing GPU-based code, especially for large language models (LLMs).

Bio from: Why speed is all about memory

Filter by Event / Source

Talks & appearances

1 activities · Newest first

Search activities →

Why speed is all about memory — an exploration of why optimizing memory access is the most important factor in writing performant code. Szymon Ożóg will share his extensive experience in optimizing GPU-based code, especially for large language models (LLMs). Agenda topics include overview of challenges faced, organizational structure to back multiplatform development, shaping the tech stack to achieve the goal, and what's next.