Why speed is all about memory — an exploration of why optimizing memory access is the most important factor in writing performant code. Szymon Ożóg will share his extensive experience in optimizing GPU-based code, especially for large language models (LLMs). Agenda topics include overview of challenges faced, organizational structure to back multiplatform development, shaping the tech stack to achieve the goal, and what's next.
talk-data.com
S
Speaker
Szymon Ożóg
1
talks
Senior AI Inference Engineer
Aleph Alpha
Szymon is a Senior AI Inference Engineer at Aleph Alpha with extensive experience in optimizing GPU-based code, especially for large language models (LLMs).
Bio from: Why speed is all about memory
Filter by Event / Source
Talks & appearances
1 activities · Newest first