Why speed is all about memory — an exploration of why optimizing memory access is the most important factor in writing performant code. Szymon Ożóg will share his extensive experience in optimizing GPU-based code, especially for large language models (LLMs). Agenda topics include overview of challenges faced, organizational structure to back multiplatform development, shaping the tech stack to achieve the goal, and what's next.
talk-data.com
Activities tracked
1
Szymon (Aleph Alpha) will talk about Why speed is all about memory in which he will be exploring why optimizing memory access is the most important factor when writing performant code. Szymon has extensive experience in optimizing GPU-based code, especially when it comes to LLMs.
💡Agenda
- Overview of challenges we’ve faced
- Org structure to back multiplatform development
- Shaping the tech stack to achieve our goal
- What’s next
🔈 Speakers:
- Szymon Ożòg, Senior AI Inference Engineer
Agenda:
✨ 18:30 Doors open: time for networking with fellow attendees ✨ 19:00 Talk and Q&A ✨ 20:00 Mingling and networking with pizza and drinks ✨ 21:00 Meetup ends
- Where: In person, Aleph Alpha Berlin, Ritterstraße 6 - When: Tuesday, April 15th - Language: English
Sessions & talks
Showing 1–1 of 1 · Newest first