Faced with a mixed audience of Gophers and Rustaceans, Bryan went for the layer that is common to both: the CPU executing the code. Large multi-socket servers adopt a design known as Non-Uniform Memory Access (NUMA), where some memory is faster to access than the rest. Bryan will explain how this comes about, what it means for the performance of your programs, and what control you have over NUMA.
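The details are in the talk, but the shape of the problem is visible from the topology the kernel exposes. Below is a minimal sketch, assuming a Linux machine with sysfs mounted (the talk targets Go and Rust, but these files are language-agnostic): each NUMA node lists the CPUs local to it and a relative "distance" to every other node, where 10 means local and larger values mean slower remote access.

```python
from pathlib import Path

# Each nodeN directory describes one NUMA node: which CPUs are local to it
# and the relative access cost ("distance") to every node, 10 meaning local.
for node in sorted(Path("/sys/devices/system/node").glob("node[0-9]*")):
    cpus = (node / "cpulist").read_text().strip()
    distances = (node / "distance").read_text().split()
    print(f"{node.name}: cpus={cpus} distances={distances}")
```

Tools such as `numactl --hardware` print the same information, and `numactl` can also pin a process and its memory allocations to a chosen node.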
Topic: cpu (2 talks tagged); activity trend 2020-Q1 to 2026-Q1, peaking at 1 talk per quarter.
Running models locally on the CPU, and possibly a GPU, means we can experiment with the latest quantised models on real client data without anything leaving the machine. We can explore text question answering and image analysis, and call these tools from a Python API for rapid PoC experimentation. This quickly exposes the ways that LLMs go wrong, and might help us avoid the kind of embarrassing mistakes seen in some early LLM deployments!
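As an illustration of the kind of local PoC loop described above, here is a hedged sketch using the llama-cpp-python package to run a quantised GGUF model entirely on the local machine; the package, model file, and prompt are placeholder choices, not necessarily the stack used in the talk.

```python
from llama_cpp import Llama

# Load a quantised GGUF model from local disk; nothing is sent off-machine.
# The model path and prompt below are illustrative placeholders only.
llm = Llama(model_path="models/example-7b-q4.gguf", n_ctx=2048)

resp = llm.create_chat_completion(
    messages=[
        {"role": "system", "content": "Answer using only the supplied document."},
        {"role": "user", "content": "Document: <client text here>\n\nWhat dates are mentioned?"},
    ],
    max_tokens=128,
)
print(resp["choices"][0]["message"]["content"])
```

Swapping in different quantisations or prompts is a one-line change, which is what makes this loop useful for probing where a model starts to go wrong on real client documents.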