talk-data.com


Speaker

Dariusz Piotrowski

1 talk

Senior Machine Learning Engineer (contractor) Amazon Robotics

After four years as an Applied Scientist at Amazon, where he worked on text-to-speech (TTS) for Alexa and computer vision (CV) for Ring, he has spent the past two years as a Senior Machine Learning Engineer (contractor), including the last six months at Amazon Robotics, where he builds neuro-symbolic planning systems.

Bio from: PyData Trójmiasto #37


Talks & appearances

1 activity


Ever wondered what actually happens when you call an LLM API? This talk breaks down the inference pipeline from tokenization to text generation, explaining what's really going on under the hood. We will walk through the key sampling and decoding strategies and their parameters: temperature, top-p, top-k, and beam search. We'll also cover performance tricks like quantization, KV caching, and prompt caching that can speed things up significantly. If time allows, we will also touch on some use-case-specific techniques like pass@k and majority voting.
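To give a rough feel for the sampling parameters named in the abstract, here is a minimal sketch of how temperature, top-k, and top-p are commonly combined when picking the next token. It is not taken from the talk: the function name `sample_next_token`, the dict-of-logits input, and the toy vocabulary are all assumptions made for illustration, and real inference stacks operate on tensors over the full vocabulary.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, top_k=None, top_p=None, rng=random):
    """Sample one token from raw logits (hypothetical helper, dict: token -> logit)."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")

    # Temperature scaling: values below 1 sharpen the distribution, above 1 flatten it.
    scaled = {tok: logit / temperature for tok, logit in logits.items()}

    # Softmax, subtracting the maximum for numerical stability.
    peak = max(scaled.values())
    weights = {tok: math.exp(value - peak) for tok, value in scaled.items()}
    total = sum(weights.values())
    ranked = sorted(
        ((tok, w / total) for tok, w in weights.items()),
        key=lambda pair: pair[1],
        reverse=True,
    )

    # Top-k: keep only the k most probable tokens.
    if top_k is not None:
        ranked = ranked[:top_k]

    # Top-p (nucleus): keep the smallest prefix whose cumulative mass reaches top_p.
    if top_p is not None:
        kept, mass = [], 0.0
        for tok, prob in ranked:
            kept.append((tok, prob))
            mass += prob
            if mass >= top_p:
                break
        ranked = kept

    tokens = [tok for tok, _ in ranked]
    probs = [prob for _, prob in ranked]
    # random.choices renormalizes the surviving weights before drawing.
    return rng.choices(tokens, weights=probs, k=1)[0]


# Toy logits, made up for this example.
toy_logits = {"cat": 2.0, "dog": 1.0, "fish": 0.1}
greedy_like = sample_next_token(toy_logits, top_k=1)
```

With `top_k=1` the draw collapses to the most probable token, which is why `greedy_like` is always `"cat"` for these toy logits; loosening `top_k` or `top_p`, or raising `temperature`, lets less likely tokens through.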