An "eval," short for evaluation, is a set of structured tests or benchmarks used to systematically assess the performance and quality of an AI model or a program. Creating evals is a foundational practice when building solutions with AI. In this talk, Amy Heineike introduces evals: what they are, why you need them, and how to get started.
talk-data.com
Speaker: Amy Heineike
AI Engineer, Tessl
1 talk
Amy has been working in AI since the models were still small. She has grown multiple startups in data visualisation, NLP and summarisation. Now she is building the eval frameworks and tooling for Tessl, a company empowering coding agents with specs. She is fascinated by how we use Data + AI to make sense of the world around us, and by how we build effective teams that leverage it.
Bio from: Meetup #11 - An Introduction To Evals and The Story of DearGarden
Talks & appearances