talk-data.com

Speaker

Emeli Dral

Event: PyData London 2025

Talks & appearances

AI agents testing: How to evaluate the unpredictable

AI agents and multi-step workflows are powerful, but testing them can be tricky. This talk explores practical ways to test these complex systems — like running multi-step simulations, checking tool calls, and using LLMs for evaluation. You'll also learn how to prioritize what to test and set up session-level evaluations with open-source tools.
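
As a rough illustration of what "checking tool calls" at the session level might look like, here is a minimal sketch in plain Python. It is not taken from the talk or from any specific open-source tool: the `ToolCall` record, the `check_tool_calls` helper, and the tool names in the example session are all hypothetical. The idea is to treat one recorded agent session as a list of tool calls and assert that the expected tools were called in the right relative order and that no forbidden tool was used.

```python
from dataclasses import dataclass, field


@dataclass
class ToolCall:
    """Hypothetical record of one tool invocation captured from an agent session."""
    name: str
    args: dict = field(default_factory=dict)


def check_tool_calls(session, expected_order, forbidden=()):
    """Return a list of failure messages for one session (empty list means pass)."""
    called = [call.name for call in session]
    failures = []

    # Expected tools must appear in this relative order;
    # unrelated calls may be interleaved between them.
    remaining = iter(called)
    for name in expected_order:
        if not any(seen == name for seen in remaining):
            failures.append(f"missing or out-of-order tool call: {name}")
            break

    # Any call to a forbidden tool fails the session.
    for name in called:
        if name in forbidden:
            failures.append(f"forbidden tool call: {name}")

    return failures


# Hypothetical recorded session with made-up tool names.
session = [
    ToolCall("search_flights", {"to": "LHR"}),
    ToolCall("get_weather", {"city": "London"}),
    ToolCall("book_flight", {"flight_id": "X1"}),
]

good = check_tool_calls(
    session,
    expected_order=["search_flights", "book_flight"],
    forbidden=["delete_booking"],
)
bad = check_tool_calls(session, expected_order=["book_flight", "search_flights"])

print(good)
print(bad)
```

A deterministic check like this only covers the structure of a session. Judging the quality of the agent's final answer would still need something else, such as the LLM-based evaluation the abstract mentions.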