LLM apps fail without reliable, reproducible evaluation. This talk maps the open‑source evaluation landscape, compares leading techniques (RAGAS, Evaluation‑Driven Development) and frameworks (DeepEval, Phoenix, Langfuse, and Braintrust), and shows how to combine unit‑style tests, RAG‑specific evals, and observability to ship higher‑quality systems. Attendees leave with a decision checklist, reusable code patterns, and a production‑ready playbook.