talk-data.com
Meetup
talk
2025-10-09 at 16:00
Automated testing of a Large Language Model
Description
Live-demo showing automated testing of large language models. Addresses non-determinism in ML systems and demonstrates how a second LLM can act as a judge. Also explores Retrieval Augmented Generation (RAG) for querying documents and guiding tests.