Live demo of automated testing for large language models. It addresses non-determinism in ML systems and demonstrates how a second LLM can act as a judge. It also explores Retrieval-Augmented Generation (RAG) for querying documents and guiding tests.
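The judge idea can be illustrated with a minimal sketch. It is not the code from the demo: `fake_model`, `fake_judge`, and `pass_rate` are hypothetical names, and both "models" are plain Python stand-ins so the example runs without any LLM API. The point is only the shape of the test: because the output wording varies between runs, the test asserts on a judge's verdicts over several runs rather than on an exact string.

```python
import random
from typing import Callable


def fake_model(prompt: str, rng: random.Random) -> str:
    """Hypothetical stand-in for a non-deterministic LLM.

    It returns the same fact with varying wording, so an exact-match
    assertion on the output would be flaky.
    """
    phrasings = [
        "The capital of France is Paris.",
        "Paris is France's capital city.",
        "It's Paris.",
    ]
    return rng.choice(phrasings)


def fake_judge(question: str, answer: str, criterion: str) -> bool:
    """Hypothetical stand-in for a second LLM acting as judge.

    A real judge would receive the question, the answer, and the
    criterion in a prompt and reply with a verdict. Here the verdict
    is a simple case-insensitive substring check.
    """
    return criterion.lower() in answer.lower()


def pass_rate(
    model: Callable[[str, random.Random], str],
    judge: Callable[[str, str, str], bool],
    question: str,
    criterion: str,
    runs: int = 20,
    seed: int = 0,
) -> float:
    """Sample the model several times and return the share of answers the judge accepts."""
    rng = random.Random(seed)
    verdicts = [judge(question, model(question, rng), criterion) for _ in range(runs)]
    return sum(verdicts) / runs


if __name__ == "__main__":
    rate = pass_rate(fake_model, fake_judge, "What is the capital of France?", "Paris")
    # Assert on a threshold instead of an exact output string.
    assert rate >= 0.9
    print(f"judge pass rate: {rate:.2f}")
```

Asserting on a pass rate with a threshold, rather than on a single output, is one common way to keep such tests stable. The threshold and the number of runs are assumptions to tune per use case.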
talk-data.com
Topic: automated testing
Quality Engineering meetup #9