Live demo of automated testing for large language models. The talk addresses non-determinism in ML systems and demonstrates how a second LLM can act as a judge. It also explores Retrieval-Augmented Generation (RAG) for querying documents and guiding tests.
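To make the LLM-as-judge idea concrete, here is a minimal sketch, not the code from the talk. It assumes a model can be represented as any function from a prompt string to a response string. The names `LLM`, `JUDGE_TEMPLATE`, `judge_answer`, `pass_rate`, `fake_model`, and `fake_judge` are all hypothetical, and the two stand-in models exist only so the example runs without an API.

```python
from typing import Callable

# Hypothetical type: any function mapping a prompt string to a response string.
LLM = Callable[[str], str]

# Hypothetical grading prompt; a real one would be tuned to the task.
JUDGE_TEMPLATE = (
    "You are grading an answer.\n"
    "Question: {question}\n"
    "Answer: {answer}\n"
    "Criterion: {criterion}\n"
    "Reply with exactly PASS or FAIL."
)


def judge_answer(judge: LLM, question: str, answer: str, criterion: str) -> bool:
    """Ask the judge model for a PASS/FAIL verdict on a single answer."""
    prompt = JUDGE_TEMPLATE.format(
        question=question, answer=answer, criterion=criterion
    )
    return judge(prompt).strip().upper().startswith("PASS")


def pass_rate(model: LLM, judge: LLM, question: str, criterion: str,
              runs: int = 5) -> float:
    """Sample the model several times and return the fraction judged PASS."""
    passes = sum(
        judge_answer(judge, question, model(question), criterion)
        for _ in range(runs)
    )
    return passes / runs


# Stand-in models (assumptions): replace with calls to real LLM clients.
def fake_model(prompt: str) -> str:
    return "Paris is the capital of France."


def fake_judge(prompt: str) -> str:
    # Toy rule: pass whenever the graded prompt mentions Paris.
    return "PASS" if "Paris" in prompt else "FAIL"


rate = pass_rate(
    fake_model,
    fake_judge,
    question="What is the capital of France?",
    criterion="Names the correct capital city",
)
# Assert on a threshold rather than an exact string match, so the test
# tolerates run-to-run variation in the model's wording.
assert rate >= 0.8
```

Sampling several responses and asserting on a pass rate is one way to cope with non-deterministic output; the number of runs and the threshold are illustrative choices, not recommendations from the talk.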
talk-data.com
Topic: prompting
Filtering by:
Anupam Krishnamurthy