LLMs have opened up new avenues in NLP with their possible applications, but evaluating their output introduces a new set of challenges. In this talk, we discuss how the evaluation of LLMs differs from the evaluation of classic ML-based solutions and how we tackle the challenges.
talk-data.com
Topic
NLP
Natural Language Processing (NLP)
ai
machine_learning
text_analysis
2
tagged
Activity Trend
24
peak/qtr
2020-Q1
2026-Q1
Top Events
Filtering by:
Lena Nahorna
×
LLMs have opened up new avenues in NLP with their possible applications, but evaluating their output introduces a new set of challenges. In this talk, we discuss these challenges and our approaches to measuring the model output quality. We will talk about the existing evaluation methods and their pros and cons and then take a closer look at their application in a practical case study.