talk-data.com

Topic: evaluation (9 tagged)

Activity Trend: peak of 5 per quarter, 2020-Q1 to 2026-Q1

Activities

9 activities · Newest first

Join Kostia Omelianchuk and Lukas Beisteiner as they unpack the full scope of Grammatical Error Correction (GEC), from task framing, evaluation, and training to inference optimization and serving high-performance production systems at Grammarly. They will discuss the modern GEC recipe (a shift from heavily human-annotated corpora to semi-synthetic data generation), LLM-as-a-judge techniques for scalable evaluation, and techniques that make deployment fast and affordable, including speculative decoding.

Agents are powerful, but without feedback they're flying blind. In this talk, we'll walk through how to build self-improving agents by closing the loop with evaluation, experimentation, tracing, and prompt optimization. You'll learn how to capture the right telemetry, run meaningful tests, and apply insights in a way that actually improves performance over time. Whether you're building copilots, chatbots, or autonomous workflows, this session will give you the practical tools and architecture patterns you need to make your agents smarter automatically.