talk-data.com talk-data.com

Event

Why you should build an LLM benchmark

2024-01-16 – 2024-01-16 Meetup Visit website β†—

Activities tracked

4

πŸ“Š Dive Deep into the World of LLM Benchmarks! πŸ“Š

Objective: By the end of this session, you should have a good understanding of how to select and maintain your own LLM benchmark.

Agenda: πŸ”¬ Demo! πŸ”Discover what ARC, HellSwag, and MMLU are exactly 🧫 Learn how to select the right benchmark πŸ§ͺ Methods to test LLMs tailored to your unique use case 🧱 Q&A

Speaker: J. Yarkoni ex-Google AI/ML Specialist (Shujin.ai) Jonathan comes from a background of leading R&D teams. Previously he co-founded NAM, an advertising startup, and AA-TLV meetup, which at its peak had 3,500 members. Over the last six years, he spearheaded AI/ML initiatives at Google Cloud Israel. More recently, he established Shujin.AI, a consultancy specializing in ML projects with an emphasis on Generative AI.

Sessions & talks

Showing 1–4 of 4 Β· Newest first

Search within this event →

Discover what ARC, HellSwag, and MMLU are exactly

talk
j yarkoni (Shujin.ai)

Learn how to select the right benchmark

talk
j yarkoni (Shujin.ai)

Methods to test LLMs tailored to your unique use case

talk
j yarkoni (Shujin.ai)
LLM