talk-data.com talk-data.com

Event

Transforming ML Training Infrastructure at Grammarly

2024-04-09 – 2024-04-09 Meetup Visit website ↗

Activities tracked

1

Join us on April 9 to hear Grammarly’s technical lead, Pavlo Skliar, discuss how and why we upgraded and tuned the ML training platform to a scalable system to streamline ML development at Grammarly.

Registration: to attend the meetup, please register ➡️ here ⬅️

We will share insights into our journey of upgrading ML training infrastructure. During the talk, we’ll discuss:

  • Why we decided to move away from our custom architecture solution constrained by hardware shortages
  • Key requirements that guided our decisions, the architectural challenges we faced, and which MLOps best practices were implemented to achieve scalability

  • Our learnings as we transition from a single-region, single-instance setup on AWS to a scalable system with cluster-compute capabilities across regions and clouds

🔈 Pavlo Skliar, Technical Lead at Grammarly ✨ Who Will Be Interested: ML engineers, MLOps engineers, ML Infrastructure engineers, and anyone with knowledge of, or interest in, ML architecture and infrastructure

This session will present a general overview of the topic, which will be of interest to enthusiasts and specialists at all levels. For the more senior members of our audience, we will briefly examine the practical aspects and associated challenges.

Agenda: 18:30 Doors open: Time for mingling and networking with fellows; snacks and drinks will be served 19:00 Talk 20:00 More snacks, drinks, mingling, and networking 21:00 Meetup ends

✅ Where: In person, Grammarly Berlin hub

✅ When: Tuesday, April 9 ✅ Language: English ✅ Use this link to register: https://gram.ly/43r0Pr3

The event is free. Registration is mandatory. Due to a limited number of seats, the invites will be sent to a limited number of registered on a first registered first invited basis. Please check your inbox for a confirmation email about your attendance.

Sessions & talks

Showing 1–1 of 1 · Newest first

Search within this event →

How and why we upgraded and tuned the ML training platform to a scalable system to streamline ML development at Grammarly.

2024-04-09
talk
Pavlo Skliar (Grammarly)

Discussion on upgrading and tuning Grammarly's ML training platform to a scalable system. Topics include moving away from a custom architecture due to hardware shortages, key requirements and architectural challenges, MLOps best practices for scalability, and lessons learned from transitioning from a single-region AWS setup to a cross-region, multi-cloud cluster compute deployment.