Discussion on upgrading and tuning Grammarly's ML training platform to a scalable system. Topics include moving away from a custom architecture due to hardware shortages, key requirements and architectural challenges, MLOps best practices for scalability, and lessons learned from transitioning from a single-region AWS setup to a cross-region, multi-cloud cluster compute deployment.
talk-data.com
P
Speaker
Pavlo Skliar
1
talks
Technical Lead
Grammarly
Grammarly’s technical lead.
Bio from: Transforming ML Training Infrastructure at Grammarly
Filter by Event / Source
Talks & appearances
1 activities · Newest first