Many SRE teams still rely on manual intervention for incident handling; automation can improve response times and reduce toil. We will cover: Setting up comprehensive observability: Cloud Logging, Cloud Monitoring, and OpenTelemetry; Incident automation strategies: Runbooks, Auto-Healing, and ChatOps; Lessons from AWS CloudWatch and Azure Monitor applied to GCP; Case study: Reducing MTTR (Mean Time to Resolution) through automated detection and remediation
talk-data.com
Speaker
Ronaldo Arrudas
1
talks
Ronaldo is the Digital Development Studio Leader at Nearsure. He is a gifted individual (AH/SD) and a Mensa member with exceptional analytical and strategic abilities, which he leverages as a leader to precisely execute complex, multidisciplinary projects. His practical, results-driven approach is evident in successful initiatives like the Fractal Initiative and Solution Insights program, demonstrating his impact on organizational transformation. An INTP, he relentlessly pursues innovative and efficient solutions, proactively addressing challenges—as shown by his internationally recognized work on the Coca-Cola GO! Platform. He champions transparency, direct communication, and neurodiversity, fostering inclusive teams through programs like 'Talk to Ronaldo,' and applies a pragmatic approach to negotiations and strategic decisions, consistently focusing on innovation and sustainable value creation.
Bio from: Google NY Site Reliability Engineering (SRE) Tech Talks, 24 Jun 2025
Filter by Event / Source
Talks & appearances
1 activities · Newest first