Many SRE teams still rely on manual intervention for incident handling; automation can improve response times and reduce toil. We will cover: Setting up comprehensive observability: Cloud Logging, Cloud Monitoring, and OpenTelemetry; Incident automation strategies: Runbooks, Auto-Healing, and ChatOps; Lessons from AWS CloudWatch and Azure Monitor applied to GCP; Case study: Reducing MTTR (Mean Time to Resolution) through automated detection and remediation
talk-data.com
Topic
CloudWatch
Amazon CloudWatch
monitoring
observability
aws
1
tagged
Activity Trend
4
peak/qtr
2020-Q1
2026-Q1
Filtering by:
Ronaldo Arrudas
×