Airflow has an inherent SLA alert mechanism. When the scheduler sees such an SLA miss for some task, it sends an alert by email. The problem is, that this email is nice, but we can’t really know when each task is eventually successful. Moreover, even if there is such an email upon success following an SLA miss, it does not give us a good view of the current status at any given time. In order to solve this, we developed SLAyer, an application that gets information of SLA misses from Airflow’s database and reports the current status to Prometheus, provides metrics per dag, task, and execution date currently in violation of its SLA.
talk-data.com
Topic
Prometheus
monitoring
alerting
time_series_database
1
tagged
Activity Trend
2
peak/qtr
2020-Q1
2026-Q1
Top Events
Data Engineering Podcast
5
VictoriaMetrics | Criteo - Observability Meetup in Paris 🇫🇷 - Tech Event
2
SQL Superpowers and Smart Plant Monitoring with Grafana
1
O'Reilly Data Science Books
1
Kubernetes & Cloud Native Berlin Meetup May Edition
1
New Relic Breakfast Club: Amsterdam
1
Inaugural Grafana & Friends London Meetup!
1
Sensors to Screens: Building Real-Time IoT & Physical Dashboards w Raspberry Pi
1
AWS Women’s UG Berlin November Meetup - AWS Cloud Talks
1
Airflow Summit 2022
1
Observability Insights with Grafana, HelloFresh and Reddit
1
New Relic Breakfast Club: London
1
Filtering by:
Airflow Summit 2022
×