talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (29 results)

See all 29 →
Showing 3 results

Activities & events

Title & Speakers Event
Justin Reock – Head of Developer Relations @ cortex.io

Justin will explain through real-world use cases how teams can adopt the emerging practice of metric scorecards to reduce meetings and streamline release readiness assessments using data and automation. The list of criteria required to release a service to production, often referred to as a “production readiness standard,” is a mandatory component of reliable systems of software delivery. Aligning to these standards cross-functionally is challenging, especially when standards may need to be bypassed or changed, often at the last minute. And most importantly, systems always drift, and software that met these requirements six months ago may not still be meeting them today – so can they still be considered ready for production? Teams often resort to time-consuming practices which are brittle and difficult to change. Cortex has pioneered the scorecard as means of driving engineering initiatives using gamification. By ingesting data from the various systems that engineers would normally check manually process are streamlined and readiness checks transformed to an always-on, continuous verification of readiness.

metrics production readiness automation sre
Managing Complex Migrations 2024-09-17 · 22:00
Tom Elliott – Founder of a stealth startup in CI/CD space; formerly Director of Software Engineering at Yext @ Ocuroot

Migrations are often motivated by reliability, but can also harm reliability if not done with care. Tom will explore several migrations he led over the past few years, including moving 2000+ jobs to Nomad, subsequently containerizing 2600 and moving to an HA setup with RabbitMQ. We will discuss what went well and what went wrong in each instance, and how we applied what we learned to improve our migration competency.

nomad Docker rabbitmq ci/cd
Data SRE - an introduction 2024-09-17 · 22:00
Venkat Mahalingham – Data SRE @ Google Maps

Data safety is becoming increasingly important and this talk will introduce this to the audience, to open up beyond traditional losses around data integrity. When you think of SRE, RPC services and service operations immediately come to mind - Errors, latency, managing the size and number of tasks etc., For most products, there is another important story - that of data flows and data sets. A critical error in data (e.g. critical highway missing a segment in its route etc.,) could have widespread consequences to users. No amount of RPC service level reliability will protect against that risk. We need to think about safety against data loss.

data safety data flows sre data integrity
Showing 3 results