talk-data.com

Activities & events

Event: Google SRE NY Tech Talk (2024-02-07)
Sami Meharzi – SRE, Big Table @ Google

Early in their life, software systems resemble mechanical systems: functionality and the effects of changes are fairly predictable. However, with more automation and dynamic interactions, software systems start behaving more like biological systems or ecosystems. As a result, relatively small things can have crazy unintended consequences, and large things may not have as much impact as one would hope. This stems from the full ecosystem, in which everything (eventually) has some impact on everything else. With this in mind, some approaches to solving problems at scale would not make sense otherwise, and some approaches are detrimental.

Ashley Sawatsky – Senior Reliability & Incident Response Advocate @ Rootly

Palms are sweaty, knees weak, arms are heavy... Sound like your first on-call shift? One of the biggest challenges in incident response work, especially for newer SREs, is the lack of safe spaces to fail. Incident simulations can be an effective way to take the terror out of that first on-call shift, but they take careful planning. In this talk, I’ll explore different types of simulations (from tabletops to full-on realistic mock incidents), how and when to use them, and how to make sure you get the most out of them.

Mattie Toia – Engineering Director, Production Platform Infrastructure @ Shopify

While we can all find places to improve, this talk will discuss how we can respond when bad things happen despite our having implemented many, if not all, of the recommended reliability practices. We'll talk about why this might happen, and then we'll examine some possible approaches to addressing those causes.
