talk-data.com
People (16 results)
See all 16 →Activities & events
| Title & Speakers | Event |
|---|---|
|
Mechanical systems -> Biological systems: How managing infrastructure changes with scale and so how should we approach it
2024-02-07 · 23:00
Sami Meharzi
– SRE, Big Table
@ Google
When starting, software systems are similar to mechanical systems where functionality and changes are fairly predictable. However, with more automation and dynamic interactions, software systems start behaving more like biological systems/ecosystems. This sometimes leads to relatively small things having crazy unintended consequences and large things not quite having as much impact as one would hope. This stems from the full ecosystem and how everything (eventually) has some impact on everything else. With this in mind, there are approaches to solving problems at scale that would not make sense otherwise and some approaches that are detrimental. |
|
|
Fake 'till you make it: Get the most out of incident simulations
2024-02-07 · 23:00
Ashley Sawatsky
– Senior Reliability & Incident Response Advocate
@ Rootly
Palms are sweaty, knees weak, arms are heavy...sound like your first on-call shift? One of the biggest challenges in incident response work, especially for newer SREs, is the lack of safe spaces to fail. Incident simulations can be an effective way to take the terror out of that first on-call shift, but they take careful planning. In this talk, I’ll explore different types of simulations (from tabletops to full-on realistic mock incidents), how and when to utilize them, and how to make sure you get the most out of them. |
|
|
We've Done Everything Right. But Bad Things Keep Happening and Now What?
2024-02-07 · 23:00
Mattie Toia
– Engineering Director, Production Platform Infrastructure
@ Shopify
While we all can find places to improve, this talk will discuss how we can respond when bad things happen despite our implementation of many if not all of the recommended reliability practices. We'll talk about reasons why this might be the case, and then we'll examine some possible approaches to addressing them. |
|