talk-data.com talk-data.com

K

Speaker

Katarina Slama

1

talks

PhD, Research Scientist UK AI Security Institute

Katarina Slama holds a PhD in Neuroscience from UC Berkeley, focusing on mechanisms of human attention allocation. She previously served as a Member of Technical Staff at OpenAI, contributing to the development of InstructGPT in the pre-ChatGPT era and later evaluating societally relevant model behaviors. Based in London, she is broadly interested in the societal impacts of transformative AI, especially AI’s effects on mental health.

Bio from: Virtual Keynote Talk "AI Safety: Near and Far"

Filtering by: Virtual Keynote Talk "AI Safety: Near and Far" ×

Filter by Event / Source

Talks & appearances

Showing 1 of 3 activities

Search activities →

AI safety discourse often splits into immediate harm vs catastrophic risk framings. In this keynote, I argue that the two research streams will benefit from increased cross-talk and a greater number of synergistic projects. A zero-sum framing on attention and resources between the two communities is incorrect and does not serve either side's goals. Recent theoretical work, including on accumulative existential risk, unifies risk pathways between the two fields. Building on this, I suggest concrete synergies that are already in place - as well as opportunities for future collaboration.

I will discuss how shared research and monitoring infrastructure, such as UK AISI Inspect, can benefit both areas; how methodological approaches from human behavioral science, currently used in immediate harms research, can be ported into AI behavioral science applied to existential risk research; and how technical solutions from catastrophic risk research can be applied to mitigate immediate societal harms. We have a shared goal of building a better, safer future for everyone. Let's work together!