Topic

geometric deep learning

Activities

2

tagged

Activity Trend

1 peak/qtr

2020-Q1 2026-Q1

Top Events

#23 AI Series: DeepMind - F. Barbero 1 Nov 24 - Best of ICCV (Day 4) 1

Top Speakers

Federico Barbero (DeepMind / University of Oxford) 1 Lennart Bastian (TU Munich, CAMP Lab) 1

Activities

2 activities · Newest first

All Video Podcast Book

Forecasting Continuous Non-Conservative Dynamical Systems in SO(3)

2025-11-24 · Nov 24 - Best of ICCV (Day 4)

talk

by Lennart Bastian (TU Munich, CAMP Lab)

neural controlled differential equations savitzkyu2013golay paths so(3) dynamics

Tracking and forecasting the rotation of objects is fundamental in computer vision and robotics, yet SO(3) extrapolation remains challenging as (1) sensor observations can be noisy and sparse, (2) motion patterns can be governed by complex dynamics, and (3) application settings can demand long-term forecasting. This work proposes modeling continuous-time rotational object dynamics on SO(3) using Neural Controlled Differential Equations guided by Savitzky-Golay paths. Unlike existing methods that rely on simplified motion assumptions, our method learns a general latent dynamical system of the underlying object trajectory while respecting the geometric structure of rotations. Experimental results on real-world data demonstrate compelling forecasting capabilities compared to existing approaches.

Why do LLMs struggle with Long Context?

2025-10-21 · #23 AI Series: DeepMind - F. Barbero

talk

by Federico Barbero (DeepMind / University of Oxford)

llms machine learning transformer

Abstract: There is great interest in scaling the number of tokens that LLMs can efficiently and effectively ingest, a problem that is notoriously difficult. Training LLMs on a smaller context and hoping that they generalize well to much longer contexts has largely proven to be ineffective. In this talk, I will go over our work that aims to understand the failure points in modern LLM architectures. In particular, I will discuss dispersion in the softmax layers, generalization issues related to positional encodings, and smoothing effects that occur in the representations. Understanding these issues has proven to be fruitful, with related ideas now already being part of frontier models such as LLaMa 4. The talk is intended to be broadly accessible, but a basic understanding of the Transformer architectures used in modern LLMs will be helpful.