talk-data.com

Showing 2 results

Activities & events

Title & Speakers Event
Arjun Bahuguna – Co-founder @ Audio Realities

Ever wondered how systems like Udio and Stable Audio turn a text prompt into a full-fledged song? In this talk, we’ll pull back the curtain on the technology behind text-to-music generation, focusing on latent diffusion models. We’ll compare popular model architectures, break down key concepts with intuitive visuals, and explore the “why” behind their design choices. No deep learning background needed, just curiosity! We’ll end with a short interactive quiz to recap and test your understanding.

text-to-music generation, latent diffusion models, model architectures, Udio, Stable Audio
Eugene Yakshin – C++ developer @ Hvoya Audio

I spent many hours listening closely to the decaying tails of shimmer, noticing patterns that lie outside the description of the individual components. It turned out that new properties of sound aren't always found in the algorithms, but in the way we listen. Which approaches to investigation allow us to perceive these emergent parameters?

audio programming, C++, generative music engines, audio plugins, music analysis libraries