The evolution of video understanding has followed a similar trajectory to language and image understanding - with the rise of large pre-trained foundation models trained on a huge amount of data. Given the surge of multimodal research lately, video foundation models are becoming even more powerful to decipher the rich visual information embedded in videos. This talk will explore diverse use cases of video understanding and provide a glimpse of Twelve Labs offerings.
talk-data.com
J
Speaker
James Le
1
talks
Head of Developer Experience
Twelve Labs
The Head of Developer Experience at Twelve Labs, a startup building multimodal foundation models for video understanding.
Bio from: Feb 2024 – AI, Machine Learning & Data Science Meetup
Filtering by:
Feb 2024 – AI, Machine Learning & Data Science Meetup
×
Filter by Event / Source
Talks & appearances
Showing 1 of 3 activities