The evolution of video understanding has followed a similar trajectory to language and image understanding - with the rise of large pre-trained foundation models trained on a huge amount of data. Given the surge of multimodal research lately, video foundation models are becoming even more powerful to decipher the rich visual information embedded in videos. This talk will explore diverse use cases of video understanding and provide a glimpse of Twelve Labs offerings.
talk-data.com
Company
Twelve Labs
Speakers
1
Activities
2
Speakers from Twelve Labs
Talks & appearances
2 activities from Twelve Labs speakers
James Le
(Head of Developer Experience)
James Le
(Head of Developer Experience)
The evolution of video understanding has followed a similar trajectory to language and image understanding - with the rise of large pre-trained foundation models trained on a huge amount of data. Given the surge of multimodal research lately, video foundation models are becoming even more powerful to decipher the rich visual information embedded in videos. This talk will explore diverse use cases of video understanding and provide a glimpse of Twelve Labs offerings.