talk-data.com talk-data.com

Meetup talk 2025-02-20 at 18:00

Exploring DeepSeek’s Janus-Pro Visual Question Answer (VQA) Capabilities

Description

DeepSeek’s Janus-Pro is an advanced multimodal model designed for both multimodal understanding and visual generation, with a particular emphasis on improvements in understanding tasks. In this talk, we’ll explore Janus-Pro’s Visual Question Answer (VQA) capabilities using FiftyOne’s Janus-Pro VQA Plugin. The plugin provides a seamless interface to Janus Pro’s visual question understanding capabilities within FiftyOne, offering: Vision-language tasks; Hardware acceleration (CUDA/MPS) when available; Dynamic version selection from HuggingFace; Full integration with FiftyOne’s Dataset and UI. Can’t wait to see it for yourself? Check out the FiftyOne Quickstart with Janus-Pro.