Topic

mm-rag

Activities

1

tagged

Activity Trend

1 peak/qtr

2020-Q1 2026-Q2

Top Events

AI Meetup (February): AI, GenAI, LLMs and ML 1

Activities

1 activities · Newest first

All Video Podcast Book

Tech Talk: Searching and Reasoning Over Multimedia Data with Vector Databases and LMMs

2024-02-22 · AI Meetup (February): AI, GenAI, LLMs and ML

talk

cross-modal search multimodal embedding models multimodal large language models real-time demos vector databases

In this talk, Zain Hasan will discuss how we can use open-source multimodal embedding models in conjunction with large generative multimodal models that can that can see, hear, read, and feel data(!), to perform cross-modal search (searching audio with images, videos with text etc.) and multimodal retrieval augmented generation (MM-RAG) at the billion-object scale with the help of open source vector databases. I will also demonstrate, with live code demos, how being able to perform this cross-modal retrieval in real-time can enables users to use LLMs that can reason over their enterprise multimodal data. This talk will revolve around how we can scale the usage of multimodal embedding and generative models in production.