Unlock the power of generative AI with retrieval-augmented generation (RAG) on Google Cloud. In this session, we’ll navigate the key architectural decisions involved in deploying and running RAG apps, from model and app hosting to data ingestion and vector store choice. We’ll cover three reference architecture options: an easy-to-deploy approach with Vertex AI RAG Engine, a fully managed solution on Vertex AI, and a flexible DIY topology with Google Kubernetes Engine and open source tools. We’ll also compare the trade-offs between operational simplicity and granular control.
Speaker: Kumar Dhanagopal, Cross-Product Solutions Developer, Google Cloud