talk-data.com
PyData
talk
2025-11-08 at 23:20
Securing Retrieval-Augmented Generation: How to Defend Vector Databases Against 2025 Threats
Event:
PyData Seattle 2025
Speakers
Topics
Description
Modern LLM applications rely heavily on embeddings and vector databases for retrieval-augmented generation (RAG). But in 2025, researchers and OWASP flagged vector databases as a new attack surface — from embedding inversion (recovering sensitive training text) to poisoned vectors that hijack prompts. This talk demystifies these threats for practitioners and shows how to secure your RAG pipeline with real-world techniques like encrypted stores, anomaly detection, and retrieval validation. Attendees will leave with a practical security checklist for keeping embeddings safe while still unlocking the power of retrieval.