Data + AI Summit 2025

Thursday Keynote (Virtual Replay)

2025-06-13

keynote

AI/ML Databricks

Be first to witness the latest breakthroughs from Databricks and share the success of innovative data and AI companies.

Summit Live: A Conversation With AI influencer Josue Bogran

2025-06-12 Watch

talk

Josue Bogran (JosueBogran.com & zeb.co)

AI/ML

Josue is well known for his practical perspectives on the data and AI landscape. We'll talk about what he is seeing in the market and his take on product feature updates, with some humor mixed in.

Summit Live: Women In Data and AI Conversation

2025-06-12 Watch

talk

Lisa Cohen (Anthropic), Kate Ostbye (Pfizer), Holly Smith (Databricks), Pallavi Koppol (Databricks)

AI/ML Databricks

Each year at Summit, Women in Data and AI hosts a half day of in-person programming: discussions on empowerment, the Women in Data and AI Breakfast, and networking with like-minded professionals and trailblazers. For this virtual discussion, hear from Kate Ostbye (Pfizer), Lisa Cohen (Anthropic), Pallavi Koppol and Holly Smith (Databricks) about navigating challenges, celebrating successes, and inspiring one another as we champion diversity and innovation in data together. You'll also learn how to get involved year-round.

Route to Success: Scalable Routing Agents With Databricks and DSPy

2025-06-12 Watch

lightning_talk

Luis Moros (Databricks)

AI/ML Databricks GenAI

As companies increasingly adopt Generative AI, they're faced with a new challenge: managing multiple AI assistants. What if you could have a single, intuitive interface that automatically directs questions to the best assistant for the task? Join us to discover how to implement a flexible Routing Agent that streamlines working with multiple AI Assistants. We'll show you how to leverage Databricks and DSPy 3.0 to simplify adding this powerful pattern to your system. We'll dive into the essential aspects, including:
- Using DSPy optimizers to maximize correct route selections
- Optimizing smaller models to reduce latency
- Creating stateful interactions
- Designing for growth and adaptability to support tens or hundreds of AI Assistants
- Ensuring authorized access to AI Assistants
- Tracking performance in production environments
We'll share real-world examples that you can apply today. You'll leave with the knowledge to make your AI system run smoothly and efficiently.
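For readers unfamiliar with the routing pattern this abstract describes, here is a minimal sketch in plain Python. It is not the speaker's implementation and does not use DSPy: keyword overlap stands in for the optimized route selection the session covers, and the `Assistant` and `Router` names, the role strings, and the sample assistants are all hypothetical. It only illustrates two of the listed concerns, picking one assistant per question and checking authorization before dispatch.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, Set


@dataclass
class Assistant:
    """A hypothetical downstream assistant the router can dispatch to."""
    name: str
    keywords: Set[str]          # toy routing signal (assumption, not DSPy)
    allowed_roles: Set[str]     # roles permitted to reach this assistant
    handler: Callable[[str], str]


class Router:
    """Picks the authorized assistant whose keywords best match a question."""

    def __init__(self, fallback: Assistant) -> None:
        # The fallback is returned when no authorized assistant matches;
        # its own allowed_roles are not checked in this sketch.
        self.fallback = fallback
        self.assistants: Dict[str, Assistant] = {}

    def register(self, assistant: Assistant) -> None:
        # New assistants can be added without changing routing code.
        self.assistants[assistant.name] = assistant

    def route(self, question: str, role: str) -> Assistant:
        words = set(re.findall(r"[a-z0-9]+", question.lower()))
        best, best_score = self.fallback, 0
        for assistant in self.assistants.values():
            if role not in assistant.allowed_roles:
                continue  # skip assistants this caller may not use
            score = len(words & assistant.keywords)
            if score > best_score:
                best, best_score = assistant, score
        return best

    def ask(self, question: str, role: str) -> str:
        return self.route(question, role).handler(question)


# Hypothetical assistants for illustration only.
general = Assistant("general", set(), {"employee", "admin"}, lambda q: "general: " + q)
router = Router(fallback=general)
router.register(Assistant("hr", {"vacation", "payroll"}, {"employee", "admin"},
                          lambda q: "hr: " + q))
router.register(Assistant("finance", {"invoice", "budget"}, {"admin"},
                          lambda q: "finance: " + q))
```

In a real system the keyword score would presumably be replaced by a model-based classifier, which is where the optimizers mentioned in the abstract would come in; the registry and the authorization check could stay the same.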

Sponsored by: C2S Technologies Inc. | Qbeast: Lakehouse Acceleration as a Service

Achieve Your Mission With AI-Driven Decisions

2025-06-12 Watch

talk

Shannon Bisselink (Databricks), Spencer Schaefer (Federal Gov (VA) / Lunar Analytics (Ai)), Suresh Kaudi (World Bank), Andrew Hahn (Databricks)

AI/ML Cyber Security

Government leaders overwhelmingly recognize the potential benefits of AI as critical to long-term strategic goals of efficiency, but implementation challenges and security concerns could be obstacles to success.

AI Evaluation from First Principles: You Can't Manage What You Can't Measure

2025-06-12 Watch

talk

Pallavi Koppol (Databricks), Jonathan Frankle (Databricks)

AI/ML Databricks GenAI LLM

Is your AI evaluation process holding back your system's true potential? Many organizations struggle with improving GenAI quality because they don't know how to measure it effectively. This research session covers the principles of GenAI evaluation, offers a framework for measuring what truly matters, and demonstrates implementation using Databricks.
Key Takeaways:
- Practical approaches for establishing reliable metrics for subjective evaluations
- Techniques for calibrating LLM judges to enable cost-effective, scalable assessment
- Actionable frameworks for evaluation systems that evolve with your AI capabilities
Whether you're developing models, implementing AI solutions, or leading technical teams, this session will equip you to define meaningful quality metrics for your specific use cases and build evaluation systems that expose what's working and what isn't, transforming AI guesswork into measurable success.
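One common first step when calibrating an LLM judge is to check how closely its verdicts agree with human labels on the same examples. The sketch below computes Cohen's kappa, a chance-corrected agreement score, in plain Python. It is an illustration under assumptions, not the method presented in the session: the function name and the sample labels are hypothetical.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(human: Sequence[Hashable], judge: Sequence[Hashable]) -> float:
    """Chance-corrected agreement between two label sequences of equal length."""
    if len(human) != len(judge) or len(human) == 0:
        raise ValueError("label sequences must be non-empty and the same length")
    n = len(human)
    # Fraction of examples where the judge matches the human label.
    observed = sum(h == j for h, j in zip(human, judge)) / n
    # Agreement expected if both raters labeled independently
    # with their own observed label frequencies.
    human_counts, judge_counts = Counter(human), Counter(judge)
    labels = set(human_counts) | set(judge_counts)
    expected = sum(human_counts[l] * judge_counts[l] for l in labels) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (observed - expected) / (1.0 - expected)


# Hypothetical labels for four responses.
human_labels = ["pass", "pass", "fail", "fail"]
judge_labels = ["pass", "pass", "fail", "pass"]
kappa = cohens_kappa(human_labels, judge_labels)
```

A kappa near 1 suggests the judge tracks human raters closely; a value near 0 means agreement is no better than chance, which would argue for revising the judge prompt or rubric before relying on it at scale.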

Automating Taxonomy Generation With Compound AI on Databricks

2025-06-12 Watch

talk

Allistair Cota (Lovelytics), Sudhir Gajre (Lovelytics)

AI/ML API Databricks LLM

Taxonomy generation is a challenge across industries such as retail, manufacturing and e-commerce. Incomplete or inconsistent taxonomies can lead to fragmented data insights, missed monetization opportunities and stalled revenue growth. In this session, we will explore a modern approach to solving this problem by leveraging the Databricks platform to build a scalable compound AI architecture for automated taxonomy generation. The first half of the session will walk you through the business significance and implications of taxonomy, followed by a technical deep dive into building a compound AI architecture for taxonomy implementation on the Databricks platform. We will walk attendees through the anatomy of taxonomy generation, showcasing an innovative solution that combines multimodal and text-based LLMs, internal data sources and external API calls. This ensemble approach ensures more accurate, comprehensive and adaptable taxonomies that align with business needs.

Beyond Chatbots: Building Autonomous Insurance Applications With Agentic AI Framework

2025-06-12 Watch

talk

Amit Kumar Jha (Databricks), Marcela Granados (Databricks)

AI/ML BI Data Governance Databricks

The insurance industry is at the crossroads of digital transformation, facing challenges from market competition and customer expectations. While conventional ML applications have historically provided capabilities in this domain, the emergence of Agentic AI frameworks presents a revolutionary opportunity to build truly autonomous insurance applications. We will address issues related to data governance and quality while discussing how to monitor, evaluate and fine-tune models. We'll demonstrate the application of the agentic framework in the insurance context and how these autonomous agents can work collaboratively to handle complex insurance workflows, from submission intake and risk evaluation to expedited quote generation. This session demonstrates how to architect intelligent insurance solutions using Databricks Mosaic AI agentic core components including Unity Catalog, Playground, model evaluation/guardrails, privacy filters, AI functions and AI/BI Genie.

Breaking Up With Spark Versions: Client APIs, AI-Powered Automatic Updates, and Dependency Management for Databricks Serverless

2025-06-12 Watch

talk

Justin Breese (Databricks)

AI/ML API Databricks Spark

This session explains how we've made Apache Spark™ versionless for end users by introducing a stable client API, environment versioning and automatic remediation. These capabilities have enabled auto-upgrade of hundreds of millions of workloads with minimal disruption for Serverless Notebooks and Jobs. We'll also introduce a new approach to dependency management using environments. Admins will learn how to speed up package installation with Default Base Environments, and users will see how to manage custom environments for their own workloads.

Daft and Unity Catalog: A Multimodal/AI-Native Lakehouse

2025-06-12 Watch

talk

Jay Chia (Eventual)

AI/ML Analytics Big Data Data Analytics Data Lakehouse

Modern data organizations have moved beyond big data analytics to also incorporate advanced AI/ML data workloads. These workflows often involve multimodal datasets containing documents, images, long-form text, embeddings, URLs and more. Unity Catalog is an ideal solution for organizing and governing this data at scale. When paired with the Daft open source data engine, you can build a truly multimodal, AI-ready data lakehouse. In this session, we’ll explore how Daft integrates with Unity Catalog’s core features (such as volumes and functions) to enable efficient, AI-driven data lakehouses. You will learn how to ingest and process multimodal data (images, text and videos), run AI/ML transformations and feature extractions at scale, and maintain full control and visibility over your data with Unity Catalog’s fine-grained governance.

Evaluation-Driven Development Workflows: Best Practices and Real-World Scenarios

2025-06-12 Watch

talk

Wenwen Xie (Databricks), Arthur Dooner (Databricks)

AI/ML API LLM

In enterprise AI, Evaluation-Driven Development (EDD) ensures reliable, efficient systems by embedding continuous assessment and improvement into the AI development lifecycle. High-quality evaluation datasets are created using techniques like document analysis, synthetic data generation via Mosaic AI’s synthetic data generation API, SME validation, and relevance filtering, reducing manual effort and accelerating workflows. EDD focuses on metrics such as context relevance, groundedness, and response accuracy to identify and address issues like retrieval errors or model limitations. Custom LLM judges, tailored to domain-specific needs like PII detection or tone assessment, enhance evaluations. By leveraging tools like Mosaic AI Agent Framework, Agent Evaluation and MLflow, EDD automates data tracking, streamlines workflows, and quantifies improvements, transforming AI development to deliver scalable, high-performing systems that drive measurable organizational value.

Got Metrics? Build a Metric Store — A Tour of Developing Metrics Through UC Metric Views

2025-06-12 Watch

talk

Amit Pahwa (Databricks), Cristian Figueroa (Databricks)

AI/ML BI Databricks

I have metrics, you have metrics — we all have metrics. But the real problem isn’t having metrics, it’s that the numbers never line up, leading to endless cycles of reconciliation and confusion. Join us as we share how our Data Team at Databricks tackled this fundamental challenge in Business Intelligence by building an internal Metric Store — creating a single source of truth for all business metrics using the newly-launched UC Metric Views. Imagine a world where numbers always align, metric definitions are consistently applied across the organization and every metric comes with built-in ML-based forecasting, AI-powered anomaly detection and automatic explainability. That’s the future we’ve built — and we’ll show you how you can get started today.

Latest Innovations in AI/BI Dashboards and Genie

2025-06-12 Watch

talk

Miranda Luna (Databricks), Chao Cai (Databricks)

AI/ML Analytics BI Databricks

Discover how the latest innovations in Databricks AI/BI Dashboards and Genie are transforming self-service analytics. This session offers a high-level tour of new capabilities that empower business users to ask questions in natural language, generate insights faster and make smarter decisions. Whether you're a long-time Databricks user or just exploring what's possible with AI/BI, you'll walk away with a clear understanding of how these tools are evolving — and how to leverage them for greater business impact.

Low-Emission Oil & Gas: Engineering the Balance Between Clean and Reliable

2025-06-12 Watch

talk

Krishanu Roy (bp), Jay Yoon (NOV), Srinivas Chandolu (BP), Ali Marzban (NOV)

AI/ML Analytics GenAI Cyber Security

Join two energy industry leaders as they showcase groundbreaking applications of AI and data solutions in modern oil and gas operations. NOV demonstrates how their Generative AI pipeline revolutionized drilling mud report processing, automating the analysis of 300 reports daily with near-perfect accuracy and real-time analytics capabilities. BP shares how Unity Catalog has transformed their enterprise-wide data strategy, breaking down silos while maintaining robust governance and security. Together, these case studies illustrate how AI and advanced analytics are enabling cleaner, more efficient energy operations while maintaining the reliability demanded by today's market.

Revolutionizing Insurance: How to Drive Growth and Innovation

2025-06-12 Watch

talk

Anindita Mahapatra (Databricks), Porter Orr (The Standard Insurance Company), Kranthi Nekkalapu (Suncorp), Adrien de Nazelle (Oliver Wyman)

AI/ML Analytics Data Analytics Data Modelling

The insurance industry is rapidly evolving as advances in data and artificial intelligence (AI) drive innovation, enabling more personalized customer experiences, streamlined operations, and improved efficiencies. With powerful data analytics and AI-driven solutions, insurers can automate claims processing, enhance risk management, and make real-time decisions. Leveraging insights from large and complex datasets, organizations are delivering more customer-centric products and services than ever before.
Key takeaways:
- Real-world applications of data and AI in claims automation, underwriting, and customer engagement
- How predictive analytics and advanced data modeling help anticipate risks and meet customer needs
- Personalization of policies, optimized pricing, and more efficient workflows for greater ROI
Discover how data and AI are fueling growth, improving protection, and shaping the future of the insurance industry!

Scaling Smarter: Technical Dive Into How Databricks Optimizes Model Serving

2025-06-12 Watch

talk

Asfandyar Qureshi (Databricks)

AI/ML Databricks

Learn from the experts on how Databricks’ Mosaic AI Model Serving delivers unparalleled speed and scalability for deploying AI models. This session delves into the architecture and innovations that showcase the impressive improvements in throughput for the AI-serving infrastructure that powers Mosaic AI.

Securely Deploying AI/BI to All Users in Your Enterprise

2025-06-12 Watch

talk

Austin Green (Databricks), Keegan Dubbs (Databricks)

AI/ML BI Databricks Cyber Security

Bringing AI/BI to every business user starts with getting security, access and governance right. In this session, we’ll walk through the latest best practices for configuring Databricks accounts, setting up workspaces, and managing authentication protocols to enable secure and scalable onboarding. Whether you're supporting a small team or an entire enterprise, you'll gain practical insights to protect your data while ensuring seamless and governed access to AI/BI tools.

The New Competitive Edge: Building Resilient Supply Chains With Data + AI

2025-06-12 Watch

talk

Dee Fitzgerald (Danone), Andy Hancock (SAP), Usman Zubair (Databricks)

AI/ML GenAI

Consumer-facing industries are evolving faster than ever — and in today’s competitive landscape, it’s supply chains, not companies, that are truly competing. While data and AI offer huge potential for optimization, many organizations struggle to turn use cases into real business impact. In this session, leaders from retail, consumer goods, travel and hospitality will share how they’re building strong data foundations to unlock AI-driven supply chain optimization. Learn how they're using generative AI to boost productivity, streamline operations and improve insights through better data collaboration.

Tracking Data and AI Lineage: Ensuring Transparency and Compliance

2025-06-12 Watch

talk

Prithvi Kannan (Databricks), Murt Neemuchwala (Databricks)

AI/ML Databricks

As AI becomes more deeply integrated into data platforms, understanding where data comes from — and where it goes — is essential for ensuring transparency, compliance and trust. In this session, we’ll explore the newest advancements in data and AI lineage across the Databricks Platform, including during model training, evaluation and inference. You’ll also learn how lineage system tables can be used for impact analysis and to gain usage insights across your data estate. We’ll cover newly released capabilities — such as Bring Your Own Lineage — that enable an end-to-end view of your data and AI assets in Unity Catalog. Plus, get a sneak peek at what’s coming next on the lineage roadmap!

Unlocking the Databricks Marketplace: A Hands-On Guide for Data Consumers and Providers

2025-06-12 Watch

talk

Tia Chang (Databricks)

AI/ML Databricks

Curious about how to get real value from the Databricks Marketplace—whether you're consuming data or sharing it? This demo-heavy session answers the top 10 questions we hear from both data consumers and providers, with real examples you can put into practice right away. We’ll show consumers how to find the right product listing whether that's tables, files, AI models, solution accelerators, or Partner Connect integrations, try them out using sample notebooks, and access them with ease. You’ll also see how the Private Marketplace helps teams work more efficiently with a curated catalog of approved data. For providers, learn how to list your product in a way that stands out, use notebooks and documentation to help users get started, reach new audiences, and securely share data across your company or with trusted partners using the Private Marketplace. If you’ve ever asked, “How do I get started?” or “How do I make my data available internally or externally?”—this session has the answers, with demos to match.

What’s New in Databricks SQL: Latest Features and Live Demos

2025-06-12 Watch

talk

Gaurav Saraf (Databricks), Kent Marten (Databricks)

AI/ML BI Databricks SQL Data Streaming

Databricks SQL has added significant features at a fast pace over the last year. This session will share the most impactful features and the customer use cases that inspired them. We will highlight the new SQL editor, SQL coding features, streaming tables and materialized views, BI integrations, cost management features, system tables and observability features, and more. We will also share AI-powered performance optimizations.

Kill Bill-ing? Revenge is a Dish Best Served Optimized with GenAI

2025-06-12 Watch

lightning_talk

Abdul Furkhan (Sportsbet)

AI/ML Cloud Computing Data Engineering Databricks GenAI Spark

In an era where cloud costs can spiral out of control, Sportsbet achieved a remarkable 49% reduction in Total Cost of Ownership (TCO) through an innovative AI-powered solution called 'Kill Bill.' This presentation reveals how we transformed Databricks' consumption-based pricing model from a challenge into a strategic advantage through intelligent automation and optimization. In this session, you will:
- Understand how to use GenAI to reduce Databricks TCO
- Leverage generative AI within Databricks solutions to enable automated analysis of cluster logs, resource consumption, configurations, and codebases, and to provide Spark optimization suggestions
- Create AI agentic workflows by integrating Databricks' AI tools and Databricks Data Engineering tools
- Review a case study demonstrating how Total Cost of Ownership was reduced in practice
Attendees will leave with a clear understanding of how to implement AI within Databricks solutions to address similar cost challenges in their environments.

Sponsored by: Galileo Technologies Inc. | Taming Rogue AI Agents with Observability-Driven Evaluation

Sponsored by: Twilio | From Data to Impact: Scaling AI with Unified Customer Intelligence
