talk-data.com talk-data.com

Event

Data + AI Summit 2025

2025-06-09 – 2025-06-13 Databricks Summit Visit website ↗

Activities tracked

98

Filtering by: GenAI ×

Sessions & talks

Showing 51–75 of 98 · Newest first

Search within this event →
Leveraging GenAI for Synthetic Data Generation to Improve Spark Testing and Performance in Big Data

Leveraging GenAI for Synthetic Data Generation to Improve Spark Testing and Performance in Big Data

2025-06-11 Watch
lightning_talk
Satej Kumar Sahu (Zalando SE)

Testing Spark jobs in local environments is often difficult due to the lack of suitable datasets, especially under tight timelines. This creates challenges when jobs work in development clusters but fail in production, or when they run locally but encounter issues in staging clusters due to inadequate documentation or checks. In this session, we’ll discuss how these challenges can be overcome by leveraging Generative AI to create custom synthetic datasets for local testing. By incorporating variations and sampling, a testing framework can be introduced to solve some of these challenges, allowing for the generation of realistic data to aid in performance and load testing. We’ll show how this approach helps identify performance bottlenecks early, optimize job performance and recognize scalability issues while keeping costs low. This methodology fosters better deployment practices and enhances the reliability of Spark jobs across environments.

Retail Genie: No-Code AI Apps for Empowering BI Users to be Self-Sufficient

Retail Genie: No-Code AI Apps for Empowering BI Users to be Self-Sufficient

2025-06-11 Watch
talk
Harish Rajagopalan (Databricks) , Siddhesh Pore (Databricks)

Explore how Databricks AI/BI Genie revolutionizes retail analytics, empowering business users to become self-reliant data explorers. This session highlights no-code AI apps that create a conversational interface for retail data analysis. Genie spaces harness NLP and generative AI to convert business questions into actionable insights, bypassing complex SQL queries. We'll showcase retail teams effortlessly analyzing sales trends, inventory and customer behavior through Genie's intuitive interface. Witness real-world examples of AI/BI Genie's adaptive learning, enhancing accuracy and relevance over time. Learn how this technology democratizes data access while maintaining governance via Unity Catalog integration. Discover Retail Genie's impact on decision-making, accelerating insights and cultivating a data-driven retail culture. Join us to see the future of accessible, intelligent retail analytics in action.

Revolutionizing Banking Data, Analytics and AI: Building an Enterprise Data Hub With Databricks

Revolutionizing Banking Data, Analytics and AI: Building an Enterprise Data Hub With Databricks

2025-06-11 Watch
talk
Shailender Sidhu (Deloitte) , Mohan Sankararaman (First Horizon Bank) , Jamie Cosgrove (Databricks)

Explore the transformative journey of a regional bank as it modernizes its enterprise data infrastructure amidst the challenges of legacy systems and past mergers and acquisitions. The bank is creating an Enterprise Data Hub using Deloitte's industry experience and the Databricks Data Intelligence Platform to drive growth, efficiency and Large Financial Institution readiness needs. This session will showcase how the new data hub will be a one-stop-shop for LOB and enterprise needs, while unlocking the advanced analytics and GenAI possibilities. Discover how this initiative is going to empower the ambitions of a regional bank to realize their “big bank muscle, small bank hustle.”

Sponsored by: Informatica | Power Analytics and AI on Databricks With Master (Golden) Record Data

Sponsored by: Informatica | Power Analytics and AI on Databricks With Master (Golden) Record Data

2025-06-11 Watch
lightning_talk
Ajay GOLLAPALLI (Informatica)

Supercharge advanced analytics and AI insights on Databricks with accurate and consistent master data. This session explores how Informatica’s Master Data Management (MDM) integrates with Databricks to provide high-quality, integrated golden record data like customer, supplier, product 360 or reference data to support downstream analytics, Generative AI and Agentic AI. Enterprises can accelerate and de-risk the process of creating a golden record via a no-code/low-code interface, allowing data teams to quickly integrate siloed data and create a complete and consistent record that improves decision-making speed and accuracy.

Learn How the Virtue Foundation Saves Lives by Optimizing Health Care Delivery Across the Globe

Learn How the Virtue Foundation Saves Lives by Optimizing Health Care Delivery Across the Globe

2025-06-11 Watch
lightning_talk
Joan LaRovere (Virtue Foundation)

The Virtue Foundation uses cutting-edge techniques in AI to optimize global health care delivery to save lives. With Unity Catalog as a foundation, they are using advanced Gen AI with model serving, vector search and MLflow to radically change how they map volunteer health resources with the right locations and facilities. Audio for this session is delivered in the conference mobile app, you must bring your own headphones to listen.

GenAI-Powered Shopping Assistant for Prada e-Commerce Search Bar

2025-06-11
talk
Simone Giordani (Data Reply IT) , Maria Paola Tatulli (Prada Group)

Prada has developed a complex solution, leveraging MosaicAI to propose an interactive and natural language product discovery capability that could improve its e-commerce search bar. The backbone is a 70B model and a Vector Store, which collaborates with additional filterings and AI solutions to suggest not only the perfect outfit for each occasion, but also provide alternative solutions and similar items.

Transforming Data Governance for Multimodal Data at Amgen With Databricks

Transforming Data Governance for Multimodal Data at Amgen With Databricks

2025-06-11 Watch
talk
Jaison Dominic (Amgen) , Jinesh Kunjumon (AMGEN)

Amgen is advancing its Enterprise Data Fabric to securely manage sensitive multimodal data, such as imaging and research data, across formats.Databricks is already the de facto standard for governance on structured data, and Amgen seeks to extend it for unstructured multi modal data too. This approach will also allow Amgen to standardize its GenAI projects on Databricks. Key priorities include: Centralized data access: establishing a unified, secure access control system Enhanced traceability: implementing detailed processes for transparency and accountability Consistent access standards: ensuring uniform data access privilege experience User support: providing flexible access for diverse stakeholders Comprehensive auditing: enabling thorough permission audits and data usage tracking Learn strategies for implementing a comprehensive multimodal data governance framework using Databricks, as we share our experience on standardizing data governance for GenAI use cases.

No Time for the Dad Bod: Automating Life with AI and Databricks

No Time for the Dad Bod: Automating Life with AI and Databricks

2025-06-10 Watch
lightning_talk
Sean Falconer (Confluent)

Life as a father, tech leader, and fitness enthusiast demands efficiency. To reclaim my time, I’ve built AI-driven solutions that automate everyday tasks—from research agents that prep for podcasts to multi-agent systems that plan meals—all powered by real-time data and automation. This session dives into the technical foundations of these solutions, focusing on event-driven agent design and scalable patterns for robust AI systems. You’ll discover how Databricks technologies like Delta Lake, for reliable and scalable data management, and DSPy, for streamlining the development of generative AI workflows, empower seamless decision-making and deliver actionable insights. Through detailed architecture diagrams and a live demo, I’ll showcase how to design systems that process data in motion to tackle complex, real-world problems. Whether you’re an engineer, architect, or data scientist, you’ll leave with practical strategies to integrate AI-driven automation into your workflows.

Accelerate End-to-End Multi-Agents on Databricks and DSPy

Accelerate End-to-End Multi-Agents on Databricks and DSPy

2025-06-10 Watch
talk
Austin Choi (Databricks)

A production-ready GenAI application is more than the framework itself. Like ML, you need a unified platform to create an end-to-end workflow for production quality applications.Below is an example of how this works on Databricks: Data ETL with Lakeflow Declarative Pipelines and jobs Data storage for governance and access with Unity Catalog Code development with Notebooks Agent versioning and metric tracking with MLflow and Unity Catalog Evaluation and optimizations with Mosaic AI Agent Framework and DSPy Hosting infrastructure with monitoring with Model Serving and AI Gateway Front-end apps using Databricks Apps In this session, learn how to build agents to access all your data and models through function calling. Then, learn how DSPy enables agent interaction with each other to ensure the question is answered correctly. We will demonstrate a chatbot, powered by multiple agents, to be able to answer questions and reason answers the base LLM does not know and very specialized topics.ow and very specialized topics.

AI Meets SQL: Leverage GenAI at Scale to Enrich Your Data

AI Meets SQL: Leverage GenAI at Scale to Enrich Your Data

2025-06-10 Watch
talk
Sid Taneja (Databricks) , Youngbin Kim (Databricks)

This session is repeated. Integrating AI into existing data workflows can be challenging, often requiring specialized knowledge and complex infrastructure. In this session, we'll share how SQL users can leverage AI/ML to access large language models (LLMs) and traditional machine learning directly from within SQL, simplifying the process of incorporating AI into data workflows. We will demonstrate how to use Databricks SQL for natural language processing, traditional machine learning, retrieval augmented generation and more. You'll learn about best practices and see examples of solving common use cases such as opinion mining, sentiment analysis, forecasting and other common AI/ML tasks.

Cross-Region AI Model Deployment for Resiliency and Compliance

Cross-Region AI Model Deployment for Resiliency and Compliance

2025-06-10 Watch
talk
Greg Wood (Databricks) , Tony Farias (Databricks)

AI for enterprises, particularly in the era of GenAI, requires rapid experimentation and the ability to productionize models and agents quickly and at scale. Compliance, resilience and commercial flexibility drive the need to serve models across regions. As cloud providers struggle with rising demand for GPUs in environments, VM shortages have become commonplace, and add to the pressure of general cloud outages. Enterprises that can quickly leverage GPU capacity in other cloud regions will be better equipped to capitalize on the promise of AI, while staying flexible to serve distinct user bases and complying with regulations. In this presentation we will show and discuss how to implement AI deployments across cloud regions, deploying a model across regions and using a load balancer to determine where to best route a user request.

Databricks on Databricks: Powering Marketing Insights with Lakehouse

Databricks on Databricks: Powering Marketing Insights with Lakehouse

2025-06-10 Watch
talk
Elizabeth Dobbs (Databricks) , Anoop Muraleedharan (Databricks)

This presentation outlines the evolution of our marketing data strategy, focusing on how we’ve built a strong foundation using the Databricks Lakehouse. We will explore key advancements across data ingestion, strategy, and insights, highlighting the transition from legacy systems to a more scalable and intelligent infrastructure. Through real-world applications, we will showcase how unified Customer 360 insights drive personalization, predictive analytics enhance campaign effectiveness, and GenAI optimizes content creation and marketing execution. Looking ahead, we will demonstrate the next phase of our CDP, the shift toward an end-user-first analytics model powered by AIBI, Genie and Matik, and the growing importance of clean rooms for secure data collaboration. This is just the beginning, and we are poised to unlock even greater capabilities in the future.

RecSys, Topic Modeling and Agents: Bridging the GenAI-Traditional ML Divide

RecSys, Topic Modeling and Agents: Bridging the GenAI-Traditional ML Divide

2025-06-10 Watch
talk
Dan Pechi (Databricks)

The rise of GenAI has led to a complete reinvention of how we conceptualize Data + AI. In this breakout, we will recontextualize the rise of GenAI in traditional ML paradigms, and hopefully unite the pre- and post-LLM eras. We will demonstrate when and where GenAI may prove more effective than traditional ML algorithms, and highlight problems for which the wheel is unnecessarily being reinvented with GenAI. This session will also highlight how MLflow provides a unified means of benchmarking traditional ML against GenAI, and lay out a vision for bridging the divide between Traditional ML and GenAI practitioners.

Revolutionizing Nuclear AI With HiVE and Bertha on Databricks Architecture

Revolutionizing Nuclear AI With HiVE and Bertha on Databricks Architecture

2025-06-10 Watch
talk
Lou Martinez Sancho (Westinghouse Electric Company)

In this session we will explore the revolutionary advancements in nuclear AI capabilities with HiVE and Bertha on Databricks architecture. HiVE, developed by Westinghouse, leverages over a century of proprietary data to deliver unparalleled AI capabilities. At its core is Bertha, a generative AI model designed to tackle the unique challenges of the nuclear industry. This session will delve into the technical architecture of HiVE and Bertha, showcasing how Databricks' scalable environment enhances their performance. We will discuss the secure data infrastructure supporting HiVE, ensuring data integrity and compliance. Real-world applications and use cases will demonstrate the impact of HiVE and Bertha on improving efficiency, innovation and safety in nuclear operations. Discover how the fusion of HiVE and Bertha with Databricks architecture is transforming the nuclear AI landscape and driving the future of nuclear technology.

Shifting Left — Setting up Your GenAI Ecosystem to Work for Business Analysts

Shifting Left — Setting up Your GenAI Ecosystem to Work for Business Analysts

2025-06-10 Watch
talk
James Lin (Experian)

At Data and AI in 2022, Databricks pioneered the term to shift left in how AI workloads would enable less data science driven people to create their own apps. In 2025, we take a look at how Experian is doing on that journey. This session highlights Databricks services that assist with the shift left paradigm for Generative AI, including how AI/BI Genie helps with Generative analytics, and how Agent Studio helps with synthetic generation of test cases to validate model performance.

The Next Wave of AI Applications Driven by Agentic Workflow at Adidas Using Databricks

2025-06-10
talk
Joana Ferreira (Adidas AG) , Mahavir Teraiya (Databricks)

Curious to know how Adidas is transforming customer experience and business impact with agentic workflows, powered by Databricks? By leveraging cutting-edge tools like MosaicML’s deployment capabilities, Mosaic AI Gateway, and MLflow, Adidas built a scalable GenAI agentic infrastructure that delivers actionable insights from growing 2 million product reviews annually. With remarkable results: 60% latency reduction (15.5 seconds to 6 seconds) 91.67% cost savings (transitioning to more efficient LLMs) 98.5% token efficiency, reducing input tokens from 200k to just 3k 20% increase in productivity (faster time to insight) Empowering over 500 decision-makers across 150+ countries, this infrastructure is set to optimize products and services for Adidas’ 500 million members by 2025 while supporting dozens of upcoming AI-driven solutions. Join us to explore how Adidas turned agentic workflows infra into a strategic advantage using Databricks and learn how you can do the same!

Manufacturing and Transportation Industry Forum | Sponsored by: Deloitte and AWS

Manufacturing and Transportation Industry Forum | Sponsored by: Deloitte and AWS

2025-06-10 Watch
talk
Victor Dsouza (Applied Materials) , Richard Masters (Virgin Atlantic Airways) , Andy Isenman (Heathrow) , Dr. Andrej Levin (Boston Consulting Group) , Shiv Trisal (Databricks) , Caitlin Gordon (Databricks)

Join us for an inspiring forum showcasing how manufacturers and transportation leaders are turning today's challenges into tomorrow's opportunities. From automotive giants revolutionizing product development with generative AI to logistics providers optimizing routes for both cost and sustainability, discover how industry pioneers are reshaping the future of industrial operations. Highlighting this session is an exciting collaboration between Heathrow Airport and Virgin Atlantic, demonstrating how partnership and innovation are transforming the air travel experience. Learn how these leaders and other companies are using Databricks to tackle their most pressing challenges — from smart factory transformations to autonomous systems development — proving that the path to profitability and sustainability runs through intelligent operations.

Public Sector Industry Forum | Sponsored by: Deloitte and AWS

Public Sector Industry Forum | Sponsored by: Deloitte and AWS

2025-06-10 Watch
talk
Molly Just-Behr (Databricks) , Sanjeev Sharma (TriWest Healthcare Alliance) , Teneika Askew (Navy) , Mike Daniels (Databricks) , Todd Schroeder (Databricks) , Sujit Mohanty (Databricks)

Join the 60-minute kickoff session at the Public Sector Forum for an opportunity to to accelerate innovation into your enterprise through governance, compliance and GenAI. Featuring keynotes from data-driven agency leaders and providing a future-looking journey from Databricks, this event offers invaluable insights. Understand the outcomes of Data and AI powering transformation across common areas of government and beyond: Improving constituent experience Reducing cost and enhancing services Identifying fraud, waste and abuse Achieving scale and security You will not want to miss this exclusive opportunity to own your data and eliminate government silos. Discover the Data + AI Company with deep compliance experience and widespread adoption.

How Anthropic Transforms Financial Services Teams With GenAI

How Anthropic Transforms Financial Services Teams With GenAI

2025-06-10 Watch
lightning_talk
Reed Foster (Anthropic)

Learn how GenAI is being applied to financial services teams using Claude, an acknowledged leader in large language models. Integrated with the scale and security of the Databricks Data Intelligence Platform, we will share how Claude is enabling financial services organizations to streamline operations, maximize productivity for investment and compliance teams and in some cases turn traditional cost-centers into revenue drivers.

GenAI Observability in Customer Care

GenAI Observability in Customer Care

2025-06-10 Watch
talk
Matteo Ciccozzi (EarnIn) , Willem Dhaeseleer (EarnIn)

Customer support is going through the GenAI revolution, but how can we use AI to foster deeper empathy with our end users?To enable this, Earnin has built its GenAI observability platform on Databricks, leveraging Lakeflow Declarative Pipeliness, Kafka and Databricks AI/BI.This session covers how we use Lakeflow Declarative Pipelines to monitor our customer care chatbot in near real-time and how we leverage Databricks to better anticipate our customers' needs.

Managing Data and AI Security Risks With DASF 2.0 — and a Customer Story

Managing Data and AI Security Risks With DASF 2.0 — and a Customer Story

2025-06-10 Watch
talk
Arun Pamulapati (Databricks) , Joseph Raetano (US AI)

The Databricks Security team led a broad working group that significantly evolved the Databricks AI Security Framework (DASF) to its 2.0 version since its first release by closely collaborating with the top cyber security researchers at industry organizations such as OWASP, Gartner, NIST, HITRUST, FAIR Institute and several Fortune 100 companies to address the evolving risks and associated controls of AI systems in enterprises. Join us to to learn how The CLEVER GenAI pipeline, an AI-driven innovation in healthcare, processes over 1.5 million clinical notes daily to classify social determinants impacting veteran care while adhering to robust security measures like NIST 800-53 controls and by leveraging Databricks AI Security Framework. We will discuss robust AI security guidelines to help data and AI teams understand how to deploy their AI applications securely. This session will give a security framework for security teams, AI practitioners, data engineers and governance teams.

Sponsored by: Accenture & Avanade | Enterprise Scaling and Value of Generative AI and Agentic AI

Sponsored by: Accenture & Avanade | Enterprise Scaling and Value of Generative AI and Agentic AI

2025-06-10 Watch
talk
Venkatesh Rao (Accenture (HQ))

In this talk, we will explore the transformative potential of Generative AI and Agentic AI in driving enterprise-scale innovation and delivering substantial business value. As organizations increasingly recognize the power of AI to move beyond automation towards true augmentation and intelligent decision-making, understanding the nuances of scaling these advanced AI paradigms becomes critical. We will delve into practical strategies for deploying, managing, and optimizing Agentic AI frameworks showcasing how autonomous, goal-directed AI systems can unlock new efficiencies, enhance customer experiences, and foster continuous innovation. Through real-world case studies and actionable insights, attendees will gain a comprehensive understanding of the key considerations to architect, implement, and measure the ROI of large-scale Generative and Agentic AI initiatives, positioning their enterprises for sustained growth and competitive advantage in the AI-first era.

Sponsored by: AWS | Deploying a GenAI Agent using Databricks Mosaic AI, Anthropic, LangGraph, and Amazon Bedrock

Sponsored by: AWS | Deploying a GenAI Agent using Databricks Mosaic AI, Anthropic, LangGraph, and Amazon Bedrock

2025-06-10 Watch
lightning_talk

In this session, you’ll see how to build and deploy a GenAI agent and Model Context Protocol (MCP) with Databricks, Anthropic, Mosaic External AI Gateway, and Amazon Bedrock. You will learn the architecture, best-practices of using Databricks Mosaic AI, Anthropic Sonnet 3.7 first-party frontier model, and LangGraph for custom workflow orchestration in Databricks Data Intelligence Platform. You’ll also see how to use Databricks Mosaic AI to provide agent evaluation and monitoring. In addition, you will also see how inline agent will use MCP to provide tools and other resources using Amazon Nova models with Amazon Bedrock inline agent for deep research. This approach gives you the flexibility of LangGraph, the powerful managed agents offered by Amazon Bedrock, and Databricks Mosaic AI’s operational support for evaluation and monitoring.

Gen AI Deployment and Monitoring

2025-06-10
talk

This course introduces learners to deploying, operationalizing, and monitoring generative artificial intelligence (AI) applications. First, learners will develop knowledge and skills in deploying generative AI applications using tools like Model Serving. Next, the course will discuss operationalizing generative AI applications following modern LLMOps best practices and recommended architectures. Finally, learners will be introduced to the idea of monitoring generative AI applications and their components using Lakehouse Monitoring. Pre-requisites: Familiarity with prompt engineering and retrieval-augmented generation (RAG) techniques, including data preparation, embeddings, vectors, and vector databases. A foundational knowledge of Databricks Data Intelligence Platform tools for evaluation and governance (particularly Unity Catalog). Labs: Yes Certification Path: Databricks Certified Generative AI Engineer Associate

ReguBIM AI – Transforming BIM, Engineering, and Code Compliance with Generative AI

ReguBIM AI – Transforming BIM, Engineering, and Code Compliance with Generative AI

2025-06-10 Watch
lightning_talk
Qi Qi Oh (Exyte Singapore Pte. Ltd.)

At Exyte, we design, engineer, and deliver ultra-clean and sustainable facilities for high-tech industries. One of the most complex tasks our engineers and designers face is ensuring that their building designs comply with constantly evolving codes and regulations – often a manual, error-prone process. To address this, we developed ReguBIM AI, a generative AI-powered assistant that helps our teams verify code compliance more efficiently and accurately by linking 3D Building Information Modeling (BIM) data with regulatory documents. Built on the Databricks Data Intelligence Platform, ReguBIM AI is part of our broader vision to apply AI meaningfully across engineering and design processes. We are proud to share that ReguBIM AI won the Grand Prize and EMEA Winner titles at the Databricks GenAI World Cup 2024 — a global hackathon that challenged over 1,500 data scientists and AI engineers from 18 countries to create innovative generative AI solutions for real-world problems.