Databricks

Next-Gen Data Science: How Posit and Databricks Are Transforming Analytics at Scale

2025-06-11 · Data + AI Summit 2025 Watch

lightning_talk

by James Blair (Posit, PBC)

Analytics API Data Science Python

Modern data science teams face the challenge of navigating complex landscapes of languages, tools and infrastructure. Positron, Posit’s next-generation IDE, offers a powerful environment tailored for data science, seamlessly integrating with Databricks to empower teams working in Python and R. Now integrated within Posit Workbench, Positron enables data scientists to efficiently develop, iterate and analyze data with Databricks — all while maintaining their preferred workflows. In this session, we’ll explore how Python and R users can develop, deploy and scale their data science workflows by combining Posit tools with Databricks. We’ll showcase how Positron simplifies development for both Python and R and how Posit Connect enables seamless deployment of applications, reports and APIs powered by Databricks. Join us to see how Posit + Databricks create a frictionless, scalable and collaborative data science experience — so your teams can focus on insights, not infrastructure.

No-Trust, All Value: Monetizing Analytics With Databricks Clean Rooms

2025-06-11 · Data + AI Summit 2025 Watch

lightning_talk

by Eddie Edgeworth (Koantek)

Analytics Delta

In a world where data collaboration is essential but trust is scarce, Databricks Clean Rooms delivers a game-changing model: no data shared, all value gained. Discover how data providers can unlock new revenue streams by launching subscription-based analytics and “Built-on-Databricks” services that run on customer data — without exposing raw data or violating compliance. Clean Rooms integrates Unity Catalog’s governance, Delta Sharing’s secure exchange and serverless compute to enable true multi-party collaboration — without moving data. See how privacy-preserving models like fraud detection, clinical analytics and ad measurement become scalable, productizable and monetizable across industries. Walk away with a proven pattern to productize analytics, preserve compliance and turn trustless collaboration into recurring revenue.

Scaling Blockchain ML With Databricks: From Graph Analytics to Graph Machine Learning

2025-06-11 · Data + AI Summit 2025 Watch

lightning_talk

by Indra Rustandi (Coinbase)

AI/ML Analytics Blockchain Delta NFT

Coinbase leverages Databricks to scale ML on blockchain data, turning vast transaction networks into actionable insights. This session explores how Databricks’ scalable infrastructure, powered by Delta Lake, enables real-time processing for ML applications like NFT floor price predictions. We’ll show how GraphFrames helps us analyze billion-node transaction graphs (e.g., Bitcoin) for clustering and fraud detection, uncovering structural patterns in blockchain data. But traditional graph analytics has limits. We’ll go further with Graph Neural Networks (GNNs) using Kumo AI, which learn from the transaction network itself rather than relying on hand-engineered features. By encoding relationships directly into the model, GNNs adapt to new fraud tactics, capturing subtle relationships that evolve over time. Join us to see how Coinbase is advancing blockchain ML with Databricks and deep learning on graphs.

Sponsored by: Acceldata | Agentic Data Management: Trusted Data for Enterprise AI on Databricks

A Japanese Mega-Bank’s Journey to a Modern, GenAI-Powered, Governed Data Platform

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Anshul Wadhawan (Deloitte Consulting LLP) , Gordon Wilson (Sumitomo Mitsui Banking Corporation)

Azure Cloud Computing Delta GenAI

SMBC, a major Japanese multinational financial services institution, has embarked on an initiative to build a GenAI-powered, modern and well-governed cloud data platform on Azure/Databricks. This initiative aims to build an enterprise data foundation encompassing loans, deposits, securities, derivatives, and other data domains. Its primary goals are: To decommission legacy data platforms and reduce data sprawl by migrating 20+ core banking systems to a multi-tenant Azure Databricks architecture To leverage Databrick’s delta-share capabilities to address SMBC’s unique global footprint and data sharing needs To govern data by design using Unity Catalog To achieve global adoption of the frameworks, accelerators, architecture and tool stack to support similar implementations across EMEA Deloitte and SMBC leveraged the Brickbuilder asset “Data as a Service for Banking” to accelerate this highly strategic transformation.

American Airlines Flies to New Heights with Data Intelligence

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Saimahesh Chava (American Airlines) , Yash Joshi (Accenture)

API Data Governance GitHub Hive

American Airlines migrated from Hive Metastore to Unity Catalog using automated processes with Databricks APIs and GitHub Actions. This automation streamlined the migration for many applications within AA, ensuring consistency, efficiency and minimal disruption while enhancing data governance and disaster recovery capabilities.

Building and Scaling Production AI Systems With Mosaic AI

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Kasey Uhlenhuth (Databricks)

AI/ML

Ready to go beyond the basics of Mosaic AI? This session will walk you through how to architect and scale production-grade AI systems on the Databricks Data Intelligence Platform. We’ll cover practical techniques for building end-to-end AI pipelines — from processing structured and unstructured data to applying Mosaic AI tools and functions for model development, deployment and monitoring. You’ll learn how to integrate experiment tracking with MLflow, apply performance tuning and use built-in frameworks to manage the full AI lifecycle. By the end, you’ll be equipped to design, deploy and maintain AI systems that deliver measurable outcomes at enterprise scale.

Building Tool-Calling Agents With Databricks Agent Framework and MCP

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Siddharth Murching (Databricks) , Elise Gonzales (Databricks)

AI/ML API Cyber Security

Want to create AI agents that can do more than just generate text? Join us to explore how combining Databricks' Mosaic AI Agent Framework with the Model Context Protocol (MCP) unlocks powerful tool-calling capabilities. We'll show you how MCP provides a standardized way for AI agents to interact with external tools, data and APIs, solving the headache of fragmented integration approaches. Learn to build agents that can retrieve both structured and unstructured data, execute custom code and tackle real enterprise challenges. Key takeaways: Implementing MCP-enabled tool-calling in your AI agents Prototyping in AI Playground and exporting for deployment Integrating Unity Catalog functions as agent tools Ensuring governance and security for enterprise deployments Whether you're building customer service bots or data analysis assistants, you'll leave with practical know-how to create powerful, governed AI agents.

ClickHouse and Databricks for Real-Time Analytics

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Melvyn Peignon (ClickHouse)

Analytics ClickHouse Data Lakehouse Delta

ClickHouse is a C++ based, column-oriented database built for real-time analytics. While it has its own internal storage format, the rise of open lakehouse architectures has created a growing need for seamless interoperability. In response, we have developed integrations with your favorite lakehouse ecosystem to enhance compatibility, performance and governance. From integrating with Unity Catalog to embedding the Delta Kernel into ClickHouse, this session will explore the key design considerations behind these integrations, their benefits to the community, the lessons learned and future opportunities for improved compatibility and seamless integration.

Driving Secure AI Innovation with Obsidian Security, Databricks, and PointGuard AI

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Alfredo Hickman (Obsidian Security) , JD Braun (Databricks) , Mali Gorantla (PointGuard AI)

AI/ML LLM Cyber Security

As enterprises adopt AI and Large Language Models (LLMs), securing and governing these models - and the data used to train them - is essential. In this session, learn how Databricks Partner PointGuard AI helps organizations implement the Databricks AI Security Framework to manage AI-specific risks, ensuring security, compliance, and governance across the entire AI lifecycle. Then, discover how Obsidian Security provides a robust approach to AI security, enabling organizations to confidently scale AI applications.

End-to-End Interoperable Data Platform: How Bosch Leverages Databricks Supply Chain Consolidation

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Satish Karunakaran (Robert Bosch GmbH) , Marc-Alexander Frey (Robert Bosch GmbH)

Data Lakehouse dbt LLM

This session will showcase Bosch’s journey in consolidating supply chain information using the Databricks platform. It will dive into how Databricks not only acts as the central data lakehouse but also integrates seamlessly with transformative components such as dbt and Large Language Models (LLMs). The talk will highlight best practices, architectural considerations, and the value of an interoperable platform in driving actionable insights and operational excellence across complex supply chain processes. Key Topics and Sections Introduction & Business Context Brief Overview of Bosch’s Supply Chain Challenges and the Need for a Consolidated Data Platform. Strategic Importance of Data-Driven Decision-Making in a Global Supply Chain Environment. Databricks as the Core Data Platform Integrating dbt for Transformation Leveraging LLM Models for Enhanced Insights

Entity Resolution for the Best Outcomes on Your Data

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Ninad Sohoni (Databricks) , Yinxi Zhang (Databricks)

There are many ways to implement entity resolution (ER) system — both using vendor software and open-source libraries that enable DIY Entity Resolution. However, generally we see common challenges with any approach — scalability, bound to a single model architecture, lack of metrics and explainability, and stagnant implementations that do not "learn" with experience. Recent experiments with transformer-based approaches, fast lookups with vector search and Databricks components such as Databricks Apps and Agent Eval provide the foundations for a composable ER system that can get better with time on your data. In this presentation, we include a demo of how to use these components to build a composable ER that has the best outcomes for your data.

Evolving Data Insights With Privacy at Mastercard

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Spencer Cook (Databricks) , john Derrico (Mastercard)

AI/ML BI

Mastercard is a global technology company whose role is anchored in trust. It supports 3.4 billion cards and over 143 billion transactions annually. To address customers’ increasing data volume and complex privacy needs, Mastercard has developed a novel service atop Databricks’ Clean Rooms and broader Data Intelligence Platform. This service combines several Databricks components with Mastercard’s IP, providing an evolved method for data-driven insights and value-added services while ensuring a unique standalone turnkey service. The result is a secure environment where multiple parties can collaborate on sensitive data without directly accessing each other’s information. After this session, attendees will understand how Mastercard used its expertise in privacy-enhancing technologies to create collaboration tools powered by Databricks’ Clean Rooms, AI/BI, Apps, Unity Catalog, Workflows and DatabricksIQ — as well as how to take advantage of this new privacy-enhancing service directly.

Hands-On Learning: AI Agents Workshop: Create, Evaluate, and Deploy using Mosaic AI

2025-06-11 · Data + AI Summit 2025

workshop

by Amber Roberts (Databricks) , Nicolas Pelaez (Databricks)

AI/ML Python SQL

Looking for a practical workshop on building an AI Agent on Databricks? Well, we have just the thing for you.This hands-on workshop takes you through the process of creating intelligent agents that can reason their way to useful outcomes. You'll start by building your own toolkit of SQL and Python functions that give your agent practical capabilities. Then we'll explore how to select the right foundation model for your needs, connect your custom tools, and watch as your agent tackles complex challenges through visible reasoning paths.The workshop doesn't just stop at building—you'll dive into evaluation techniques using evaluation datasets to identify where your agent shines and where it needs improvement. After implementing and measuring your changes, we'll explore deployment strategies, including a feedback collection interface that enables continuous improvement and governance mechanisms to ensure responsible AI usage in production environments.

Hands-on Learning: Databricks SQL in Action: Intelligent Data Warehousing, Analytics and BI Workshop (repeat)

2025-06-11 · Data + AI Summit 2025

workshop

by Pearl Ubaru (Databricks)

AI/ML Analytics BI Cloud Computing Data Lake Data Lakehouse DWH SQL

Most organizations run complex cloud data architectures that silo applications, users and data. Join this interactive hands-on workshop to learn how Databricks SQL allows you to operate a multi-cloud lakehouse architecture that delivers data warehouse performance at data lake economics — with up to 12x better price/performance than traditional cloud data warehouses. Here’s what we’ll cover: How Databricks SQL fits in the Data Intelligence Platform, enabling you to operate a multicloud lakehouse architecture that delivers data warehouse performance at data lake economics How to manage and monitor compute resources, data access and users across your lakehouse infrastructure How to query directly on your data lake using your tools of choice or the built-in SQL editor and visualizations How to use AI to increase productivity when querying, completing code or building dashboards Ask your questions during this hands-on lab, and the Databricks experts will guide you.

Harnessing Databricks Asset Bundles: Transforming Pipeline Management at Scale at Stack Overflow

2025-06-11 · Data + AI Summit 2025 Watch

talk

by Chelsea Zhang (Stack Overflow)

Data Engineering Cyber Security

Discover how Stack Overflow optimized its data engineering workflows using Databricks Asset Bundles (DABs) for scalable and efficient pipeline deployments. This session explores the structured pipeline architecture, emphasizing code reusability, modular design and bundle variables to ensure clarity and data isolation across projects. Learn how the data team leverages enterprise infrastructure to streamline deployment across multiple environments. Key topics include DRY-principled modular design, essential DAB features for automation and data security strategies using Unity Catalog. Designed for data engineers and teams managing multi-project workflows, this talk offers actionable insights on optimizing pipelines with Databricks evolving toolset.

How Ad Tech Runs on Databricks

2025-06-11 · Data + AI Summit 2025 Watch

talk

by anj yamsani (Influential (Publicis)) , Kayvon Raphael (Magnite) , Abhishek Yadav (LG Ads) , Scott Simony (Databricks) , Cyrus Mohammadian (Influential) , Atul Saurabh (MiQ)

Marketing

Discover how leading ad techs and agencies — including Magnite, LG Ads, MiQ and Publicis Influential — leverage Databricks to power the advertising and marketing ecosystem.

talk-data.com

Activity Trend

Top Events

Top Speakers

Next-Gen Data Science: How Posit and Databricks Are Transforming Analytics at Scale

No-Trust, All Value: Monetizing Analytics With Databricks Clean Rooms

Scaling Blockchain ML With Databricks: From Graph Analytics to Graph Machine Learning

Sponsored by: Acceldata | Agentic Data Management: Trusted Data for Enterprise AI on Databricks

Sponsored by: dbt Labs | Leveling Up Data Engineering at Riot: How We Rolled Out dbt and Transformed the Developer Experience

Sponsored by: Google Cloud | Powering AI & Analytics: Innovations in Google Cloud Storage for Data Lakes

Sponsored by: Qubika | Agentic AI In Finance: How To Build Agents Using Databricks And LangGraph

A Japanese Mega-Bank’s Journey to a Modern, GenAI-Powered, Governed Data Platform

American Airlines Flies to New Heights with Data Intelligence

Building and Scaling Production AI Systems With Mosaic AI

Building Tool-Calling Agents With Databricks Agent Framework and MCP

ClickHouse and Databricks for Real-Time Analytics

Driving Secure AI Innovation with Obsidian Security, Databricks, and PointGuard AI

End-to-End Interoperable Data Platform: How Bosch Leverages Databricks Supply Chain Consolidation

Entity Resolution for the Best Outcomes on Your Data

Evolving Data Insights With Privacy at Mastercard

Hands-On Learning: AI Agents Workshop: Create, Evaluate, and Deploy using Mosaic AI

Hands-on Learning: Databricks SQL in Action: Intelligent Data Warehousing, Analytics and BI Workshop (repeat)

Harnessing Databricks Asset Bundles: Transforming Pipeline Management at Scale at Stack Overflow

How Ad Tech Runs on Databricks