Databricks

Overview of Agent Bricks

2025-07-21 · July 2025 - AI Summer Special! ☀️

talk

by Gavi Regunath (Advancing Analytics)

agent bricks ai

Overview of Agent Bricks with Gavi Regunath, Databricks MVP and Chief AI Officer at Advancing Analytics

Cost-effective AI on Databricks

2025-07-21 · July 2025 - AI Summer Special! ☀️

talk

by Maria Zervou (Databricks)

ai query generative ai llms sql-like syntax

Curious how to apply resource-intensive generative AI models across massive datasets without breaking the bank? Join this session to discover efficient batch inference strategies for foundation models on Databricks. Learn how to build scalable, cost-effective pipelines that power LLMs and other generative AI systems—optimized for performance, quality, and throughput. We’ll also dive into ai_query, a powerful new capability that lets you run generative AI directly on your data using SQL-like syntax. See how it simplifies development, unlocks new use cases, and accelerates insights with live demos and real-world examples.

Leveling Up: Analytics in the Gaming Industry (w/ Carly Taylor)

2025-07-18 · Mavens of Data Listen

podcast_episode

by Carly Taylor (ggAI)

Analytics Data Analytics KPI

In this episode, we'll chat with Carly Taylor, Field CTO of Gaming at Databricks, to explore the fascinating world of data analytics in the gaming industry, where every click, quest, and respawn generates insights that shape the games we love. Carly shares her experience working in gaming to help harness data for better gameplay and smarter monetization. She'll break down what analysts, data scientists, and sales engineers actually do in gaming and how teams turn raw data into real-time decisions. Whether you're a player, a data nerd, or someone who wants to turn both into a career, this episode is your walkthrough guide to data in gaming. What You'll Learn: How gaming companies use data to optimize player experience and business outcomes What it's like to work in a field engineering or customer-facing analyst role The tools, KPIs, and best practices for success How to break into a data role in gaming and what skills to focus on Stay updated with Carly's latest by subscribing to her Substack Register for free to be part of the next live session: https://bit.ly/3XB3A8b Follow us on Socials: LinkedIn YouTube Instagram (Mavens of Data) Instagram (Maven Analytics) TikTok Facebook Medium X/Twitter

Amber Roberts — Staff Technical Marketing Manager at Databricks

2025-07-16 · Agentic AI Summit | Virtual

talk

by Amber Roberts (Databricks)

Marketing

Session led by Amber Roberts, Staff Technical Marketing Manager at Databricks

Data Engineering Central Podcast - Episode 8

2025-07-10 · Data Engineering Central Podcast Listen

podcast_episode

Data Engineering DuckDB Iceberg SQL

This is a free preview of a paid episode. To hear more, visit dataengineeringcentral.substack.com

Hello! A new episode of the Data Engineering Central Podcast is dropping today, we will be covering a few hot topics! * Apache Iceberg Catalogs * new Boring Catalog * new full Iceberg support from Databricks/Unity Catalog * Databricks SQL Scripting * DuckDB coming to a Lake House near you * Lakebase from Databricks Going to be a great show, come along for the ride! Thanks …

A Decade in Data Engineering - Lessons Realities and Where We Go From Here

2025-07-01 · Airflow Summit 2025

session

by Ben Rogojan (Facebook)

Airflow BigQuery Data Engineering Hadoop Snowflake

There was a post on the data engineering subreddit recently that discussed how difficult it is to keep up with the data engineering world. Did you learn Hadoop, great we are on Snowflake, BigQuery and Databricks now. Just learned Airflow, well now we have Airflow 3.0. And the list goes on. But what doesn’t change, and what have been the lessons over the past decade. That’s what I’ll be covering in this talk. Real lessons and realities that come up time and time again whether you’re working for a start-up or a large enterprise.

Orchestrating Databricks with Airflow: Unlocking the Power of MVs, Streaming Tables, and AI

2025-07-01 · Airflow Summit 2025

session

by Tahir Fayyaz (/ Google Cloud Platform Team specialising in Data & Machine Learning, BigQuery expert) , Shanelle Roman

AI/ML Airflow Analytics dbt SQL Data Streaming

As data workloads grow in complexity, teams need seamless orchestration to manage pipelines across batch, streaming, and AI/ML workflows. Apache Airflow provides a flexible and open-source way to orchestrate Databricks’ entire platform, from SQL analytics with Materialized Views (MVs) and Streaming Tables (STs) to AI/ML model training and deployment. In this session, we’ll showcase how Airflow can automate and optimize Databricks workflows, reducing costs and improving performance for large-scale data processing. We’ll highlight how MVs and STs eliminate manual incremental logic, enable real-time ingestion, and enhance query performance—all while maintaining governance and flexibility. Additionally, we’ll demonstrate how Airflow simplifies ML model lifecycle management by integrating Databricks’ AI/ML capabilities into end-to-end data pipelines. Whether you’re a dbt user seeking better performance, a data engineer managing streaming pipelines, or an ML practitioner scaling AI workloads, this session will provide actionable insights on using Airflow and Databricks together to build efficient, cost-effective, and future-proof data platforms.

Orchestrating Databricks with Airflow: Unlocking the Power of MVs, Streaming Tables, and AI

2025-07-01 · Airflow Summit 2025

session

by Tahir Fayyaz (/ Google Cloud Platform Team specialising in Data & Machine Learning, BigQuery expert) , Shanelle Roman

AI/ML Airflow Analytics dbt SQL Data Streaming

As data workloads grow in complexity, teams need seamless orchestration to manage pipelines across batch, streaming, and AI/ML workflows. Apache Airflow provides a flexible and open-source way to orchestrate Databricks’ entire platform, from SQL analytics with Materialized Views (MVs) and Streaming Tables (STs) to AI/ML model training and deployment. In this session, we’ll showcase how Airflow can automate and optimize Databricks workflows, reducing costs and improving performance for large-scale data processing. We’ll highlight how MVs and STs eliminate manual incremental logic, enable real-time ingestion, and enhance query performance—all while maintaining governance and flexibility. Additionally, we’ll demonstrate how Airflow simplifies ML model lifecycle management by integrating Databricks’ AI/ML capabilities into end-to-end data pipelines. Whether you’re a dbt user seeking better performance, a data engineer managing streaming pipelines, or an ML practitioner scaling AI workloads, this session will provide actionable insights on using Airflow and Databricks together to build efficient, cost-effective, and future-proof data platforms.

Niels Claeys: Use dbt and DuckDB Instead of Databricks for Data Processing

2025-06-27 · DATA MINER Big Data Europe Conference 2020 Watch

video

by Niels Claeys

dbt DuckDB

One dbt repo, two targets: parallel dbt deployments on Azure SQL & Databricks

2025-06-24 · dbt Global Circuit Series: Belgium dbt Meetup #11 (in-person)

talk

by Emiel Ackermann (Port of Antwerp-Bruges) , Nicolas Jonckheere (Datashift)

dbt azure sql

We’ll start with a walkthrough of the technical setup of dbt at the Port of Antwerp-Bruges, in the context of a migration to Databricks. Then we'll dive into how we handle deploying dbt to multiple targets for the duration of the migration. Finally we'll compare both environments with insights from an analytics engineering perspective.

Roundtable: Unpacking the SAP & Databricks Partnership - A New Era of Data-Driven Decision Making, moderated by SAP

2025-06-18 · gartner-data-analytics-apac-2025

roundtable

by Jon Gooding (SAP) , Živilė Norkūnaitė (Databricks) , Kara Reed (SAP)

AI/ML Data Management SAP

Join us for an exclusive roundtable discussion featuring industry leaders and experts as we delve into the transformative power of the SAP and Databricks partnership. This session is designed to provide actionable insights and foster a collaborative dialogue on the ways this collaboration is reshaping the landscape of data management, AI, and business strategy.

This will be a dynamic, interactive roundtable where participants can share their viewpoints, explore real-world use cases, and address challenges and opportunities. The session is designed to encourage open discussion and provide valuable insights for navigating the evolving data and AI landscape.

SAP: Building a Modern Data Fabric Architecture for Electrifying Insights with Endeavour Energy

2025-06-17 · gartner-data-analytics-apac-2025

talk

by Robbie Cook (ENDEAVOUR ENERGY) , Živilė Norkūnaitė (Databricks) , Kara Reed (SAP)

AI/ML Analytics Cloud Computing Fabric SAP

SAP’s data and analytics solutions serve as the foundation for this data fabric; delivering an integrated, semantically-rich data layer that ensures seamless and scalable access to data without duplication.

Join this session to learn how Endeavour Energy is leveraging SAP’s data and analytic capabilities to build a flexible, business-ready data foundation critical for delivering analytical insights. We’ll explore the journey they took to get here, and the future they are building, including a look into SAP Business Data Cloud and, with Databricks, can facilitate a unified data platform to thrive in the AI economy.

Thursday Keynote (Virtual Replay)

2025-06-13 · Data + AI Summit 2025

keynote

AI/ML

Be first to witness the latest breakthroughs from Databricks and share the success of innovative data and AI companies.

Summit Live: Women In Data and AI Conversation

2025-06-12 · Data + AI Summit 2025 Watch

talk

by Lisa Cohen (Anthropic) , Kate Ostbye (Pfizer) , Pallavi Koppol (Databricks) , Holly Smith (Databricks)

AI/ML

Each year at Summit, Women in Data and AI have a half day for in-person discussions on empowering Women in Data and AI Breakfast, and networking with like-minded professionals and trailblazers. For this virtual discussion, hear from Kate Ostbye (Pfizer), Lisa Cohen (Anthropic), Pallavi Koppol and Holly Smith (Databricks) about navigating challenges, celebrating successes, and inspire one another as we champion diversity and innovation in data together. And how to get involved year-round.

Capitalizing Alternatives Data on the Addepar Platform: Private Markets Benchmarking

2025-06-12 · Data + AI Summit 2025 Watch

lightning_talk

by Ricky D'Sa (Addepar)

Data Lakehouse Delta

Addepar possesses an enormous private investment data set with 40% of the $7T assets on the platform allocated to alternatives. Leveraging the Addepar Data Lakehouse (ADL), built on Databricks, we have built a scalable data pipeline that assesses millions of private fund investment cash flows and translates it to a private fund benchmarks data offering. Investors on the Addepar platform can leverage this data seamlessly integrated against their portfolio investments and obtain actionable investment insights. At a high-level, this data offering consists of an extensive data aggregation, filtering, and construction logic that dynamically updates for clients through the Databricks job workflows. This derived dataset has gone through several iterations with investment strategists and academics that leveraged delta shared tables. Irrespective of the data source, the data pipeline coalesces all relevant cash flow activity against a unique identifier before constructing the benchmarks.

Route to Success: Scalable Routing Agents With Databricks and DSPy

2025-06-12 · Data + AI Summit 2025 Watch

lightning_talk

by Luis Moros (Databricks)

AI/ML GenAI

As companies increasingly adopt Generative AI, they're faced with a new challenge: managing multiple AI assistants. What if you could have a single, intuitive interface that automatically directs questions to the best assistant for the task? Join us to discover how to implement a flexible Routing Agent that streamlines working with multiple AI Assistants. We'll show you how to leverage Databricks and DSPy 3.0 to simplify adding this powerful pattern to your system. We'll dive into the essential aspects including: Using DSPy optimizers to maximize correct route selections Optimizing smaller models to reduce latency Creating stateful interactions Designing for growth and adaptability to support tens or hundreds of AI Assistants Ensuring authorized access to AI Assistants Tracking performance in production environments We'll share real-world examples that you can apply today. You'll leave with the knowledge to make your AI system run smoothly and efficiently.

Welcome Lakehouse, from a DWH transformation to a M&A data sharing

2025-06-12 · Data + AI Summit 2025 Watch

lightning_talk

by Gianfranco Arena (Dxc Technology)

AWS Data Lakehouse Delta DWH

At DXC, we helped our customer FastWeb with their "Welcome Lakehouse" project - a data warehouse transformation from on-premises to Databricks on AWS. But the implementation became something more. Thanks to features such as Lakehouse Federation and Delta Sharing, from the first day of the Fastweb+Vodafone merger, we have been able to connect two different platforms with ease and make the business focus on the value of data and not on the IT integration. This session will feature our customer Alessandro Gattolin of Fastweb to talk about the experience.

Advanced Governance and Auth With Databricks Apps

2025-06-12 · Data + AI Summit 2025 Watch

talk

by Andre Furlan Bueno (Databricks) , Doug Judice (Addepar)

API

Explore advanced governance and authentication patterns for building secure, enterprise-grade apps with Databricks Apps. Learn how to configure complex permissions and manage access control using Unity Catalog. We’ll dive into “on-behalf-of-user” authentication — allowing agents to enforce user-specific access controls — and cover API-based authentication, including PATs and OAuth flows for external integrations. We’ll also highlight how Addepar uses these capabilities to securely build and scale applications that handle sensitive financial data. Whether you're building internal tools or customer-facing apps, this session will equip you with the patterns and tools to ensure robust, secure access in your Databricks apps.

AI Evaluation from First Principles: You Can't Manage What You Can't Measure

2025-06-12 · Data + AI Summit 2025 Watch

talk

by Pallavi Koppol (Databricks) , Jonathan Frankle (Databricks)

AI/ML GenAI LLM

Is your AI evaluation process holding back your system's true potential? Many organizations struggle with improving GenAI quality because they don't know how to measure it effectively. This research session covers the principles of GenAI evaluation, offers a framework for measuring what truly matters, and demonstrates implementation using Databricks.Key Takeaways:-Practical approaches for establishing reliable metrics for subjective evaluations-Techniques for calibrating LLM judges to enable cost-effective, scalable assessment-Actionable frameworks for evaluation systems that evolve with your AI capabilitiesWhether you're developing models, implementing AI solutions, or leading technical teams, this session will equip you to define meaningful quality metrics for your specific use cases and build evaluation systems that expose what's working and what isn't, transforming AI guesswork into measurable success.

Automating Taxonomy Generation With Compound AI on Databricks

2025-06-12 · Data + AI Summit 2025 Watch

talk

by Allistair Cota (Lovelytics) , Sudhir Gajre (Lovelytics)

AI/ML API LLM

Taxonomy generation is a challenge across industries such as retail, manufacturing and e-commerce. Incomplete or inconsistent taxonomies can lead to fragmented data insights, missed monetization opportunities and stalled revenue growth. In this session, we will explore a modern approach to solving this problem by leveraging Databricks platform to build a scalable compound AI architecture for automated taxonomy generation. The first half of the session will walk you through the business significance and implications of taxonomy, followed by a technical deep dive in building an architecture for taxonomy implementation on the Databricks platform using a compound AI architecture. We will walk attendees through the anatomy of taxonomy generation, showcasing an innovative solution that combines multimodal and text-based LLMs, internal data sources and external API calls. This ensemble approach ensures more accurate, comprehensive and adaptable taxonomies that align with business needs.

talk-data.com

Activity Trend

Top Events

Top Speakers

Overview of Agent Bricks

Cost-effective AI on Databricks

Leveling Up: Analytics in the Gaming Industry (w/ Carly Taylor)

Amber Roberts — Staff Technical Marketing Manager at Databricks

Data Engineering Central Podcast - Episode 8

A Decade in Data Engineering - Lessons Realities and Where We Go From Here

Orchestrating Databricks with Airflow: Unlocking the Power of MVs, Streaming Tables, and AI

Orchestrating Databricks with Airflow: Unlocking the Power of MVs, Streaming Tables, and AI

Niels Claeys: Use dbt and DuckDB Instead of Databricks for Data Processing

One dbt repo, two targets: parallel dbt deployments on Azure SQL & Databricks

Roundtable: Unpacking the SAP & Databricks Partnership - A New Era of Data-Driven Decision Making, moderated by SAP

SAP: Building a Modern Data Fabric Architecture for Electrifying Insights with Endeavour Energy

Thursday Keynote (Virtual Replay)

Summit Live: Women In Data and AI Conversation

Capitalizing Alternatives Data on the Addepar Platform: Private Markets Benchmarking

Route to Success: Scalable Routing Agents With Databricks and DSPy

Welcome Lakehouse, from a DWH transformation to a M&A data sharing

Advanced Governance and Auth With Databricks Apps

AI Evaluation from First Principles: You Can't Manage What You Can't Measure

Automating Taxonomy Generation With Compound AI on Databricks