This talk explores EDB’s journey from siloed reporting to a unified data platform, powered by Airflow. We’ll delve into the architectural evolution, showcasing how Airflow orchestrates a diverse range of use cases, from Analytics Engineering to complex MLOps pipelines. Learn how EDB leverages Airflow and Cosmos to integrate dbt for robust data transformations, ensuring data quality and consistency. We’ll provide a detailed case study of our MLOps implementation, demonstrating how Airflow manages training, inference, and model monitoring pipelines for Azure Machine Learning models. Discover the design considerations driven by our internal data governance framework and gain insights into our future plans for AIOps integration with Airflow.
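As a point of reference for how such orchestration tends to look, here is a minimal, hypothetical sketch of an Airflow TaskFlow DAG chaining training, batch inference, and monitoring steps; the task names and bodies are placeholders, not EDB's actual Azure Machine Learning pipeline.

```python
# Hypothetical sketch only: task names and bodies are placeholders, not EDB's pipeline.
from datetime import datetime

from airflow.decorators import dag, task


@dag(schedule="@daily", start_date=datetime(2024, 1, 1), catchup=False)
def ml_pipeline():
    @task
    def train_model() -> str:
        # Submit a training job (e.g. to Azure Machine Learning) and return a model id.
        return "model-v1"

    @task
    def run_inference(model_id: str) -> str:
        # Score fresh data with the trained model; return where the predictions landed.
        return f"predictions/{model_id}"

    @task
    def monitor(predictions_path: str) -> None:
        # Compare prediction distributions against a baseline to flag drift.
        print(f"monitoring {predictions_path}")

    monitor(run_inference(train_model()))


ml_pipeline()
```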
Topic: dbt (data build tool)
As a popular open-source library for analytics engineering, dbt is often combined with Airflow. Orchestrating and executing dbt models as DAGs adds an additional layer of control over tasks, improves observability, and provides a reliable, scalable environment to run dbt models. This workshop will cover a step-by-step guide to Cosmos, a popular open-source package from Astronomer that helps you quickly run your dbt Core projects as Airflow DAGs and Task Groups, all with just a few lines of code. We’ll walk through:
- Running and visualising your dbt transformations
- Managing dependency conflicts
- Defining database credentials (profiles)
- Configuring source and test nodes
- Using dbt selectors
- Customising arguments per model
- Addressing performance challenges
- Leveraging deferrable operators
- Visualising dbt docs in the Airflow UI
- An example of how to deploy to production
- Troubleshooting
We encourage participants to bring their own dbt project to follow this step-by-step workshop.
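For orientation, here is a minimal sketch of how Cosmos typically wraps a dbt Core project in an Airflow DAG; the project path, connection id, and schema below are illustrative assumptions rather than values from the workshop.

```python
# Minimal sketch; paths, connection ids, and schema names are placeholder assumptions.
from datetime import datetime

from cosmos import DbtDag, ProfileConfig, ProjectConfig
from cosmos.profiles import PostgresUserPasswordProfileMapping

profile_config = ProfileConfig(
    profile_name="my_project",
    target_name="dev",
    profile_mapping=PostgresUserPasswordProfileMapping(
        conn_id="postgres_default",           # Airflow connection holding the credentials
        profile_args={"schema": "analytics"},
    ),
)

my_dbt_dag = DbtDag(
    dag_id="my_dbt_project",
    project_config=ProjectConfig("/usr/local/airflow/dbt/my_project"),
    profile_config=profile_config,
    schedule="@daily",
    start_date=datetime(2024, 1, 1),
    catchup=False,
)
```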
OpenLineage has simplified collecting lineage metadata across the data ecosystem by standardizing its representation in an extensible model. It has enabled a whole ecosystem of tools that improve data pipeline reliability and ease troubleshooting in production environments. In this talk, we’ll briefly introduce the OpenLineage model and explore how this metadata is collected from Airflow, Spark, dbt, and Flink. We’ll demonstrate how to extract valuable insights and outline practical benefits and common challenges when building ingestion, processing and storage for OpenLineage data. We will also briefly show how OpenLineage events can be used to observe data pipelines exhaustively and the benefits that brings.
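To make the model concrete, here is a minimal sketch of the shape of an OpenLineage run event; the namespaces, job name, and datasets are illustrative placeholders, and real events also carry fields such as schemaURL and facet payloads.

```python
# Illustrative sketch of an OpenLineage run event; values are placeholders and
# real events also include schemaURL and facets.
import json
from datetime import datetime, timezone
from uuid import uuid4

event = {
    "eventType": "COMPLETE",                 # e.g. START, COMPLETE, FAIL
    "eventTime": datetime.now(timezone.utc).isoformat(),
    "producer": "https://example.com/my-orchestrator",
    "run": {"runId": str(uuid4())},
    "job": {"namespace": "my_namespace", "name": "daily_orders_model"},
    "inputs": [{"namespace": "warehouse", "name": "raw.orders"}],
    "outputs": [{"namespace": "warehouse", "name": "analytics.orders"}],
}

print(json.dumps(event, indent=2))
```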
In this talk, we will dive into the latest tools and ideas dbt Labs has been shipping — what they unlock, how they fit together, and why we built them the way we did.
We’ll start with a walkthrough of the technical setup of dbt at the Port of Antwerp-Bruges, in the context of a migration to Databricks. Then we'll dive into how we handle deploying dbt to multiple targets for the duration of the migration. Finally, we'll compare both environments with insights from an analytics engineering perspective.
With data teams' growing ambition to build business automation, AI systems, or customer-facing products, we must shift our mindset about data quality. Mechanically applied testing will not be enough; we need a more robust strategy similar to software engineering. In this talk, I outline a new approach to data testing and observability anchored in the ‘Data Products’ concept and walk through the practical implementation of a production-grade analytics system with dbt as the backbone. The learnings will apply to data practitioners using dbt whether they're just getting started or working in a large enterprise.
In this season of the Analytics Engineering podcast, Tristan is digging deep into the world of developer tools and databases. There are few more widely used developer tools than Docker. From its launch back in 2013, Docker has completely changed how developers ship applications. In this episode, Tristan talks to Solomon Hykes, the founder and creator of Docker. They trace Docker's rise from startup obscurity to becoming foundational infrastructure in modern software development. Solomon explains the technical underpinnings of containerization, the pivotal shift from platform-as-a-service to open-source engine, and why Docker's developer experience was so revolutionary. The conversation also dives into his next venture, Dagger, and how it aims to solve the messy, overlooked workflows of software delivery. Bonus: Solomon shares how AI agents are reshaping how CI/CD gets done and why the next revolution in DevOps might already be here. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.
Discussion on how dbt powers the AI-ready data lake.
A talk about the latest tools from dbt Labs for builders.
Presentation detailing the latest tools from dbt Labs for builders.
Learn how to efficiently scale and manage data engineering pipelines with Snowflake's latest capabilities for SQL- and Python-based transformations. Join us for new product and feature overviews, best practices and live demos.
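As one hedged illustration of what a Python-based transformation in Snowflake can look like (not material from the session itself), here is a small Snowpark sketch; the connection parameters and table names are placeholders.

```python
# Hedged sketch of a Snowpark transformation; connection details and table
# names are placeholders, not values from the session.
from snowflake.snowpark import Session
from snowflake.snowpark.functions import col, sum as sum_

session = Session.builder.configs({
    "account": "<account>",
    "user": "<user>",
    "password": "<password>",
    "warehouse": "<warehouse>",
    "database": "<database>",
    "schema": "<schema>",
}).create()

# Aggregate completed order lines into daily revenue and persist the result.
daily_revenue = (
    session.table("raw_orders")
    .filter(col("status") == "complete")
    .group_by(col("order_date"))
    .agg(sum_(col("amount")).alias("revenue"))
)
daily_revenue.write.save_as_table("daily_revenue", mode="overwrite")
```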
In our recent study, an overwhelming majority—80% of respondents—reported using AI in their day-to-day workflows. This marks a significant increase from just a year ago, when only 30% were doing so.
But what about data quality? Can you trust your data?
In this session, we’ll discuss how dbt can help organizations increase trust in their data, improve performance and governance, and control costs more effectively.
dbt is widely regarded as the industry standard for AI on structured data. Its Fusion engine, with deep SQL comprehension, powers the next generation of dbt use cases.
Ludia, a leading mobile gaming company, is empowering its analysts and domain experts by democratizing data engineering with Databricks and dbt. This talk explores how Ludia enabled cross-functional teams to build and maintain production-grade data pipelines without relying solely on centralized data engineering resources—accelerating time to insight, improving data reliability, and fostering a culture of data ownership across the organization.
Riot Games reduced its Databricks compute spend and accelerated development cycles by transforming its data engineering workflows—migrating from bespoke Databricks notebooks and Spark pipelines to a scalable, testable, and developer-friendly dbt-based architecture. In this talk, members of the Developer Experience & Automation (DEA) team will walk through how they designed and operationalized dbt to support Riot’s evolving data needs.
This session will showcase Bosch’s journey in consolidating supply chain information using the Databricks platform. It will dive into how Databricks not only acts as the central data lakehouse but also integrates seamlessly with transformative components such as dbt and Large Language Models (LLMs). The talk will highlight best practices, architectural considerations, and the value of an interoperable platform in driving actionable insights and operational excellence across complex supply chain processes. Key topics and sections:
- Introduction & Business Context: a brief overview of Bosch’s supply chain challenges and the need for a consolidated data platform; the strategic importance of data-driven decision-making in a global supply chain environment
- Databricks as the Core Data Platform
- Integrating dbt for Transformation
- Leveraging LLM Models for Enhanced Insights
HP Print's data platform team took on a migration from a monolithic, shared AWS Redshift deployment to a modular, scalable data ecosystem on the Databricks lakehouse. The result was 30–40% cost savings, scalable and isolated resources for different data consumers and ETL workloads, and performance optimization for a variety of query types. The migration brought technical challenges and learnings relating to ETL migrations with dbt, new Databricks features such as Liquid Clustering, predictive optimization, Photon, and SQL serverless warehouses, managing multiple teams on Unity Catalog, and more. This presentation dives into both the business and technical sides of this migration. Come along as we share our key takeaways from this journey.
This hands-on lab guides participants through the complete customer data analytics journey on Databricks, leveraging leading partner solutions: Fivetran, dbt Cloud, and Sigma. Attendees will learn how to:
- Seamlessly connect to Fivetran, dbt Cloud, and Sigma using Databricks Partner Connect
- Ingest data using Fivetran, transform and model data with dbt Cloud, and create interactive dashboards in Sigma, all on top of the Databricks Data Intelligence Platform
- Empower teams to make faster, data-driven decisions by streamlining the entire analytics workflow using an integrated, scalable, and user-friendly platform
In this session, we will share NCS’s approach to implementing a Databricks Lakehouse architecture, focusing on key lessons learned and best practices from our recent implementations. By integrating Databricks SQL Warehouse, the dbt Transform framework and our innovative test automation framework, we’ve optimized performance and scalability, while ensuring data quality. We’ll dive into how Unity Catalog enabled robust data governance, empowering business units with self-serve analytical workspaces to create insights while maintaining control. Through the use of solution accelerators, rapid environment deployment and pattern-driven ELT frameworks, we’ve fast-tracked time-to-value and fostered a culture of innovation. Attendees will gain valuable insights into accelerating data transformation, governance and scaling analytics with Databricks.
Dynamic Insert Overwrite is an important Delta Lake feature that allows fine-grained updates by selectively overwriting specific rows, eliminating the need for full-table rewrites. For example, this capability is essential for:
- dbt-databricks incremental models/workloads, enabling efficient data transformations by processing only new or updated records
- ETL Slowly Changing Dimension (SCD) Type 2
In this lightning talk, we will:
- Introduce Dynamic Insert Overwrite: understand its functionality and how it works
- Explore key use cases: learn how it optimizes performance and reduces costs
- Share best practices: discover practical tips for leveraging this feature on Databricks, including on the cutting-edge Serverless SQL Warehouses
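For context, one common form of this pattern is Delta's dynamic partition overwrite, shown in the minimal PySpark sketch below; only the partitions present in the incoming data are rewritten, which is roughly what an insert-overwrite incremental strategy relies on. The table and column names are placeholder assumptions.

```python
# Minimal sketch of a dynamic partition overwrite into a Delta table; table and
# column names are placeholder assumptions. Assumes analytics.orders is a Delta
# table partitioned by order_date.
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

# New or updated records covering only a handful of order_date partitions.
updates = spark.table("staging.orders_updates")

(
    updates.write.format("delta")
    .mode("overwrite")
    # Only partitions present in `updates` are rewritten; all others stay untouched.
    .option("partitionOverwriteMode", "dynamic")
    .saveAsTable("analytics.orders")
)
```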