talk-data.com

Topic

dbt

dbt (data build tool)

data_transformation analytics_engineering sql

Activities

758

tagged

Activity Trend

134 peak/qtr

2020-Q1 2026-Q1

Top Events

Data Engineering Podcast 128 Dbt Coalesce 2024 89 dbt Coalesce 2025 87 dbt Coalesce 2022 81 dbt Coalesce 2023 79 The Analytics Engineering Podcast 77 dbt Coalesce 2020 25 Databricks DATA + AI Summit 2023 19 Data + AI Summit 2025 12 Airflow Summit 2025 11 The Joe Reis Show 8 Airflow Summit 2024 8

Top Speakers

Tobias Macey 128 Tristan Handy (dbt Labs) 67 Julia Schottenstein (dbt labs) 36 Joe Reis (DeepLearning.AI) 9 Drew Banin (Fishtown Analytics) 9 Jeremy Cohen (dbt Labs) 7 Grace Goheen (dbt Labs) 7 Gleb Mezhanskiy (Datafold) 7 Benn Stancil (Mode) 6 Maxime Beauchemin (Preset) 6 Dakota Kelley (phData) 5 Benoit Perigaud (dbt Labs) 5

Activities

758 activities · Newest first

All Video Podcast Book

From Stored Procedures to Scalable Data

2025-09-25 · Big Data LDN 2025

Face To Face

by Veronica Saha (Zopa Bank) , Francis Nwobu (Zopa Bank) , Tayo Moore (Zopa Bank)

Git

Zopa outgrew stored procedures: opaque logic, poor documentation, and inefficient pipelines. Moving to dbt Core brought modular models, tests, and version control, cutting change risk and spreading siloed knowledge. The dbt Platform then simplified onboarding, performance, and ownership, with live docs and lineage that boosted adoption and trust across teams, and supported regulatory reporting. Learn the tactics behind Zopa’s hub-and-spoke model, faster onboarding, and reusable definitions via dbt Mesh.

#89 SQLBits Unfiltered: dbt in Fabric, MLOps in Action & Copilot in Question

2025-09-25 · DataTopics: All Things Data, AI & Tech Listen

podcast_episode

by Sam Debruyn (Microsoft) , Dorian Van den Heede

AI/ML Azure Data Lakehouse Microsoft Fabric MLOps Python SQL SQLMesh

Send us a text In this episode, we're joined by Sam Debruyn and Dorian Van den Heede who reflect on their talks at SQL Bits 2025 and dive into the technical content they presented. Sam walks through how dbt integrates with Microsoft Fabric, explaining how it improves lakehouse and warehouse workflows by adding modularity, testing, and documentation to SQL development. He also touches on Fusion’s SQL optimization features and how it compares to tools like SQLMesh. Dorian shares his MLOps demo, which simulates beating football bookmakers using historical data,nshowing how to build a full pipeline with Azure ML, from feature engineering to model deployment. They discuss the role of Python modeling in dbt, orchestration with Azure ML, and the practical challenges of implementing MLOps in real-world scenarios. Toward the end, they explore how AI tools like Copilot are changing the way engineers learn and debug code, raising questions about explainability, skill development, and the future of junior roles in tech. It’s rich conversation covering dbt, MLOps, Python, Azure ML, and the evolving role of AI in engineering.

Analytics Engineering 2.0: From Pipelines to Intelligence Interfaces

2025-09-24 · Big Data LDN 2025

Face To Face

by Oliver Ramsay (Lightdash)

AI/ML Analytics Analytics Engineering Dashboard Lightdash SQL

Analytics engineers are at a crossroads. Back in 2018, dbt paved the way for for this new kind of data professional, people who had technical ability and could understand business context. But here's the thing: AI is automating traditional tasks like pipeline building and dashboard creation. So then what happens to analytics engineers? They don't disappear - they evolve.

The same skills that made analytics engineers valuable also make them perfect for a new role I'm calling 'Analytics Intelligence Engineers.' Instead of writing SQL, they're writing the context that makes AI actually useful for business users.

In this talk, I'll show you what this evolution looks like day-to-day. We'll explore building semantic layers, crafting AI context, and measuring AI performance - all through real examples using Lightdash. You'll see how the work shifts from data plumbing to data intelligence, and walk away with practical tips for making AI tools more effective in your organization. Whether you're an analytics engineer wondering about your future or a leader planning your data strategy, this session will help you understand where the field is heading and how to get there.

Transforming Data Chaos into Clarity: Lessons from Penguin Random House UK

2025-09-24 · Big Data LDN 2025

Face To Face

by Kerry Philips (Penguin Random House UK)

Dashboard Data Governance Data Quality

Penguin Random House, the world’s largest trade book publisher, relies on data to power every part of its global business, from supply chain operations to editorial workflows and royalty reconciliation. As the complexity of PRH’s dbt pipelines grew, manual checks and brittle tests could no longer keep pace. The Data Governance team knew they needed a smarter, scalable approach to ensure trusted data.

In this session, Kerry Philips, Head of Data Governance at Penguin Random House, will reveal how the team transformed data quality using Sifflet’s observability platform. Learn how PRH integrated column-level lineage, business-rule-aware logic, and real-time alerts into a single workspace, turning fragmented testing into a cohesive strategy for trust, transparency, and agility.

Attendees will gain actionable insights on:

- Rapidly deploying observability without disrupting existing dbt workflows

- Encoding business logic into automated data tests

- Reducing incident resolution times and freeing engineers to innovate

- Empowering analysts to act on data with confidence

If you’ve ever wondered how a company managing millions of ISBNs ensures every dashboard tells the truth, this session offers a behind-the-scenes look at how data observability became PRH’s newest bestseller.

Airflow & dbt: dynamic creation of inter-dag sensors with cosmos

2025-09-17 · Berlin Apache Airflow Meetup at GetYourGuide!

talk

by Giovanni Corsetti Silva (GetYourGuide)

Airflow astronomer cosmos

Apache Airflow is the go-to platform for data orchestration, while dbt is widely recognized for analytical data transformations. Using astronomer-cosmos library, integrating dbt projects into Airflow becomes straightforward, allowing each dbt model to be transformed into individual tasks or task groups equipped with Airflow features like retries and callbacks. However, organizing dbt models into separate Airflow DAGs based on domain or user-defined filters presents challenges in maintaining dependencies across these distinct DAGs. Ensuring that downstream dbt tasks only execute after the corresponding upstream tasks in different DAGs have successfully completed is crucial for data consistency—yet this functionality is not supported by default. Join GetYourGuide as we explore our method for dynamically creating inter-DAG sensors in Airflow using Astronomer Cosmos for dbt. We will show how we maintained dbt model dependencies across multiple DAGs, making our pipeline modular, scalable, and robust.

From Chaos to Cosmos: Simplifying DAG Creation in Apache Airflow

2025-09-17 · Berlin Apache Airflow Meetup at GetYourGuide!

talk

by Tatiana Al-Chueyr Martins (Astronomer)

Airflow astronomer cosmos cosmos

dbt has become the de facto standard for transforming data in modern analytics stacks. But as projects grow, so does the question: where should dbt run in production, and how can we make it faster? In this talk, we’ll compare the performance trade-offs between running dbt natively and orchestrating it through Airflow using Cosmos, with a focus on workflow efficiency at scale. Using a 200-model dbt project as a case study, we’ll show how workflow execution time in Cosmos was reduced from 15 minutes to just 5 minutes. We’ll also discuss opportunities to push performance further—ranging from better DAG optimization to warehouse-aware scheduling strategies. Whether you’re a data engineer, analytics engineer, or platform owner, you’ll leave with practical strategies to optimize dbt execution and inspiration for what’s next in large-scale orchestration

Real-World Success & Roadmap Ahead: Accelerating Analytics with Microsoft Fabric + dbt

2025-09-16 · Second Vienna dbt Meetup with dbt Labs CTO: FabCon Edition (in person)

talk

by Charles Webb (Microsoft)

Analytics Microsoft Fabric

What’s New from dbt

2025-09-16 · Second Vienna dbt Meetup with dbt Labs CTO: FabCon Edition (in person)

talk

by Mark Porter (dbt Labs)

Moving from Self-Managed dbt to dbt Cloud

2025-09-16 · Second Vienna dbt Meetup with dbt Labs CTO: FabCon Edition (in person)

talk

by Martin Finn-Sell (GoStudent)

Cloud Computing

For Builders, by Builders - The latest tools from dbt Labs

2025-09-16 · Prague dbt meetup

talk

Talk by Stephan Durry from dbt Labs about the latest tools from dbt Labs.

What’s new with dbt?

2025-09-11 · Berlin dbt Meetup - Pre-coalesce edition!

talk

Succesvol opzetten van een self-service analytics data platform voor sensor data

2025-09-10 · Data Expo NL 2025

talk

by Hedzer Keulen (Heerema Marine Contractors) , Stefan Hulsbos (Digital Power)

Analytics Databricks

Hoe maak je 100 miljoen sensormetingen per dag bruikbaar voor engineers en analisten? In deze sessie laten we zien hoe Heerema met een klein datateam een schaalbaar self-service data platform bouwde met Databricks en dbt, waarmee ruwe metingen worden omgezet in betrouwbare datamodellen voor verschillende analyses en teams.

Agentic coding in analytics engineering (w/ Mikkel Dengsøe)

2025-09-07 · The Analytics Engineering Podcast Listen

podcast_episode

by Mikkel Dengsøe (SYNQ) , Tristan Handy (dbt Labs)

AI/ML Analytics Analytics Engineering BI Omni Snowflake

Tristan talks with Mikkel Dengsøe, co-founder at SYNQ, to break down what agentic coding looks like in analytics engineering. Mikkel walks through a hands-on project using Cursor, the dbt MCP server, Omni's AI assistant, and Snowflake. They cover where agents shine (staging, unit tests, lineage-aware checks), where they're risky (BI chat for non-experts), and how observability is shifting from dashboards to root-cause explanations. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

What's new in dbt?

2025-09-04 · dbt Dublin September Meetup: All About Analytics Engineering

talk

dbt Platform at UPM

2025-08-28 · Helsinki dbt Meetup

talk

by Arto Rautapää (UPM)

For Builders, by Builders: The Latest Tools from dbt Labs

2025-08-28 · Helsinki dbt Meetup

talk

by Lucas Silbernagel (dbt Labs)

Under the hood of Apache Iceberg (w/ Christian Thiel)

2025-08-24 · The Analytics Engineering Podcast Listen

podcast_episode

by Christian Thiel (Lakekeeper) , Tristan Handy (dbt Labs)

Analytics Analytics Engineering Iceberg

Tristan digs deep into the world of Apache Iceberg. There's a lot happening beneath the surface: multiple catalog interfaces, evolving REST specs, and competing implementations across open source, proprietary, and academic contexts. Christian Thiel, co-founder of Lakekeeper, one of the most widely used Iceberg catalogs, joins to walk through the state of the Iceberg ecosystem. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

From Bits to Tables: The Evolution of S3 Storage

2025-08-05 · Data Engineering Podcast Listen

podcast_episode

by Andy Warfield (Amazon) , Tobias Macey

AI/ML Analytics Data Engineering Data Management Datafold Iceberg Lance Oracle Python S3 Snowflake

Summary In this episode of the Data Engineering Podcast Andy Warfield talks about the innovative functionalities of S3 Tables and Vectors and their integration into modern data stacks. Andy shares his journey through the tech industry and his role at Amazon, where he collaborates to enhance storage capabilities, discussing the evolution of S3 from a simple storage solution to a sophisticated system supporting advanced data types like tables and vectors crucial for analytics and AI-driven applications. He explains the motivations behind introducing S3 Tables and Vectors, highlighting their role in simplifying data management and enhancing performance for complex workloads, and shares insights into the technical challenges and design considerations involved in developing these features. The conversation explores potential applications of S3 Tables and Vectors in fields like AI, genomics, and media, and discusses future directions for S3's development to further support data-driven innovation.

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data managementTired of data migrations that drag on for months or even years? What if I told you there's a way to cut that timeline by up to 6x while guaranteeing accuracy? Datafold's Migration Agent is the only AI-powered solution that doesn't just translate your code; it validates every single data point to ensure perfect parity between your old and new systems. Whether you're moving from Oracle to Snowflake, migrating stored procedures to dbt, or handling complex multi-system migrations, they deliver production-ready code with a guaranteed timeline and fixed price. Stop burning budget on endless consulting hours. Visit dataengineeringpodcast.com/datafold to book a demo and see how they're turning months-long migration nightmares into week-long success stories.Your host is Tobias Macey and today I'm interviewing Andy Warfield about S3 Tables and VectorsInterview IntroductionHow did you get involved in the area of data management?Can you describe what your goals are with the Tables and Vector features of S3?How did the experience of building S3 Tables inform your work on S3 Vectors?There are numerous implementations of vector storage and search. How do you view the role of S3 in the context of that ecosystem?The most directly analogous implementation that I'm aware of is the Lance table format. How would you compare the implementation and capabilities of Lance with what you are building with S3 Vectors?What opportunity do you see for being able to offer a protocol compatible implementation similar to the Iceberg compatibility that you provide with S3 Tables?Can you describe the technical implementation of the Vectors functionality in S3?What are the sources of inspiration that you looked to in designing the service?Can you describe some of the ways that S3 Vectors might be integrated into a typical AI application?What are the most interesting, innovative, or unexpected ways that you have seen S3 Tables/Vectors used?What are the most interesting, unexpected, or challenging lessons that you have learned while working on S3 Tables/Vectors?When is S3 the wrong choice for Iceberg or Vector implementations?What do you have planned for the future of S3 Tables and Vectors?Contact Info LinkedInParting Question From your perspective, what is the biggest gap in the tooling or technology for data management today?Closing Announcements Thank you for listening! Don't forget to check out our other shows. Podcast.init covers the Python language, its community, and the innovative ways it is being used. The AI Engineering Podcast is your guide to the fast-moving world of building AI systems.Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.Links S3 TablesS3 VectorsS3 ExpressParquetIcebergVector IndexVector DatabasepgvectorEmbedding ModelRetrieval Augmented GenerationTwelveLabsAmazon BedrockIceberg REST CatalogLog-Structured Merge TreeS3 MetadataSentence TransformerSparkTrinoDaftThe intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

The pragmatic guide to AI agents in the enterprise (w/ Sean Falconer)

2025-08-03 · The Analytics Engineering Podcast Listen

podcast_episode

by Tristan Handy (dbt Labs) , Sean Falconer (Confluent)

AI/ML Analytics Analytics Engineering Marketing

What does it mean to be agentic? Is there a spectrum of agency? In this episode of The Analytics Engineering Podcast, Tristan Handy talks to Sean Falconer, senior director of AI strategy at Confluent, about AI agents. They discuss what truly makes software "agentic," where agents are successfully being deployed, and how to conceptualize and build agents within enterprise infrastructure. Sean shares practical ideas about the changing trends in AI, the role of basic models, and why agents may be better for businesses than for consumers. This episode will give you a clear, practical idea of how AI agents can change businesses, instead of being a vague marketing buzzword. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Python and the Data Platform in 2025

2025-07-31 · PyData Cardiff - July 2025

talk

by James C Yarrow (Future Publishing)

Airflow Python

This will be a walkthrough of the modern data platform, the capabilities and the challenges Python is solving and how other tools like Airflow and DBT play a role in the modern data platform.

Page 8 of 38

← Previous

1 ... 6 7 8 9 10 ... 38