talk-data.com talk-data.com

Topic

dbt

dbt (data build tool)

data_transformation analytics_engineering sql

758

tagged

Activity Trend

134 peak/qtr
2020-Q1 2026-Q1

Activities

758 activities · Newest first

dbt & data mesh: the perfect pair (?)

In this session, Guillermo Sanchez, (GoDataDriven) will attempt to summarize the last four major socio-technical changes that data teams have faced in the last few years, and why dbt plus the highly contentious data mesh might be the answer to each.

Check the slides here: https://drive.google.com/file/d/1LRmAJt3roIASxgJKIFsqbyZPcQrxjhAR/view?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

dbt Labs and Databricks: best practices and future roadmap

The Databricks Lakehouse Platform unifies the best of data warehouses and data lakes in one simple platform to handle all your data, analytics and AI use cases. Databricks now includes complete support for dbt Core and dbt Cloud and you will hear from Conde Nast using dbt and Databricks together to democratize insights. We will also share best practices for developing and productionizing dbt projects containing SQL and Python, governing data with standard SQL, and exciting features on our roadmap such as materialized views for Databricks SQL.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Escape from Data Island - Orchestrate and Connect Your Data Stack for Smooth Sailing

The Modern Data Stack is becoming more and more fragmented. With new tools and processes popping up continuously, it’s easy to get stranded on various “data islands”, with everything running independently. In this session, we’ll teach you:

  • What benefits you gain by turning your Modern Data Stack into an Integrated Data Stack

  • How Shipyard can help you quickly connect the data tools you already use

  • How orchestration is the missing step in your data journey to get your team off “data islands”

  • How dbt fits into the picture of a connected data stack

Check the slides here: https://docs.google.com/presentation/d/1NT7RnMtTLxb5ew5_VXXzciJusfbBKZJx86B6BtNZv90/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Field-level lineage with dbt, ANTLR, and Snowflake

Lineage is a critical component of any root cause, impact analysis, and overall analytics heath assessment workflow. But it hasn’t always been easy to create, particularly at the field level. In this session, Mei Tao, Helena Munoz, and Xuanzi Han (Monte Carlo) tackle this challenge head-on by leveraging some of the most popular tools in the modern data stack, including dbt, Airflow, Snowflake, and ANother Tool for Language Recognition (ANTLR). Learn how they designed the data model, query parser, and larger database design for field-level lineage—highlighting learnings, wrong turns, and best practices developed along the way.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Introducing dbt with Databricks

In this live, instructor-led hands-on lab, you’ll learn how to build a modern data stack with Databricks and dbt, using dbt to manage data transformations in Databricks and perform exploratory data analysis on the clean data sets using Databricks SQL. Based on the lakehouse architecture and built on an open data lake, data analysts, analytics engineers, and data scientists can use dbt and Databricks to work with the freshest and most complete data, and quickly derive new insights for accurate decision-making.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Mastering the art of dbt package development

Welcome to the dbt Package show! Have you ever wondered how Fivetran approaches building dbt packages? Well, put on your aprons, because the Fivetran team is going to give you a glimpse at their recipe for building packages with dbt from 0 release to implementation. Sheri Nguyen of Fivetran will show you:

  • How they leverage the dbt Community to get the right ingredients for our packages and plan our roadmap for end state package models

  • Their methodologies and standards for preparing ingredients for our data models to provide flexibility for all our users

  • How they combine everything together and cook up our models for our dbt packages

Finally, after all the cooking is done, the best part is sharing what you've made with your friends and family. Sheri will also show you how different companies use our various packages and how you can contribute! This talk will benefit those who want to leverage Fivetran's dbt packages, understand why we apply certain modeling practices, and understand how we continually iterate on these packages. Additionally, this session will be beneficial for those who also want to be dbt package maintainers and understand how they can contribute future packages.

Check the slides here: https://docs.google.com/presentation/d/1QKvnxnfRBrZKOnBGwXkLnc-EjLS4S3mbSe59kjHhMos/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Moving to predictive: How to assemble the beginnings of your feature store with Snowflake & dbt Labs

Historically, analytics has been focused on "what happened." And to this day, newer and newer generations of tooling, dbt for example, have come forth accelerating the speed and utility of data in an enterprise for decision making. Machine learning, on the other hand (the "what will happen"), has seemingly been stood up as a separate silo with an organization with seemingly "more intricate" technical requirements, the need for "data scientist", and done so all in the name of how to handle "more special" data resulting in "more accurate" decision making. In this session, you will learn how to cut through the noise and extend and leverage your analytic practice with Snowflake and dbt Labs into the realm of machine learning by pairing your analytical pipelines with a feature store layer to declaratively serve both model training and model scoring scenarios, even at some of the lowest latency (real-time) production requirements.

This session requires pre-registration. Sign up here. If session is filled you are welcome to come to the room and join the waitlist onsite. Open seats will be made available 10 minutes after session start.

Check the slides here: https://docs.google.com/presentation/d/1H-aPsc2DkPGcJUV4pd_iLSdMMMAMZTEtsAFO4hpglXM/edit#slide=id.g15a4510fa6a_0_560

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Nobody puts metrics in a corner: How to activate your dbt models

dbt has changed the game for data practitioners, bringing velocity, organizational efficiency, and increased trust to modern analytics workflows. But what happens to your dbt models after they’ve been built? Too often the value you create goes untapped by the business, or accessed only by a select few. dbt Labs and ThoughtSpot are teaming up to unleash the true potential of your transformed data. Learn how to deliver trustworthy data and insights to frontline users at scale with safe, reliable self-service analytics.

Check the slides here: https://docs.google.com/presentation/d/1XFyP8d0wkfnA1_59zb1zbNYInSSzdI2qe09f6OkhRE0/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Outgrowing a single `dbt run`

When does your team decided it’s time to move beyond a singular dbt run? For most analytics engineers, there comes a time when the dbt run commands on fixed schedules simply won’t make the cut. Join Prratek Ramchandani (Vox Media) as he breaks down an alternative approach to orchestrating your dbt project with Dagster that balances meeting SLAs with safely handling the edge cases a simple schedule-based dbt run might create.

Check the slides here: https://docs.google.com/presentation/d/1zivYO_EpN6T9JYM9HjAJAz3bK3e2TREwdKffylkzuUw/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Petabyte-scale lakehouses with dbt and Apache Hudi

While the data lakehouse architecture offers many inherent benefits, it’s still relatively new to the dbt community, which creates hurdles to adoption.

In this talk, you’ll meet Apache Hudi, a platform used by organizations to build planet-scale data platforms according to all of the key design elements required by the lakehouse architecture. You’ll also learn how we’ve personaly used Hudi, along with dbt, Spark, Airflow, and many more open-source tools to build a truly reliable big data streaming lakehouse that cut the latency of our petabyte-scale data pipelines from hours to minutes.

Check the slides here: https://docs.google.com/presentation/d/18dv4TZzRnZQ-IK7xLkYJuind4Bcztkl19zV7b4HTaTU/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Snowflake yourself: Using dbt for Snowflake access control management

Calling all Snowflake and dbt admins! Come learn how to utilize dbt to manage Snowflake users and network policies and keep your data stack simple and lean while version controlling Snowflake’s RBAC and network policies.

Check the slides here: https://docs.google.com/presentation/d/1IxmDMgVaOYb0PTEWej-k3l0avKrmHI970oT7_Xpg0Z4/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Standardizing the unstandardized: dbt modeling for Web 3.0

Web 3.0 is constantly evolving: everyday there are new smart contracts, project updates, tokens and chains. And just because the data is on the blockchain and public does not mean it's easily accessible and digestible. It may be easy to monitor specific parts of the blockchain such as a particular smart contract, but how do you build a scalable infrastructure flexible enough to account for any new business request in a rapidly evolving industry?

Join Alec Kamra (Mythical Games) as he shows you how to do just this. As a blockchain gaming platform, Mythical Games has built a stable and flexible multi-chain solution using Google BQ's public datasets and a few external APIs that allow them to monitor all trades and transfers for their business needs.

Check the slides here:https://docs.google.com/presentation/d/14wcMHKAGhm9qvZ2NlPSv1wQj5FJF-ttL4k3lAivt6Fk/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Talk Talent to Me

Learn about the skills and abilities we seek at dbt Labs!

Check the slides here: https://docs.google.com/presentation/d/1Lbhz8R11sNedSUmo26ia_hPV8XHncBxhH09GMQdofa8/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

“The easy way” to launch analytics at a startup with dbt

Analytics at a startup….something that might scare most data folks. But not Lindsay Murphy of Maple! Join Lindsay as she draws on her experiences to demystify and template the process behind using dbt to create robust self-service analytics practices at startups and other small companies.

Check the slides here: https://docs.google.com/presentation/d/1GMe4K3UQgJESozHV2-p19OwJoBQJ3VEfZaX-cthsYdU/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Why Metrics Are Even More Valuable Than You Think They Are

Creating / migrating metric metadata to dbt can be a pain, because the level of underlying data knowledge required to create the YAML files properly. You might have found yourself wondering, “is this worth it just to standardize metric definitions?”. This talk will tell you why it is definitely worth it… because the functionality you unlock goes beyond just standard metric definitions. Adopting the dbt standard metric syntax unlocks three additional possibilities for your data:

  1. Automated time-aware metric calculations

  2. Dynamic drill downs and segmentation to empower slice and dice analysis

  3. Self-service dynamic transforms using templated SQL

Check slides here: https://docs.google.com/presentation/d/1nJHP2E6NGZ-KHG4_gNiI6w2lq4kjWgQIanAC9yf3cng

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Why you should not do lead scoring in your marketing automation tools

As your business and number of product lines grow, the out-of-the-box lead scoring in CRM tools starts becoming difficult to work with and lead scoring becomes that more important for sales teams. Join Ben Lewinsky as he shows how Culture Amp approaches multi-product lead scoring in their data warehouse using dbt.

Check the slides here: https://docs.google.com/presentation/d/1NOyZLs1QUf6HQqF6jusx32OjUb-Gi-PTnmiDQ8EFKM8/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Workshop: dbt Packages you didn’t know you needed

You’re probably familiar with the dbt-utils package, but how many others have you explored? If you’re looking to cut development time, make your next audit less painful, or wield dbt metrics confidently, join Elize and Dave as they dig into three of their most essential dbt packages—codegen, audit_helper, and metrics —in this hands-on workshop.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Workshop: Get more out of your DAG

In this workshop, you’ll learn how to create and document macros that leverage the powerful introspective features of dbt to perform dynamic modeling including: run result storage in your warehouse, dynamic value lookup in models, and leveraging model metadata in macros.

You’ll learn how to: - Create macros to store your dbt run results within your data warehouse - Leverage internal dbt graph data for dynamic modeling - Incorporate dbt best-practices when developing macros

Prerequisites: - Basic familiarity with ANSI SQL - Some familiarity using Jinja and writing macros - Experience with dbt required

Check Notion document here: https://www.notion.so/6382db82046f41599e9ec39afb035bdb

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Intentional learning: How and why you should learn Data Jawn

What was the most recent data skill you developed? How did you go about doing that? With an ever-growing industry of skills, knowledge, frameworks, and tools, Amy Chen (dbt Labs) will unpack how data folks prioritize what to learn and how they actually learn them in this interactive session.

Check the slides here: https://docs.google.com/presentation/d/1-O3Woc3TpE8fG-HYWSAUa7sHcQp1NgrWNmE3lBIn60U/edit

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Breaking Bad (deployment habits)

Have you ever deployed a model without fully understanding the downstream effects? In this talk, we'll explore how to ensure quality throughout the model deployment process - answering questions, like: When are dbt tests not enough? Is it possible to conduct a quality PR review in under 5 minutes? How do we detect breaking changes before they reach production?

Check the slides here: https://docs.google.com/presentation/d/1ELeHag880v8bmJDmMaES95k90wu-twDXDg-aZGyP7AQ/edit#slide=id.g158fcd520ea_0_155

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.