dbt

How Preset Integrates dbt with Apache Superset to Deliver on Headless BI & Surface Metrics

2022-10-25 · dbt Coalesce 2022 Watch

video

BI Git GitHub Superset

At Preset, we offer a managed service for Apache Superset, the most popular open source business intelligence platform (by Github stars) in the world. We believe the future of BI is not only rooted in open source but also adopts the best ideas from the software development life cycle. To that end, we've created a workflow that enables you to manage Superset datasets, charts, and dashboards as code and we integrated dbt into our platform. In this talk, I'll showcase the speed and change management benefits that are enabled by this workflow of managing core BI assets using dbt and version control.

Check the slides here: https://docs.google.com/presentation/d/1SjbXOgJnuAnmu3B3cY1YAEOMZdARH72Siwneq2yRjfU/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Back to the Future: Where Dimensional Modeling Enters the Modern Data Stack

2022-10-25 · dbt Coalesce 2022 Watch

video

by John Barcheski (Analytics&) , Tony Dahlager (Analytics&)

Analytics dimensional modeling Modern Data Stack

dbt’s powerful capabilities allow data teams to deliver data products and analytics solutions to solve business problems faster than ever. Yet still, even with the best modern technologies, challenges arise. How can you be certain what your building will stand up to changing requirements? How can you connect disparate parts of your business to derive new insights? The answer may be a blast from the past—but the fundamentals never change. Learn how to apply fundamental techniques—like dimensional modeling—to modern tools, helping you to build scalable and reusable solutions to solve data problems today, and in the future.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Beyond the buzz: 20 real metadata use cases in 20 minutes with Atlan and dbt Labs

2022-10-25 · dbt Coalesce 2022 Watch

video

by Prukalpa Sankar (Atlan)

Analytics AWS Glue Databricks Fivetran Looker Snowflake Tableau

for a few use cases like static and passive data catalogs. However, active metadata can be the key to unlock a variety of use cases, acting as the glue that binds together our diverse modern data stacks (e.g. dbt, Snowflake, Fivetran, Databricks, Looker, and Tableau) and diverse teams (e.g. analytics engineers, data analysts, data engineers, and business users)! At Atlan, we’ve worked closely with modern data teams like WeWork, Plaid, PayU, SnapCommerce, and Bestow. In this session, we’ll lay out all our learnings about how real-life data teams are using metadata to drive powerful use cases like column-level lineage, programmatic governance, root cause analysis, proactive upstream alerts, dynamic pipeline optimization, cost optimization, data deprecation, automated quality control, metrics management, and more. P.S. We’ll also reveal how active metadata and the dbt Semantic Layer can work together to transform the way your team works with metrics!

Check the slides here: https://docs.google.com/presentation/d/1xrC9yhHOQ00qWt-gVlgbakRELg2FzEPt-RwMsUWzdZA/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Build an Open Lakehouse with dbt Labs and Dremio

2022-10-25 · dbt Coalesce 2022 Watch

video

Analytics Data Lakehouse Dremio

Data teams are tasked with integrating a growing number of data sources, and enabling broad, self-service access to a consistent and unified view of that data to a growing number of technical and non-technical data consumers for analytics. In this session, learn how dbt and the Dremio open lakehouse platform work together to simplify data architectures, unify data sources, and get insights into the hands of data consumers fast, and how the new connector delivers a seamless user experience across platforms.

Check the slides here: https://docs.google.com/presentation/d/1ovzCrr1DnPF0n0JMVnPrceAcOZHSyD_aCaayjK8oISo/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Building a Data Platform from Scratch with dbt, Snowflake and Looker

2022-10-25 · dbt Coalesce 2022 Watch

video

by Prateek Chawla (Monte Carlo)

Airflow Cloud Computing Data Quality Looker Monte Carlo Snowflake Spark

When Prateek Chawla, founding engineer, joined Monte Carlo in 2019, he was responsible for spinning up our data platform from scratch. He was more of a backend/cloud engineer, but like with any startup had to wear many hats, so got the opportunity to play the role of data engineer too. In this talk, we’ll walk through how we spun up Monte Calro’s data stack with Snowflake, Looker, and dbt, touching on how and why we implemented dbt (and later, dbt Cloud), key use cases, and handy tricks for integrating dbt with other popular tools, like Airflow, and Spark. We’ll discuss what worked, what didn’t work, and other lessons learned along the way, as well as share how our data stack evolved over time to scale to meet the demands of our growing startup. We’ll also touch on a very critical component of the dbt value proposition, data quality testing, and discuss some of our favorite tests and what we’ve done to automate and integrate them with other elements of our stack.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

But I won't do that — things you shouldn't do with dbt

2022-10-25 · dbt Coalesce 2022 Watch

video

by Randy Pitcher (dbt Labs)

dbt makes easy things easy and hard things possible. But there’s a few things that you really shouldn’t try if you value your time—and sanity. In this light-hearted session, Randy Pitcher (dbt Labs) will share some of the most common (and egregious) dbt workarounds that can tank your project, productivity, and pride.

Check the slides here: https://docs.google.com/presentation/d/1iDVPeDBfXGL5MiKHZlskL8tKBkdjy1Lxi8T8twNDPCg/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Data Apps in the Real World: How to Capture Value Locked in the Data Warehouse

2022-10-25 · dbt Coalesce 2022 Watch

video

by Kevin Chao (Ramp) , Tejas Manohar (Hightouch) , TJ Murphy (Multi Media LLC)

Data Science DWH Marketing Snowflake

Should you consider building a Data App?

How many times has your product team asked for data science models to be available in realtime to serve feature flags and product recommendations to customers? They don’t, but they should, and with data apps the data team can make this a reality.

Join TJ Murphy of Multi Media LLC, Kevin Chao from Ramp, and Tejas Manohar from Hightouch to hear examples of data apps in the real world. Their aim is to give data practitioners a framework for when and why to use the warehouse for production applications, and why the data team is the right team for this undertaking.

TJ will walk through the data apps he built at Minted, including a user personalization service and marketing automation tools. At Minted, the data team supported a GraphQL layer on top of the warehouse that supported both web and mobile app personalization on a per user basis.

Kevin Chao will share how Ramp, a fintech leader valued at $8B, is using dbt and Hightouch to power compliance via Snowflake as the source of truth.

Tejas will share how Supr Daily, the Instacart of India, runs product recommendations in their mobile app and automatically sends push notifications at opportune moments to convert users at a higher rate.

Lastly, TJ will give a practical overview of architecture, and a checklist of what to think through before building a Data App.

Check the slides here: https://docs.google.com/presentation/d/1LMuuuvVy3QD2ZAltp5c1Eh5Ik4LgM0q-AMlThsZVR40/edit#slide=id.g166573b6b47_0_0

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Data automation with dollars on the line: Forecasting 7-figure deals with Hex, dbt & Hightouch

2022-10-25 · dbt Coalesce 2022 Watch

video

by Adam Whitaker (Bluecore)

Your CCO slacks you at 9pm. A multi-million-dollar account is underperforming, and your quarter’s churn goal is on the line. But what does underperforming mean? Who decided? And why are you only just hearing about it now? It’s your job to answer all of those questions—before tomorrow morning. In his session, Adam Whitaker (Bluecore) shares how his team compeltely redesigned their customer retention and reporting programs with a new way to build reliable deal forecasts, automate operations and alerting, and swap opaque health scores for real-time actual/projection comparisons.

Check the slides here: https://docs.google.com/presentation/d/1CdMJ3_DnsJHwjXRyWvDifmio_WqPPVxrHOwTUY06y9Q/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

dbt & data mesh: the perfect pair (?)

2022-10-25 · dbt Coalesce 2022 Watch

video

by Guillermo Sanchez (GoDataDriven)

In this session, Guillermo Sanchez, (GoDataDriven) will attempt to summarize the last four major socio-technical changes that data teams have faced in the last few years, and why dbt plus the highly contentious data mesh might be the answer to each.

Check the slides here: https://drive.google.com/file/d/1LRmAJt3roIASxgJKIFsqbyZPcQrxjhAR/view?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

dbt Labs and Databricks: best practices and future roadmap

2022-10-25 · dbt Coalesce 2022 Watch

video

by Bilal Aslam (Databricks) , Nana Essuman (Conde Nast)

AI/ML Analytics Cloud Computing Data Lakehouse Databricks Python SQL

The Databricks Lakehouse Platform unifies the best of data warehouses and data lakes in one simple platform to handle all your data, analytics and AI use cases. Databricks now includes complete support for dbt Core and dbt Cloud and you will hear from Conde Nast using dbt and Databricks together to democratize insights. We will also share best practices for developing and productionizing dbt projects containing SQL and Python, governing data with standard SQL, and exciting features on our roadmap such as materialized views for Databricks SQL.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Escape from Data Island - Orchestrate and Connect Your Data Stack for Smooth Sailing

2022-10-25 · dbt Coalesce 2022 Watch

video

Modern Data Stack

The Modern Data Stack is becoming more and more fragmented. With new tools and processes popping up continuously, it’s easy to get stranded on various “data islands”, with everything running independently. In this session, we’ll teach you:

What benefits you gain by turning your Modern Data Stack into an Integrated Data Stack
How Shipyard can help you quickly connect the data tools you already use
How orchestration is the missing step in your data journey to get your team off “data islands”
How dbt fits into the picture of a connected data stack

Check the slides here: https://docs.google.com/presentation/d/1NT7RnMtTLxb5ew5_VXXzciJusfbBKZJx86B6BtNZv90/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Field-level lineage with dbt, ANTLR, and Snowflake

2022-10-25 · dbt Coalesce 2022 Watch

video

by Helena Munoz , Mei Tao , Xuanzi Han (Monte Carlo)

Airflow Analytics Data Modelling Modern Data Stack Monte Carlo Snowflake

Lineage is a critical component of any root cause, impact analysis, and overall analytics heath assessment workflow. But it hasn’t always been easy to create, particularly at the field level. In this session, Mei Tao, Helena Munoz, and Xuanzi Han (Monte Carlo) tackle this challenge head-on by leveraging some of the most popular tools in the modern data stack, including dbt, Airflow, Snowflake, and ANother Tool for Language Recognition (ANTLR). Learn how they designed the data model, query parser, and larger database design for field-level lineage—highlighting learnings, wrong turns, and best practices developed along the way.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Introducing dbt with Databricks

2022-10-25 · dbt Coalesce 2022 Watch

video

by Roberto Salcido (Databricks) , Prasad Kona (Databricks) , Pradeep Anandapu (Databricks)

Analytics Data Lake Data Lakehouse Databricks Modern Data Stack SQL

In this live, instructor-led hands-on lab, you’ll learn how to build a modern data stack with Databricks and dbt, using dbt to manage data transformations in Databricks and perform exploratory data analysis on the clean data sets using Databricks SQL. Based on the lakehouse architecture and built on an open data lake, data analysts, analytics engineers, and data scientists can use dbt and Databricks to work with the freshest and most complete data, and quickly derive new insights for accurate decision-making.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Mastering the art of dbt package development

2022-10-25 · dbt Coalesce 2022 Watch

video

by Sheri Nguyen (Fivetran)

Fivetran

Welcome to the dbt Package show! Have you ever wondered how Fivetran approaches building dbt packages? Well, put on your aprons, because the Fivetran team is going to give you a glimpse at their recipe for building packages with dbt from 0 release to implementation. Sheri Nguyen of Fivetran will show you:

How they leverage the dbt Community to get the right ingredients for our packages and plan our roadmap for end state package models
Their methodologies and standards for preparing ingredients for our data models to provide flexibility for all our users
How they combine everything together and cook up our models for our dbt packages

Finally, after all the cooking is done, the best part is sharing what you've made with your friends and family. Sheri will also show you how different companies use our various packages and how you can contribute! This talk will benefit those who want to leverage Fivetran's dbt packages, understand why we apply certain modeling practices, and understand how we continually iterate on these packages. Additionally, this session will be beneficial for those who also want to be dbt package maintainers and understand how they can contribute future packages.

Check the slides here: https://docs.google.com/presentation/d/1QKvnxnfRBrZKOnBGwXkLnc-EjLS4S3mbSe59kjHhMos/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Moving to predictive: How to assemble the beginnings of your feature store with Snowflake & dbt Labs

2022-10-25 · dbt Coalesce 2022 Watch

video

by Miles Adkins (Snowflake)

AI/ML Analytics Snowflake

Historically, analytics has been focused on "what happened." And to this day, newer and newer generations of tooling, dbt for example, have come forth accelerating the speed and utility of data in an enterprise for decision making. Machine learning, on the other hand (the "what will happen"), has seemingly been stood up as a separate silo with an organization with seemingly "more intricate" technical requirements, the need for "data scientist", and done so all in the name of how to handle "more special" data resulting in "more accurate" decision making. In this session, you will learn how to cut through the noise and extend and leverage your analytic practice with Snowflake and dbt Labs into the realm of machine learning by pairing your analytical pipelines with a feature store layer to declaratively serve both model training and model scoring scenarios, even at some of the lowest latency (real-time) production requirements.

This session requires pre-registration. Sign up here. If session is filled you are welcome to come to the room and join the waitlist onsite. Open seats will be made available 10 minutes after session start.

Check the slides here: https://docs.google.com/presentation/d/1H-aPsc2DkPGcJUV4pd_iLSdMMMAMZTEtsAFO4hpglXM/edit#slide=id.g15a4510fa6a_0_560

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Nobody puts metrics in a corner: How to activate your dbt models

2022-10-25 · dbt Coalesce 2022 Watch

video

Analytics Thoughtspot

dbt has changed the game for data practitioners, bringing velocity, organizational efficiency, and increased trust to modern analytics workflows. But what happens to your dbt models after they’ve been built? Too often the value you create goes untapped by the business, or accessed only by a select few. dbt Labs and ThoughtSpot are teaming up to unleash the true potential of your transformed data. Learn how to deliver trustworthy data and insights to frontline users at scale with safe, reliable self-service analytics.

Check the slides here: https://docs.google.com/presentation/d/1XFyP8d0wkfnA1_59zb1zbNYInSSzdI2qe09f6OkhRE0/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Outgrowing a single `dbt run`

2022-10-25 · dbt Coalesce 2022 Watch

video

by Prratek Ramchandani (Vox Media)

Analytics Dagster

When does your team decided it’s time to move beyond a singular dbt run? For most analytics engineers, there comes a time when the dbt run commands on fixed schedules simply won’t make the cut. Join Prratek Ramchandani (Vox Media) as he breaks down an alternative approach to orchestrating your dbt project with Dagster that balances meeting SLAs with safely handling the edge cases a simple schedule-based dbt run might create.

Check the slides here: https://docs.google.com/presentation/d/1zivYO_EpN6T9JYM9HjAJAz3bK3e2TREwdKffylkzuUw/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Petabyte-scale lakehouses with dbt and Apache Hudi

2022-10-25 · dbt Coalesce 2022 Watch

video

Airflow Big Data Data Lakehouse Hudi Spark Data Streaming

While the data lakehouse architecture offers many inherent benefits, it’s still relatively new to the dbt community, which creates hurdles to adoption.

In this talk, you’ll meet Apache Hudi, a platform used by organizations to build planet-scale data platforms according to all of the key design elements required by the lakehouse architecture. You’ll also learn how we’ve personaly used Hudi, along with dbt, Spark, Airflow, and many more open-source tools to build a truly reliable big data streaming lakehouse that cut the latency of our petabyte-scale data pipelines from hours to minutes.

Check the slides here: https://docs.google.com/presentation/d/18dv4TZzRnZQ-IK7xLkYJuind4Bcztkl19zV7b4HTaTU/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Snowflake yourself: Using dbt for Snowflake access control management

2022-10-25 · dbt Coalesce 2022 Watch

video

by Daniel Corley (SpotOn)

Snowflake

Calling all Snowflake and dbt admins! Come learn how to utilize dbt to manage Snowflake users and network policies and keep your data stack simple and lean while version controlling Snowflake’s RBAC and network policies.

Check the slides here: https://docs.google.com/presentation/d/1IxmDMgVaOYb0PTEWej-k3l0avKrmHI970oT7_Xpg0Z4/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Standardizing the unstandardized: dbt modeling for Web 3.0

2022-10-25 · dbt Coalesce 2022 Watch

video

by Alec Kamra (Mythical Games)

API Blockchain Smart Contracts

Web 3.0 is constantly evolving: everyday there are new smart contracts, project updates, tokens and chains. And just because the data is on the blockchain and public does not mean it's easily accessible and digestible. It may be easy to monitor specific parts of the blockchain such as a particular smart contract, but how do you build a scalable infrastructure flexible enough to account for any new business request in a rapidly evolving industry?

Join Alec Kamra (Mythical Games) as he shows you how to do just this. As a blockchain gaming platform, Mythical Games has built a stable and flexible multi-chain solution using Google BQ's public datasets and a few external APIs that allow them to monitor all trades and transfers for their business needs.

Check the slides here:https://docs.google.com/presentation/d/14wcMHKAGhm9qvZ2NlPSv1wQj5FJF-ttL4k3lAivt6Fk/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

talk-data.com

Activity Trend

Top Events

Top Speakers

How Preset Integrates dbt with Apache Superset to Deliver on Headless BI & Surface Metrics

Back to the Future: Where Dimensional Modeling Enters the Modern Data Stack

Beyond the buzz: 20 real metadata use cases in 20 minutes with Atlan and dbt Labs

Build an Open Lakehouse with dbt Labs and Dremio

Building a Data Platform from Scratch with dbt, Snowflake and Looker

But I won't do that — things you shouldn't do with dbt

Data Apps in the Real World: How to Capture Value Locked in the Data Warehouse

Data automation with dollars on the line: Forecasting 7-figure deals with Hex, dbt & Hightouch

dbt & data mesh: the perfect pair (?)

dbt Labs and Databricks: best practices and future roadmap

Escape from Data Island - Orchestrate and Connect Your Data Stack for Smooth Sailing

Field-level lineage with dbt, ANTLR, and Snowflake

Introducing dbt with Databricks

Mastering the art of dbt package development

Moving to predictive: How to assemble the beginnings of your feature store with Snowflake & dbt Labs

Nobody puts metrics in a corner: How to activate your dbt models

Outgrowing a single `dbt run`

Petabyte-scale lakehouses with dbt and Apache Hudi

Snowflake yourself: Using dbt for Snowflake access control management

Standardizing the unstandardized: dbt modeling for Web 3.0