talk-data.com talk-data.com

Event

dbt Coalesce 2022

2022-10-11 YouTube Visit website ↗

Activities tracked

130

Sessions & talks

Showing 101–125 of 130 · Newest first

Search within this event →
Democratizing data at Zillow with dbt, Airflow, Spark, and Kubernetes

Democratizing data at Zillow with dbt, Airflow, Spark, and Kubernetes

2022-10-25 Watch
video
Deepak Konidena (Zillow)

Building data pipelines is difficult—and adding a data governance and observability framework doesn’t make it any easier. But that was the task ahead for Deepak Konidena during his early days at Zillow. In this session, he’ll share how the platform they build on top of dbt, Airflow, Spark, and Kubernetes—ZSQL—eliminated the need for internal data teams to build their own DAGs, models, schemas and lineage from scratch, while also providing an easy way to enforce data quality, monitor changes, and alert on disruptions.

Check the slides here: https://docs.google.com/presentation/d/18HEil3_nXD8nYBhcg4m-Kpy8I8Na6MXI/edit?usp=sharing&ouid=110293204340061069659&rtpof=true&sd=true

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Demystifying event streams: Transforming events into tables with dbt

Demystifying event streams: Transforming events into tables with dbt

2022-10-25 Watch
video
Charlie Summers (Merit)

Pulling data directly out of application databases is commonplace in the MDS, but also risky. Apps change quickly, and application teams might update database schemas in unexpected ways, leading to pipeline failures, data quality issues, data delivery slow-downs. There is a better way. In his session, Charlie Summers (Merit) describes how their organization transforms application event streams into analytics-ready tables, more resilient to event scheme changes.

Check the slides here:https://docs.google.com/presentation/d/1K5PcoVshiHKZs_xI3K4P5JRNYTkbmnQJPMl8NmBlGfo/edit

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Detecting Data Anomalies via an Inspection Layer

Detecting Data Anomalies via an Inspection Layer

2022-10-25 Watch
video
John Rensberger (Red Pill Analytics) , Neal Achord (Red Pill Analytics)

Let's face it, we can't get enough data these days and often ingest from various sources like vendors, IoT devices, and more. Unfortunately, you've likely encountered times when the data just isn't what you're expecting. For instance; when the data has nulls, duplicates, is arranged differently than the schema specification, or others - this can be a weak point for many data pipelines. We'll showcase a way to handle this using dbt native methods to implement an inspection layer to ensure erroneous data sets can be flagged and quarantined while the rest can load uninterrupted.

Check the slides here: https://docs.google.com/presentation/d/11Q9wwMfyz6xuxMXCPizFg4DKSY_zOIHPNOrsNI8oBn8/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Driving actionable insights

Driving actionable insights

2022-10-25 Watch
video
Christian Franklin (phData) , Cory Koster (phData)

See how visual data modeling and dbt combine to improve interaction and understanding between analytics engineering practitioners, product owners, and business partners. We will demonstrate conceptual and logical modeling techniques and diagrams to establish common understanding, enhance business partner collaboration, enhance translation of requirements, and ultimately complement analytics engineering within dbt to improve time to value. Demonstrate how to pair data modeling concepts (conceptual, logical, physical) and tools (SqlDBM) to engage your customers and inform the analytics engineering with dbt and Snowflake. We will show how this workbench and tools complement the analytics lifecycle for engineers and data consumers alike. The workbench includes a dbt, a visual modeling tool, and phData Toolkit CLI.

This session requires pre-registration. Sign up here. If session is filled you are welcome to come to the room and join the waitlist onsite. Open seats will be made available 10 minutes after session start.

Check the slides here: https://docs.google.com/presentation/d/1fJhaMGvD7TvVft4nEJYhMRhyanQTw3lbzLrgZFsmj-0/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Driving impact with a fine-toothed comb

Driving impact with a fine-toothed comb

2022-10-25 Watch
video
Andie DeLeon (Thirty Madison)

Today’s analytics engineers weren’t at the top of their class for any one thing in particular—they’re misfits—having tried and maybe even failed at any number of things. But these misadventures always revealed a common trait—insatiable curiosity for the way things work—the logic and order applied to anything from the music industry to particle physics. In this talk, Andie DeLeon (Thirty Madison) shares how current and aspiring analytics engineers can translate their experience in “passion projects” to driving real business impact for organizations in any domain.

Check the slides here: https://docs.google.com/presentation/d/14-pTSULJCn6j7PKJB81U33nClurvds25Hr1eZkvsfI8/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

From Data Magician to Data Coach

From Data Magician to Data Coach

2022-10-25 Watch
video
Jerrie Kumalah (SeatGeek)

Are you constantly feeling forced to choose between “on time” or “actually useful?” The latter only feels possible when data teams have the luxury to slow down enough to ask the question beneath the question—to understand what stakeholders are actually trying to solve, and whether your work will have a material impact. But that shouldn’t be a luxury! In this session you’ll learn how to slow down without slowing down your deliverables—for happy stakeholders, and actually useful data products.

Check the slides here: https://www.canva.com/design/DAFNqYtPL3o/nUgayAXaPRTDz1GUnfNTpA/edit?utm_content=DAFNqYtPL3o&utm_campaign=designshare&utm_medium=link2&utm_source=sharebutton

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Getting jiggy with jsonschema: The power of contracts for building data systems

Getting jiggy with jsonschema: The power of contracts for building data systems

2022-10-25 Watch
video

Is your SQL query the problem, or how you ask for the data you need, when you need it. In this deep dive, Jake Thomas shares his hypothesis for why the jsonschema is the ticket to contract-driven communication, system interoperability, and an overall improvement to data processing quality of life.

Check the slides here: https://docs.google.com/presentation/d/1kiGyQF7NUWfx-5RyIyeEwSUCwqtIdrXADeI2iixUgiI/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Hands-on: the dbt Semantic Layer

Hands-on: the dbt Semantic Layer

2022-10-25 Watch
video
Cameron Afzal (dbt Labs)
dbt

The long-awaited dbt Semantic Layer is finally here. By defining metrics centrally in dbt, data teams can trust that business logic referenced anywhere will be exactly the same everywhere. Experience it in action in this hands-on session with the dbt Labs product team and dbt Labs partners.

Check the slides here: https://docs.google.com/presentation/d/1lOH6Sb8DQnnlmZkYOlqqHgQeXKkUEQCm_LOxsjBRJlM/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

How the Content Analytics team at Spotify avoids data indigestion in BigQuery with dbt

How the Content Analytics team at Spotify avoids data indigestion in BigQuery with dbt

2022-10-25 Watch
video
Nick Baker (Spotify) , Mitchell Silverman (Spotify) , Brian Pei (Spotify)

When the content analytics team at Spotify adopted dbt and shifted away from an internally developed transformation tool, they needed to figure out how to access data produced by other teams using sharded partitions. Enter: Waluigi. Nick Baker, Brian Pei, and Mitchell Silverman show us how an internal package used to safely and smoothly ingest the data they need also helped empower other data teams to more easily adopt dbt and leverage the data they produce.

Check the slides here: https://docs.google.com/presentation/d/1uAzfKa2Usbbr7J6jcI-IwdK1gPTKEsWSgZEfTU5bulM/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Jumpstart dbt: How to Achieve Speed and Scale

Jumpstart dbt: How to Achieve Speed and Scale

2022-10-25 Watch
video
Virendra Singh (Cisco) , Alex Garbarini (Cisco)

For enterprises integrating dbt into their transformation technology stack, of paramount importance is how to achieve speed and scale. Sure, dbt will "infinitely scale" due to its underlying cloud native deployment- but only in reference to hosting, execution, and other platform services. HOW does one onboard hundreds or thousands of users with repeatability, conformity, and engineering excellence by design? HOW does an organization integrate dependent platforms and services? Centrally monitor? Share reusable assets? Maintain security?... This talk identifies how Cisco enabled automated services and processes to achieve that scale. Walk away knowing what's in store for you if onboarding dbt, headwinds we faced, and the success Cisco is seeing in our chosen deployment paradigm.

Check the slides here: https://docs.google.com/presentation/d/1e4fG0_60APnCmFDV5a8X7sPOCbKlto3L/edit?usp=sharing&ouid=110293204340061069659&rtpof=true&sd=true

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Operational AI for the Modern Data Stack

Operational AI for the Modern Data Stack

2022-10-25 Watch
video
Tristan Zajonc (Continual)

The opportunities for AI and machine learning are everywhere in modern businesses, but today's MLOps ecosystem is drowning in complexity. In this talk, we'll show how to use dbt and Continual to scale operational AI — from customer churn predictions to inventory forecasts — without complex engineering or operational burden.

Check the slides here: https://docs.google.com/presentation/d/1vNcQxCjAK4xZVZC1ZHzqBzPiJE7uwhDIVWGeT9Poi1U/edit#slide=id.g15b1f544dd5_0_1500

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Preparing for the Next Wave: Data Apps

Preparing for the Next Wave: Data Apps

2022-10-25 Watch
video
Kevin Marr (Firebolt) , Jay Rajendran (Firebolt)

Data apps are the next wave in analytics engineering. The explosion of data volume and variety combined with an increasing demand for analytics by consumers, and a leap in cloud data technologies triggered an evolution of traditional analytics into the realms of modern data apps. Question is: How do you prepare for this wave? In this session we’ll explore real-world examples of modern data apps, and how the modern data stack is advancing to support sub-second and high concurrency analytics to meet the new wave of demand. We will cover: performance challenges, semi-structured data, data freshness, data modeling and toolsets.

Check the slides here: https://docs.google.com/presentation/d/1MC18SgT_ZHOJePjYizz_WT7dVveaycNw/edit?usp=sharing&ouid=110293204340061069659&rtpof=true&sd=true

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Sculpting data for machine learning

Sculpting data for machine learning

2022-10-25 Watch
video
Rishabh Misra (Twitter) , Jigyasa Grover (Twitter)

In the contemporary world of machine learning algorithms - “data is the new oil”. For the state-of-the-art ML algorithms to work their magic, it’s important to lay a strong foundation with access to relevant data. But the skills required to make those datasets meaningful for machine learning...that's an art.

Join Jigyasa Grover and Rishabh Misra of Twitter as they talk about the power of the most fundamental aspect of machine learning—dataset curation—and walk through the process of constructing good quality datasets with a hands-on Pythonic example.

Check the slides here: https://docs.google.com/presentation/d/1HOii7Z_F-75euxcjIY2H6OmJ6EsYLOLRLutMxGcrW_Y/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Streaming with dbt: the Jaffle Shop don’t stop!

Streaming with dbt: the Jaffle Shop don’t stop!

2022-10-25 Watch
video
Anna Glander (Materialize) , Marta Paes (Materialize)

In between JVM languages, high-maintenance frameworks and academic papers, streaming remains a hard beast to tame for most of us. What if nothing had to change, and streaming just meant…still writing dbt models? At Materialize, we’re exploring how to make the most of dbt for streaming — from real-time analytics to continuous testing, and beyond! Join us to learn how to get started with no blood, sweat or tears, using the Jaffle Shop as a playground. Our toolbox? A database that feels like Postgres but works like all the streaming systems you’ve been avoiding, some SQL and a dash of magic.

Check the slides here: https://docs.google.com/presentation/d/11PANQElVxtzqgzmRCcQfZy24vdMeYDokpxr7LdlrbrE/edit#slide=id.g105b4fffa32_0_942

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Testing: Our assertions vs. reality

Testing: Our assertions vs. reality

2022-10-25 Watch
video
Mariah Rogers (Palmetto)
dbt

Testing data models is sometimes like trying to form a bust from clay made of cornstarch and water. Right when you think you've got it into the right shape and set it on a shelf, it completely melts into a puddle of mush. Our practice of testing transformations on top of shifting, changing data falls apart in the same way over and over again, yet we don't learn our lesson. Come learn from Mariah Rogers (Palmetto) why we're doing model testing wrong, how we can change our ways to do it better, and what problems will be essential for the dbt Community to solve together to bridge the gap.

Check the slides here: https://docs.google.com/presentation/d/1oTWnOJxCSRN7ihgI-SflQCBkA7cwmcpGvryOh1vWKoc/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

The modern data team

The modern data team

2022-10-25 Watch
video
Abhi Sivasailam (Flexport)

The "socio" is inseparable from the "technical". In fact, technological change often begets social and organizational change.

And in the data space, the technical changes that some now refer to as the "modern data stack" call for changes in how teams work with data, and in turn how data specialists work within those teams. Enter the Modern Data Team.

In this talk, Abhi Sivasailam will unpack the changing landscape of data roles and teams and what this looks like in action at Flexport. Come learn how Flexport approaches data contracts, management, and governance, and the central role that Analytics Engineers and Product Analysts play in these processes.

Check the slides here: https://docs.google.com/presentation/d/1Sgm3J6EkeKQf5D1MKopsLLAMOhAZ05CxDlei2mbDE90/edit#slide=id.g16424dcc8d3_0_1145

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

When sparks fly: How a post-merger 1,000+ person company founded a data team

When sparks fly: How a post-merger 1,000+ person company founded a data team

2022-10-25 Watch
video
Patrick Miller (Newfront)

Not every company has a data team in it’s founding five… or even five hundred. It wasn’t until Newfront was nearing 1,000 employees after an M&A event that data became a serious focus. But bringing order to so many distributed systems wasn't easy. Epic migrations, stakeholder clashes, and adventures in activations… the story of how Newfront made the warehouse the center of their universe is the hero story every data team at a larger company needs to hear.

Check the slides here: https://docs.google.com/presentation/d/1i3BrxtQrsKoAg7IiCABw-A1Gr7QGigpb4hqTxSK_8y4/

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

When the Real World Messes with Your Schedule: Event Driven Dbt Models for the MDS

When the Real World Messes with Your Schedule: Event Driven Dbt Models for the MDS

2022-10-25 Watch
video

The real world is unreliable. Planes take off late, trains leave early, and cars break down. Sometimes, we need to get data from a source without a standard connector. Sometimes, a schedule really doesn't cut it. In this talk, we'll build a pipeline that responds to events to ensure that data is delivered quickly and reliably. We'll also ensure it can handle failure and keep bad data from clogging the plumbing.

Check the slides here: https://docs.google.com/presentation/d/1W9p7H4l0fUr7iAJ3GxEGUTmWGtmc_iu02N-MKb2BSFM/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Why rent when you can own? Build your modern data lakehouse with true optionality

Why rent when you can own? Build your modern data lakehouse with true optionality

2022-10-25 Watch
video
Tom Nats (dbt Labs) , Brian Zhan (dbt Labs)

With Trino (formerly PrestoSQL) and dbt combined, you can get faster access to your data and the ability to analyze data across multiple data sources with ease. Extract, load and transform data in your data lakehouse easier than ever before using dbt’s Trino adapter. Join Brian Zhan and Tom Nats as they talk about the new dbt connector for Trino and how it works, along with a demo showing how easy it is to deploy, build and serve up analytics using dbt and Starburst Galaxy.

Check the slides here: https://docs.google.com/presentation/d/1-A-mfc1RIj87ypz6KeZvxK62QLaGthmMqBPy10vNnDk/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Workshop: Advanced Testing

Workshop: Advanced Testing

2022-10-25 Watch
video
Lauren Benezra (dbt Labs) , Bennie Regenold (dbt Labs)
dbt

Do you want to take your dbt project beyond simple unique and not-null tests, but don’t know where to start? Join the dbt Labs team for a deep dive into testing. You’ll learn how to customize tests to fit your unique needs, lean on the amazing dbt community for pre-built tests you can add straight to your project, and flex your Jinja skills by creating your own custom tests. By the end of this course you’ll be walking tall knowing that the data you’re providing to your customers is clean, reliable, and consistent.

Check the slides here: https://docs.google.com/presentation/d/1TCehN5TxHYIuE6gk3rCGx1f9kLkkcXM7TnfcDejUnqo/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Workshop: Build your first dbt Python model

Workshop: Build your first dbt Python model

2022-10-25 Watch
video
Nicholas Yager (dbt Labs) , Wasila Quader (dbt Labs)

Description: dbt now supports Python models! In this hands-on workshop you’ll learn how to build your first Python models in dbt, alongside SQL at the center of your transformations.

You’ll learn how to: - Build your Python transformation in a notebook - Add this transformation as a model in your dbt project - Decide between building models in SQL or in Python

Prerequisites: - Basic familiarity with Python and DataFrames - If you want to use your own Warehouse and dbt project, make sure that you have dbt 1.3 installed and have followed the “additional setup” from our docs

Check the slides here: https://docs.google.com/presentation/d/133CVwwAxc5qT80ZJwngQ_ZSikOkCttvzWwGpdZCgOHQ/edit#slide=id.g1693e59a4f4_0_0

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

How Entity Modeling Accelerates Product Led Growth

How Entity Modeling Accelerates Product Led Growth

2022-10-25 Watch
video
Rachel Bradley-Haas (BigTimeData.io)

The gap between engineering and business teams is widening. The better engineering teams get at iterating to support new features, the harder it is for business teams to keep up with the nuance in a rapidly evolving customer journey. In this session, Rachel Bradley-Haas (BigTimeData.io) takes a step back to explain why defining entities, relationships, and properties, helps build a scalable and cohesive data model that business users can action to accelerate PLG motions.

Check the slides here: https://docs.google.com/presentation/d/1genfMH9v8mZgZBCW-yMhvRla4DwVwoeD6fqSf5L9RGU/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Data Change Management: Lessons Learned at Vouch

Data Change Management: Lessons Learned at Vouch

2022-10-25 Watch
video
Kshitij Aranke (Vouch Insurance)
dbt

Allowing more people into the data development process can improve shipment speed, but can also cause some anxiety for folks that wonder how to preserve best practices as participation expands. In his session, Kshitij Aranke (Vouch Insurance), shares how his team created safe inroads to communal development through automated change management on dbt projects that provided automatic best practice checks on each pull request.

Check Google Slides here: https://docs.google.com/presentation/d/17D2DC4KUxfLopYLMvK4ywFVy5MPaPFeRE9fskkir0CM/edit#slide=id.gac0f4c9a75_0_0

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Data Led is Dumb

Data Led is Dumb

2022-10-25 Watch
video
Emilie Schario (Netlify)

More and more companies are raising the “data-led” banner as manifest operational excellence. But is it true? Who's leading who? In her talk, long-time Coalesce dynamo, Emilie Schario (Amplify Partners) challenges data teams to start treating data like a product not a prophet—no better or worse than the people and processes it represents.

Check the slides here: https://docs.google.com/presentation/d/16Zeffh3ODiRkTj_mnugnMdozC4ex-Q4ejyJl_Y33mJc/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Minimum viable (data) product

Minimum viable (data) product

2022-10-25 Watch
video
Michal Kolacek (Slido)

Analytics work mirrors product development: identify a user need, build a minimum viable product to address that need, evaluate the impact and iterate. In this talk, Michal Kolacek, analytics engineer at Slido describes how MVP-like thinking can help data teams counterbalance and complement the standardized approaches of dbt.

We will walk through Slido’s evolution in their approach, tooling and the vision of building better data products using Deepnote notebooks. Finally, we will take a look under the hood of the new dbt integration in Deepnote and outline how data teams can use it to accelerate model prototyping and metrics workflows.

Check the slides here: https://docs.google.com/presentation/d/1-L7ndud6z5gsFtF3WdjA6AVG40_vrCAcVRNarqWNtPg/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.