talk-data.com

Topic

Modern Data Stack

Activities

tagged

Activity Trend

28 peak/qtr

2020-Q1 2026-Q2

Top Events

Data Engineering Podcast 125 Databricks DATA + AI Summit 2023 16 dbt Coalesce 2022 16 O'Reilly Data Engineering Books 15 dbt Coalesce 2023 13 DataFramed 13 The Analytics Engineering Podcast 10 The Joe Reis Show 9 Data Council 2023 5 Making Data Simple 5 Modern Data Stack Conference 2023 5 Modern Data Stack Conference 2021 5

Top Speakers

Tobias Macey 126 Benn Stancil (Mode) 10 Joe Reis (DeepLearning.AI) 10 Richie (DataCamp) 8 Prukalpa Sankar (Atlan) 5 Al Martin (IBM) 5 Tristan Handy (dbt Labs) 5 Maxime Beauchemin (Preset) 4 Yuliia Tkachova (Masthead Data) 4 Jon Tate 4 Tarush Aggarwal (5xData) 4 Boris Jabes (Census) 3

Activities

Showing filtered results

All Video Podcast Book

Filtering by: dbt Coalesce 2022 ×

Customer showcase: Miro (hosted by dbt Labs)

2022-12-21 · dbt Coalesce 2022 Watch

video

by Stephen Pastan (Miro) , Felipe Leite (Miro)

Analytics dbt

How do you efficiently scale your analytics stack when your data and data team grows 10x in 2 years? How do you even start prioritizing what gets done when there's that much growth? In this talk, Felipe Leite and Stephen Pastan of Miro unpack their shift to a Modern Data Stack and share the vital technical changes they made to build a scalable and tech-forward data stack. Come join them in learning how they got to where they are today and what they’re working on for the future. More details coming soon!

Check the slides here: https://docs.google.com/presentation/d/1lLoRBYAv8wlJQuhSrflX4F6y4lqV3SnDNAU7BYxHYR8/edit#slide=id.p

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Babies and bathwater: Is Kimball still relevant?

2022-11-21 · dbt Coalesce 2022 Watch

video

by Sydney Burns (Brooklyn Data Co) , Josh Devlin

dimensional modeling

In a popular Coalesce 2020 talk, Dave Fowler challenged the relevancy of dimensional modeling and the star schema popularized by Ralph Kimball over 25 years ago, arguing the dimensional modeling method now does “more harm than good,” in the context of the modern data stack. But should we throw the baby out with the bathwater? In this talk, Josh Devlin and Sydney Burns (Brooklyn Data Co) suggests a different approach to bringing Kimball’s system into the modern era.

Check the slides here:https://docs.google.com/presentation/d/1HLP1FfCNZJUIF7JgT1ote5LaG2tDvsNRfrA9vIkf2n4/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Beyond pretty graphs: How end-to-end lineage drives better actions

2022-10-25 · dbt Coalesce 2022 Watch

video

Airflow Analytics Data Quality dbt Redshift Spark

Everyone is talking about data lineage these days, and for a good reason. Data lineage helps ensure better data quality across your modern data stack. But not everyone speaks the same lineage language. Data engineers use lineage for impact and root cause analysis. Analysts and Analytics engineers use lineage to trace jobs and transformations in their warehouses. And consumers use lineage to understand why data never reached their expected destination. This results in a narrow, siloed view lineage in which only one group benefits. It’s time to stop using siloed lineage views for pretty graphs and start using end-to-end lineage to drive focused actions. In the talk, you will learn:

• How data quality tailors to specific needs of data engineers, analysts, & consumers

• How data lineage should drive actions

• A real-world example of end-to-end data lineage with Airflow, dbt, Spark, and Redshift

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Back to the Future: Where Dimensional Modeling Enters the Modern Data Stack

2022-10-25 · dbt Coalesce 2022 Watch

video

by John Barcheski (Analytics&) , Tony Dahlager (Analytics&)

Analytics dbt dimensional modeling

dbt’s powerful capabilities allow data teams to deliver data products and analytics solutions to solve business problems faster than ever. Yet still, even with the best modern technologies, challenges arise. How can you be certain what your building will stand up to changing requirements? How can you connect disparate parts of your business to derive new insights? The answer may be a blast from the past—but the fundamentals never change. Learn how to apply fundamental techniques—like dimensional modeling—to modern tools, helping you to build scalable and reusable solutions to solve data problems today, and in the future.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Escape from Data Island - Orchestrate and Connect Your Data Stack for Smooth Sailing

2022-10-25 · dbt Coalesce 2022 Watch

video

dbt

The Modern Data Stack is becoming more and more fragmented. With new tools and processes popping up continuously, it’s easy to get stranded on various “data islands”, with everything running independently. In this session, we’ll teach you:

What benefits you gain by turning your Modern Data Stack into an Integrated Data Stack
How Shipyard can help you quickly connect the data tools you already use
How orchestration is the missing step in your data journey to get your team off “data islands”
How dbt fits into the picture of a connected data stack

Check the slides here: https://docs.google.com/presentation/d/1NT7RnMtTLxb5ew5_VXXzciJusfbBKZJx86B6BtNZv90/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Field-level lineage with dbt, ANTLR, and Snowflake

2022-10-25 · dbt Coalesce 2022 Watch

video

by Helena Munoz , Mei Tao , Xuanzi Han (Monte Carlo)

Airflow Analytics Data Modelling dbt Monte Carlo Snowflake

Lineage is a critical component of any root cause, impact analysis, and overall analytics heath assessment workflow. But it hasn’t always been easy to create, particularly at the field level. In this session, Mei Tao, Helena Munoz, and Xuanzi Han (Monte Carlo) tackle this challenge head-on by leveraging some of the most popular tools in the modern data stack, including dbt, Airflow, Snowflake, and ANother Tool for Language Recognition (ANTLR). Learn how they designed the data model, query parser, and larger database design for field-level lineage—highlighting learnings, wrong turns, and best practices developed along the way.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Introducing dbt with Databricks

2022-10-25 · dbt Coalesce 2022 Watch

video

by Roberto Salcido (Databricks) , Prasad Kona (Databricks) , Pradeep Anandapu (Databricks)

Analytics Data Lake Data Lakehouse Databricks dbt SQL

In this live, instructor-led hands-on lab, you’ll learn how to build a modern data stack with Databricks and dbt, using dbt to manage data transformations in Databricks and perform exploratory data analysis on the clean data sets using Databricks SQL. Based on the lakehouse architecture and built on an open data lake, data analysts, analytics engineers, and data scientists can use dbt and Databricks to work with the freshest and most complete data, and quickly derive new insights for accurate decision-making.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Keynote: The End of the Road for The Modern Data Stack You Know

2022-10-25 · dbt Coalesce 2022 Watch

video

by Margaret Francis (dbt Labs) , Tristan Handy (dbt Labs)

The products that make up the “modern data stack” have all grown to prominence over the past decade. In this heady time, so much has changed about how data work is done.

But some of the “rules of engagement” that defined the original modern data stack are starting to break down. As a result, big changes are coming for the data tooling ecosystem.

The end result? Better, more integrated tooling, used by more humans inside of every company, that actually understands the data that it is operating on.

This modern data stack—if we still want to call it that!—will be unrecognizable to its former self.

Check the slides here: https://docs.google.com/presentation/d/1G0c3w19AwBEWEzyd9vwTKK5zMvXR76-NPQn6x0xZoSg/view

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Modern Data Management: how to setup your data for success

2022-10-25 · dbt Coalesce 2022 Watch

video

by Alec Bialosky (Select Star)

Data Management

Got your Modern Data Stack setup, now what? A mature data practice goes beyond setting up the data pipeline, and ensures there are both systems and processes in place to make it easy for everyone to find and understand data. At Select Star, we work with many organizations to enable data discovery, so the “tribal knowledge” of data is searchable and understandable for everyone. In this session, we’ll share the best practices and change management tips for setting up a data discovery portal and making it the single source of truth of data for your business.

Check the slides here:https://docs.google.com/presentation/d/1F3CPBhWenf2jt5hmXrhXvOBe5wei6hcLUj95jZtVZEw/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Maximizing data leverage at Vendr with dbt and Metaplane

2022-10-25 · dbt Coalesce 2022 Watch

video

by Erik Edelmann (Vendr) , Kevin Hu (Metaplane)

Data Quality dbt GitLab SaaS

How do you support exponentially growing companies without breaking as a data team? The answer is increasing your leverage with tools and processes. This session centers around four principles to achieve this goal: 1. don’t reinvent the wheel, 2. make your own job easier, 3. save time for innovation, and 4. invest in onboarding.

First, the first data leader at Vendr, the SaaS buying platform with customers like GitLab, Brex, and The Washington Post, will share his learnings on building a stack and team that scaled as the company grew 10x from 30 to 300 employees in under two years.

Second, we’ll give a demo of how Metaplane pulls lineage and metadata from a modern data stack that is centered around dbt. By the end of the demo, you’ll know how to setup tests, extract lineage throughout your data stack, and triage data quality alerts.More details coming soon!

Check the slides here: https://docs.google.com/presentation/d/15dQJIGeGhG0WGO6MLXtxWhmf8neY-u0c8ZLRG9GJB-s/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

dbt and MDS in small-batch academic research: a working example

2022-10-25 · dbt Coalesce 2022 Watch

video

by Šimon Podhajský (iLife Technologies)

Analytics Analytics Engineering dbt

Academia/open science is an as-yet untapped market for analytics engineering, as well as one that could majorly benefit from the tight coupling of data transformation and software engineering best practices. But introducing dbt into this context comes with its own set of challenges. In this session, Šimon Podhajský (iLife Technologies), explains what’s slowing progress here,, and what academics can do to progress this work.

Check the slides here: https://docs.google.com/presentation/d/1aw_cs6V0n-oT9Lp7Vq3MNcRbthEFJEYwcvkCBfuzlR0/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Demystifying event streams: Transforming events into tables with dbt

2022-10-25 · dbt Coalesce 2022 Watch

video

by Charlie Summers (Merit)

Analytics Data Quality dbt

Pulling data directly out of application databases is commonplace in the MDS, but also risky. Apps change quickly, and application teams might update database schemas in unexpected ways, leading to pipeline failures, data quality issues, data delivery slow-downs. There is a better way. In his session, Charlie Summers (Merit) describes how their organization transforms application event streams into analytics-ready tables, more resilient to event scheme changes.

Check the slides here:https://docs.google.com/presentation/d/1K5PcoVshiHKZs_xI3K4P5JRNYTkbmnQJPMl8NmBlGfo/edit

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Operational AI for the Modern Data Stack

2022-10-25 · dbt Coalesce 2022 Watch

video

by Tristan Zajonc (Continual)

AI/ML dbt MLOps

The opportunities for AI and machine learning are everywhere in modern businesses, but today's MLOps ecosystem is drowning in complexity. In this talk, we'll show how to use dbt and Continual to scale operational AI — from customer churn predictions to inventory forecasts — without complex engineering or operational burden.

Check the slides here: https://docs.google.com/presentation/d/1vNcQxCjAK4xZVZC1ZHzqBzPiJE7uwhDIVWGeT9Poi1U/edit#slide=id.g15b1f544dd5_0_1500

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Preparing for the Next Wave: Data Apps

2022-10-25 · dbt Coalesce 2022 Watch

video

by Kevin Marr (Firebolt) , Jay Rajendran (Firebolt)

Analytics Analytics Engineering Cloud Computing Data Modelling

Data apps are the next wave in analytics engineering. The explosion of data volume and variety combined with an increasing demand for analytics by consumers, and a leap in cloud data technologies triggered an evolution of traditional analytics into the realms of modern data apps. Question is: How do you prepare for this wave? In this session we’ll explore real-world examples of modern data apps, and how the modern data stack is advancing to support sub-second and high concurrency analytics to meet the new wave of demand. We will cover: performance challenges, semi-structured data, data freshness, data modeling and toolsets.

Check the slides here: https://docs.google.com/presentation/d/1MC18SgT_ZHOJePjYizz_WT7dVveaycNw/edit?usp=sharing&ouid=110293204340061069659&rtpof=true&sd=true

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

The modern data team

2022-10-25 · dbt Coalesce 2022 Watch

video

by Abhi Sivasailam (Flexport)

Analytics Data Contracts

The "socio" is inseparable from the "technical". In fact, technological change often begets social and organizational change.

And in the data space, the technical changes that some now refer to as the "modern data stack" call for changes in how teams work with data, and in turn how data specialists work within those teams. Enter the Modern Data Team.

In this talk, Abhi Sivasailam will unpack the changing landscape of data roles and teams and what this looks like in action at Flexport. Come learn how Flexport approaches data contracts, management, and governance, and the central role that Analytics Engineers and Product Analysts play in these processes.

Check the slides here: https://docs.google.com/presentation/d/1Sgm3J6EkeKQf5D1MKopsLLAMOhAZ05CxDlei2mbDE90/edit#slide=id.g16424dcc8d3_0_1145

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

When the Real World Messes with Your Schedule: Event Driven Dbt Models for the MDS

2022-10-25 · dbt Coalesce 2022 Watch

video

dbt

The real world is unreliable. Planes take off late, trains leave early, and cars break down. Sometimes, we need to get data from a source without a standard connector. Sometimes, a schedule really doesn't cut it. In this talk, we'll build a pipeline that responds to events to ensure that data is delivered quickly and reliably. We'll also ensure it can handle failure and keep bad data from clogging the plumbing.

Check the slides here: https://docs.google.com/presentation/d/1W9p7H4l0fUr7iAJ3GxEGUTmWGtmc_iu02N-MKb2BSFM/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.