talk-data.com talk-data.com

Topic

Analytics

data_analysis insights metrics

4552

tagged

Activity Trend

398 peak/qtr
2020-Q1 2026-Q1

Activities

4552 activities · Newest first

I don’t build models, I construct knowledge

In this talk, Marielle Dado (Paddle) will introduce the concept of knowledge construction (a theory that featured heavily in their previous life as a learning psychologist/cognitive scientist), why they think analytics engineers should see themselves as "knowledge creators," and how this framing can lead to less ad-hoc querying and more robust data models.

Check the slides here: https://docs.google.com/presentation/d/1aMwNiFd7B5pArOP8Nc_jfHCLtNvKimaUJFUuMMPQC04/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Move Fast and (Don’t) Break Things: Testing dbt Models with Bigeye

Growing demand for data puts Analytics Engineers under pressure to move quickly, without introducing bugs to their models. If you find yourself pushing dbt’s testing framework to the limit, or waiting for careful code reviews before deploying to production, this talk is for you. Join Egor Gryaznov, CTO of Bigeye and former Staff Data Engineer at Uber, to learn how to take your workflow to the next level with blue-green deployments. This talk will cover: -CI/CD pipeline best practices for Analytics Engineers -Automatically validating pipeline changes with blue-green deployments -Tracking data changes through merge requests for visibility and reliability More details coming soon!

Check the slides here: https://docs.google.com/presentation/d/1zkzKkhjm1WshCMSOksfi24BEGK62wrLOGKKEg0I0Yhg/edit#slide=id.g11c331d5e28_0_6192

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Beyond pretty graphs: How end-to-end lineage drives better actions

Everyone is talking about data lineage these days, and for a good reason. Data lineage helps ensure better data quality across your modern data stack. But not everyone speaks the same lineage language. Data engineers use lineage for impact and root cause analysis. Analysts and Analytics engineers use lineage to trace jobs and transformations in their warehouses. And consumers use lineage to understand why data never reached their expected destination. This results in a narrow, siloed view lineage in which only one group benefits. It’s time to stop using siloed lineage views for pretty graphs and start using end-to-end lineage to drive focused actions. In the talk, you will learn:

• How data quality tailors to specific needs of data engineers, analysts, & consumers

• How data lineage should drive actions

• A real-world example of end-to-end data lineage with Airflow, dbt, Spark, and Redshift

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

How to leverage dbt Community as the first & ONLY data hire to survive

As data science and machine learning adoption grew over the last few years, Python moved up the ranks catching up to SQL in popularity in the world of data processing. SQL and Python are both powerful on their own, but their value in modern analytics is highest when they work together. This was a key motivator for us at Snowflake to build Snowpark for Python: to help modern analytics, data engineering, and data science teams generate insights without complex infrastructure management for separate languages.

Join this session to learn more about how dbt's new support for Python-based models and Snowpark for Python can help polyglot data teams get more value from their data through secure, efficient and performant metrics stores, feature stores, or data factories in the Data Cloud.

Check the slides here: https://docs.google.com/presentation/d/1xJEyfg81azw2hVilhGZ5BptnAQo8q1L7aDLGrnSYoUM/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

SQL: The video game

Do you enjoy waking up in the morning and playing the daily Wordle puzzle? Have you been wishing there was a similar game for you to play, but was built specifically for data folks? Well, you are in luck!

Join Joe Markiewicz (analytics engineer by day, video game maker by night) as he explains his inspiration and how he leveraged dbt and BigQuery to create a new video game aimed at helping experienced analysts keep their SQL skills sharp and data newcomers increase their SQL literacy.

Check the slides here: https://docs.google.com/presentation/d/1C1qUZEcpfBa6oA_CTHGx3GR1WLW3s2adVXnX2BRKVWA/edit

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Data as a Product: How dbt powers Slido’s Data API

What do you do when you find out that your team is being tasked with building a single platform that should be able to serve everyone's data needs, no matter whether they are internal (from within your company) or external (your customers)? What's more it's expected to be fast, stable, granular, sophisticated, simple, scalable, usable, easy to maintain, compatible… the list goes on.

Well, time to find a new-school solution. We'll walk you through our story of how and why we built Slido's dataAPI using everyone's favourite Analytics Engineering tool, dbt.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Back to the Future: Where Dimensional Modeling Enters the Modern Data Stack

dbt’s powerful capabilities allow data teams to deliver data products and analytics solutions to solve business problems faster than ever. Yet still, even with the best modern technologies, challenges arise. How can you be certain what your building will stand up to changing requirements? How can you connect disparate parts of your business to derive new insights? The answer may be a blast from the past—but the fundamentals never change. Learn how to apply fundamental techniques—like dimensional modeling—to modern tools, helping you to build scalable and reusable solutions to solve data problems today, and in the future.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Beyond the buzz: 20 real metadata use cases in 20 minutes with Atlan and dbt Labs

for a few use cases like static and passive data catalogs. However, active metadata can be the key to unlock a variety of use cases, acting as the glue that binds together our diverse modern data stacks (e.g. dbt, Snowflake, Fivetran, Databricks, Looker, and Tableau) and diverse teams (e.g. analytics engineers, data analysts, data engineers, and business users)! At Atlan, we’ve worked closely with modern data teams like WeWork, Plaid, PayU, SnapCommerce, and Bestow. In this session, we’ll lay out all our learnings about how real-life data teams are using metadata to drive powerful use cases like column-level lineage, programmatic governance, root cause analysis, proactive upstream alerts, dynamic pipeline optimization, cost optimization, data deprecation, automated quality control, metrics management, and more. P.S. We’ll also reveal how active metadata and the dbt Semantic Layer can work together to transform the way your team works with metrics!

Check the slides here: https://docs.google.com/presentation/d/1xrC9yhHOQ00qWt-gVlgbakRELg2FzEPt-RwMsUWzdZA/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Build an Open Lakehouse with dbt Labs and Dremio

Data teams are tasked with integrating a growing number of data sources, and enabling broad, self-service access to a consistent and unified view of that data to a growing number of technical and non-technical data consumers for analytics. In this session, learn how dbt and the Dremio open lakehouse platform work together to simplify data architectures, unify data sources, and get insights into the hands of data consumers fast, and how the new connector delivers a seamless user experience across platforms.

Check the slides here: https://docs.google.com/presentation/d/1ovzCrr1DnPF0n0JMVnPrceAcOZHSyD_aCaayjK8oISo/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

dbt Labs and Databricks: best practices and future roadmap

The Databricks Lakehouse Platform unifies the best of data warehouses and data lakes in one simple platform to handle all your data, analytics and AI use cases. Databricks now includes complete support for dbt Core and dbt Cloud and you will hear from Conde Nast using dbt and Databricks together to democratize insights. We will also share best practices for developing and productionizing dbt projects containing SQL and Python, governing data with standard SQL, and exciting features on our roadmap such as materialized views for Databricks SQL.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Engineering your analytics career path

When Phoenix Jay (K Health) started her Master's in Applied Analytics, it was with the assumption that she’d quickly land a gig in data science. But by the time she graduated, the field had changed—as had every Data Scientist job posting. This shift brought uncertainty, but also opportunity. Learn how she adjusted her path to begin applying “classical” data science paradigms to analytics.

Check the slides here: https://docs.google.com/presentation/d/19DviXCiciRh3dq-5cSTXkPqA7CppUUKAsfpm6dmn0l8/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Excel at nothing: How to be an effective generalist

"They’re looking for someone technical, but business-oriented.” Says the newly minted analytics engineer—quietly content in her ability to be everything to everyone. But "purple" work can still be perilous: roadmap uncertainty, stakeholder confusion, an unending list of new things to learn... In practice, becoming an efficient generalist requires a specific skill set, one not often taught in schools. In this talk, Stephen Bailey (Whatnot) shares how to navigate the most complex, but rewarding bits of interdisciplinary data work.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Field-level lineage with dbt, ANTLR, and Snowflake

Lineage is a critical component of any root cause, impact analysis, and overall analytics heath assessment workflow. But it hasn’t always been easy to create, particularly at the field level. In this session, Mei Tao, Helena Munoz, and Xuanzi Han (Monte Carlo) tackle this challenge head-on by leveraging some of the most popular tools in the modern data stack, including dbt, Airflow, Snowflake, and ANother Tool for Language Recognition (ANTLR). Learn how they designed the data model, query parser, and larger database design for field-level lineage—highlighting learnings, wrong turns, and best practices developed along the way.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

How insensitive: Increasing analytics capacity through empathy

Insensatez in Portuguese means folly, unwisdom. In other words, doing something you deep down know you shouldn't. Data practitioners don't always have the right incentives or organizational influence to advocate for the solutions that they believe are best fit for the problems they are solving. Join Felippe Felisola Caso (Loft) as he bravely wades into the awkward and often unspoken feelings between leaders and analysts. Through an exploration of organizational feelings and perspectives, attendees can expect to walk away with a greater capacity for empathy, as well as proven strategies for improving communication and building alignment.

Check the slides here: https://docs.google.com/presentation/d/1TKtveMXtFBoCALp_pb7bO9ggV5KU_xeeZRhFSS72OJw/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Introducing dbt with Databricks

In this live, instructor-led hands-on lab, you’ll learn how to build a modern data stack with Databricks and dbt, using dbt to manage data transformations in Databricks and perform exploratory data analysis on the clean data sets using Databricks SQL. Based on the lakehouse architecture and built on an open data lake, data analysts, analytics engineers, and data scientists can use dbt and Databricks to work with the freshest and most complete data, and quickly derive new insights for accurate decision-making.

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Money, Python, and the Holy Grail: Designing Operational Data Models

Most analysts don’t become analysts to build dashboards. We don’t become analysts to do data pulls, or clean up messy data, or put together pitch decks. We become analysts to do impactful, strategic analysis. This is our calling; it’s the most valuable work that we do; and it’s why we put up with the rest of our job—for that afternoon with nothing but a big question, a clear calendar, and a trajectory-changing aha moment buried somewhere in our well-prepped datasets.

But the rapid rise of analytics engineering should make us question all of this. Is strategic analysis actually the holy grail of analytics? Is it the most valuable thing we could do? Is it even what we want to do?

In chasing this ambition, Benn Stancil (Mode) thinks we’ve lost sight of something even more important—and potentially, more interesting: Designing operational models. These frameworks, which are a natural extension of the semantic models built by analytics engineers, are often more valuable than any dashboard, any dataset, or any deep dive analysis.

In his talk, Benn will share what these models are, why they’re valuable, and why, in our eternal quest to both quantify our value and to find work we love, they could prove to be our holy grail we’ve always been looking for.

Check the slides here: https://docs.google.com/presentation/d/1lOH6Sb8DQnnlmZkYOlqqHgQeXKkUEQCm_LOxsjBRJlM/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Moving to predictive: How to assemble the beginnings of your feature store with Snowflake & dbt Labs

Historically, analytics has been focused on "what happened." And to this day, newer and newer generations of tooling, dbt for example, have come forth accelerating the speed and utility of data in an enterprise for decision making. Machine learning, on the other hand (the "what will happen"), has seemingly been stood up as a separate silo with an organization with seemingly "more intricate" technical requirements, the need for "data scientist", and done so all in the name of how to handle "more special" data resulting in "more accurate" decision making. In this session, you will learn how to cut through the noise and extend and leverage your analytic practice with Snowflake and dbt Labs into the realm of machine learning by pairing your analytical pipelines with a feature store layer to declaratively serve both model training and model scoring scenarios, even at some of the lowest latency (real-time) production requirements.

This session requires pre-registration. Sign up here. If session is filled you are welcome to come to the room and join the waitlist onsite. Open seats will be made available 10 minutes after session start.

Check the slides here: https://docs.google.com/presentation/d/1H-aPsc2DkPGcJUV4pd_iLSdMMMAMZTEtsAFO4hpglXM/edit#slide=id.g15a4510fa6a_0_560

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Nobody puts metrics in a corner: How to activate your dbt models

dbt has changed the game for data practitioners, bringing velocity, organizational efficiency, and increased trust to modern analytics workflows. But what happens to your dbt models after they’ve been built? Too often the value you create goes untapped by the business, or accessed only by a select few. dbt Labs and ThoughtSpot are teaming up to unleash the true potential of your transformed data. Learn how to deliver trustworthy data and insights to frontline users at scale with safe, reliable self-service analytics.

Check the slides here: https://docs.google.com/presentation/d/1XFyP8d0wkfnA1_59zb1zbNYInSSzdI2qe09f6OkhRE0/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

Outgrowing a single `dbt run`

When does your team decided it’s time to move beyond a singular dbt run? For most analytics engineers, there comes a time when the dbt run commands on fixed schedules simply won’t make the cut. Join Prratek Ramchandani (Vox Media) as he breaks down an alternative approach to orchestrating your dbt project with Dagster that balances meeting SLAs with safely handling the edge cases a simple schedule-based dbt run might create.

Check the slides here: https://docs.google.com/presentation/d/1zivYO_EpN6T9JYM9HjAJAz3bK3e2TREwdKffylkzuUw/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.

The accidental analytics engineer

There’s a good chance you’re an analytics engineer who just sort of landed in an analytics engineering career. Or made a murky transition from data science/data engineering/software engineering to full-time analytics person. When did you realize you fell into the wild world of analytics engineering?

In this session, Michael Chow (RStudio) draws upon his experience building open source data science tools and working with the data science community to discuss the early signs of a budding analytics engineer, and the small steps these folks can take to keep the best parts of Python and R, all while moving towards engineering best practices.

Check the slides here: https://docs.google.com/presentation/d/1H2fVa-I4D8ibanlqLutIrwPOVypIlXVzEITDUNzzPpU/edit?usp=sharing

Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/.