talk-data.com

Event

The Analytics Engineering Podcast

2021-07-01 – 2025-11-23 Podcasts

Activities tracked

77

Tristan Handy has been curating the Analytics Engineering Roundup newsletter since 2015, pulling together the internet's best data science & analytics articles.

Tristan and co-host Julia Schottenstein now bring the Roundup to real life, hosting biweekly conversations with data practitioners inventing the future of analytics engineering.

You can view full episode summaries and read back issues of the Roundup newsletter at https://roundup.getdbt.com.

The podcast is sponsored by dbt Labs, makers of the data transformation framework dbt. To reach our team, drop a note to [email protected].

Filtering by: dbt

Sessions & talks

Showing 26–50 of 77 · Newest first

The End of the Modern Data Stack (w/ Benn Stancil, Mode)

2024-02-25
podcast_episode
Benn Stancil (ThoughtSpot)

Benn Stancil, cofounder and CTO at Mode, returns to The Analytics Engineering Podcast to discuss the evolution of the term "modern data stack" and its value today. Tristan wrote on this idea for The Analytics Engineering Roundup in "Is the Modern Data Stack Still a Useful Idea?" For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Data Mesh Architecture at Large Enterprises (w/ Moritz Heimpel and Ben Flusberg)

2023-12-08
podcast_episode
Ben Flusberg (Cox Automotive) , Moritz Heimpel (Siemens)

Moritz Heimpel from Siemens and Ben Flusberg from Cox Automotive have very similar jobs. They both act as stewards of the data strategies at large, complex companies. In this episode, we get into what it's like to collaborate with data at scale. Ben and Moritz share their experiences adopting a data mesh architecture and what that looks like at their organizations. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Let's Talk About Data Vault (w/ Brandon Taylor and Michael Olschimke)

2023-11-17
podcast_episode
Michael Olschimke (Scalefree) , Tristan , Julia , Brandon Taylor (Guild)

If Data Vault is a new term for you, it's a data modeling design pattern. We're joined by Brandon Taylor, a senior data architect at Guild, and Michael Olschimke, who is the CEO of Scalefree—the consulting firm whose co-founder Dan Lindstedt is credited as the designer of the data vault architecture.  In this conversation with Tristan and Julia, Michael and Brandon explore the Data Vault approach among data warehouse design methodologies. They discuss Data Vault's adoption in Europe, its alignment with data mesh architecture, and the ongoing debate over Data Vault vs. Kimball methods.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Navigating AI Complexity (w/ Jonathan Frankle)

2023-11-03
podcast_episode
Tristan , Jonathan Frankle (MosaicML) , Julia

Jonathan Frankle is the Chief Scientist at MosaicML, which was recently bought by Databricks for $1.3 billion.  MosaicML helps customers train generative AI models on their data. Lots of companies are excited about gen AI, and the hope is that their company data and information will be what sets them apart from the competition.  In this conversation with Tristan and Julia, Jonathan discusses a potential future where you can train specialized, purpose-built models, the future of MosaicML inside of Databricks, and the importance of responsible AI practices. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Career Growth in Data Roles (w/ Hubspot's Kasey Mazza at Coalesce 2023)

2023-10-20
podcast_episode
Tristan , Kasey Mazza (HubSpot)

In this conversation with Tristan recorded at Coalesce 2023, Kasey Mazza, an analytics engineering manager on the RevOps team at HubSpot, discusses the roles of data analysts and analytics engineers, the importance of building internal data communities, and the evolving landscape of data teams. Watch Kasey's Coalesce 2023 presentation, "The career growth software development lifecycle." For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Operationalizing Your Warehouse, Streaming Analytics, and Cereal (W/ Arjun Narayan of Materialize and Nathan Bean of General Mills)

2023-10-06
podcast_episode
Nathan Bean (General Mills) , Arjun Narayan

It turns out data plays a big role in getting cereal manufactured and delivered so you can enjoy your Cheerios reliably for breakfast. We talk with Arjun Narayan, CEO of Materialize, a company building an operational warehouse, and Nathan Bean, a data leader at General Mills responsible for all of the company's manufacturing analytics and insights.  We discuss Materialize's founding story, how streaming technology has matured, and how exactly companies are leveraging their warehouse to operationalize their business—in this case, at one of the largest consumer product companies in the United States.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Roche's Data Transformation Journey (w/ Yannick Misteli)

2023-09-22
podcast_episode
Yannick Misteli (Roche)

Yannick Misteli is the head of engineering for the go-to-market domain at Roche, a $250 billion multinational pharmaceutical and diagnostics company.  Roche was an early supporter of dbt Cloud, and Yannick helped move his team of 120+ engineers to a modern data stack. He always finds a way to push the boundaries to make a large company founded in 1896 incredibly modern and innovative. We wanted to know more about the "how" of the work—the people, process, and technology.  Read more about Roche's data journey here: https://docs.getdbt.com/blog/dbt-squared

Bring Your Own Data to LLMs (W/ Jerry Liu of LlamaIndex)

2023-08-25
podcast_episode
Jerry Liu (LlamaIndex)

Jerry Liu is the CEO and co-founder of LlamaIndex. LlamaIndex is an open-source framework that helps people prep their data for use with large language models in a process called retrieval augmented generation. LLMs are great decision engines, but in order for them to be useful for organizations, they need additional knowledge and context, and Jerry discusses how companies are bringing their data to tailor LLMs for their needs. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Ramp's $8 Billion Data Strategy (W/ Ian Macomber and Ryan Delgado)

2023-08-11
podcast_episode
Tristan , Ryan Delgado (Ramp) , Julia , Ian Macomber (Ramp)

Ian Macomber, head of analytics engineering and data science at Ramp and formerly the VP of analytics and data engineering at Drizly, and Ryan Delgado, a staff software engineer at Ramp, have played pivotal roles in establishing Ramp's data team from the ground up and are spearheading the development of their comprehensive roadmap. In this conversation with Tristan and Julia, Ian and Ryan share insights on how Ramp's data team transformed unstructured data from contracts into valuable insights to enable faster decision-making. The $8 billion company values speed and empowers teams to build, ship, and measure products quickly. Ian and Ryan also talked about their approach to adopting new tech and elevating data as an equal player alongside product engineering and design. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

dbt Labs on dbt (w/ Daniel Le)

2023-07-28
podcast_episode
Julia , Daniel Le (dbt Labs)

Daniel Le is the CFO at dbt Labs where he has built multiple teams. He is also the former head of FP&A and operations at Zoom, and he helped scale FP&A as the former finance director at Okta.  In this conversation with Julia, Daniel shares his view as CFO on the challenges SaaS companies face and the importance of finance teams creating a holistic view of their business. Daniel gives advice to data leaders about how they can automate business processes with dbt Cloud and use self-service analytics to automate revenue recognition, generate consistent headcount analytics, and more to impact their organization. Read more about Daniel's story here. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

The Arc of Data Innovation (w/ Bob Muglia, former CEO of Snowflake)

2023-07-12
podcast_episode
Tristan , Julia , Bob Muglia (Snowflake; Microsoft)

Bob Muglia likely needs no introduction. The former CEO of Snowflake led the company during its early, transformational years after a long career at Microsoft and Juniper.  Bob recently released the book The Datapreneurs about the arc of innovation in the data industry, starting with the first relational databases all the way to the present craze of LLMs and beyond. In this conversation with Tristan and Julia, Bob shares insights into the future of data engineering and its potential business impact while offering a glimpse into his professional journey.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

It's 2023, and Privacy Is Now Fun! (w/ Ian Coe of Tonic.ai + Abhishek Bhowmick of Samooha)

2023-04-21
podcast_episode
Ian Coe (Tonic.ai) , Abhishek Bhowmick (Samooha) , Tristan , Julia

Advances in ML have transformed data privacy from a regulatory necessity into an opportunity to improve the work of data people. Synthetic data for modeling + testing is one example of a hard thing that's now easy - and in this conversation with Tristan and Julia, Ian + Abhishek cover many other ways that privacy can actually be a skill that propels your work forward, rather than a mere legal best practice. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Cloud Warehouse Cost Optimization (w/ Niall Woodward + Brad Culberson)

2023-03-24
podcast_episode
Tristan , Niall Woodward (SELECT) , Julia , Brad Culberson (Snowflake, Field CTO's office)

Brad Culberson is a Principal Architect in the Field CTO's office at Snowflake. Niall Woodward is a co-founder of SELECT, a startup providing optimization and spend management software for Snowflake customers. In this conversation with Tristan and Julia, Brad and Niall discuss all things cost optimization: cloud vs on-prem, measuring ROI, and tactical ways to get more out of your budget. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

dbt Labs + Transform Join Forces on Metrics (w/ Nick Handel + Drew Banin)

2023-03-10
podcast_episode
Drew Banin (Fishtown Analytics / dbt Labs) , Tristan , Nick Handel

Nick Handel, as co-founder at Transform, helped develop the popular open source metrics framework MetricFlow. Drew Banin, a co-founder at dbt Labs, helped build the initial version of the dbt Semantic Layer, which launched last year.   Transform was acquired in February by dbt Labs, and in this conversation with Tristan, they talk through their collective plans for the future of the dbt Semantic Layer. For full show notes and to read 7+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

What Can Generative AI Do for Data People? (W/ Sarah Nagy + Chris Aberger)

2023-02-24
podcast_episode
Tristan , Chris Aberger (Numbers Station AI) , Julia , Sarah Nagy (Seek AI)

Sarah and Chris are both at the forefront of bringing the promise of gen AI to our actual work as data people—which is a unique challenge!  Precise truth is critical for business questions in a way that it's not for a consumer search query. Sarah Nagy is the CEO of Seek AI, a startup that aims to use natural language processing to change how professionals work with data. Chris Aberger currently leads Numbers Station AI, a startup focused on data-intensive workflow automation. In this conversation with Tristan and Julia, they dive into what this future might actually look like, and tangibly what we can expect from gen AI in the short/medium term. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

3rd Party Data, Demystified

2023-02-10
podcast_episode
Tristan , Auren Hoffman (SafeGraph) , Julia

Auren Hoffman currently serves as the CEO and Chief Historian at SafeGraph, a data-as-a-service company he founded, which provides primarily location data.  In this conversation with Tristan and Julia, Auren shares how truly few companies are making use of 3rd-party datasets today, how opening up more datasets to public research could help us solve big problems, and a fun fact about Abraham Lincoln's (!) work in the industry.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

A Romp Through Database History (w/ Postgres co-creator Mike Stonebraker + Andy Palmer)

2023-01-27
podcast_episode

Mike Stonebraker is a veritable database pioneer and a Turing Award recipient. In addition to teaching at MIT, he is a serial entrepreneur and co-creator of Postgres. Andy Palmer is a veteran business leader who serves as the CEO of Tamr, a company he co-founded with Mike. Through his seed fund Koa Labs, Andy has helped found and/or fund numerous innovative companies in diverse sectors, including health care, technology, and the life sciences.  In this conversation with Tristan and Julia, Mike and Andy take us through the evolution of database technology over 5+ decades. They share unique insights into relational databases, the switch from row-based to columnar databases, and some of the patterns of database adoption they see repeated over time. For full show notes and to read 7+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

What Does Apache Arrow Unlock for Analytics? (w/ Wes McKinney)

2023-01-06
podcast_episode

Wes McKinney is the creator of pandas, co-creator of Apache Arrow, and now Co-founder/CTO at Voltron Data. In this conversation with Tristan and Julia, Wes takes us on a tour of the underlying guts, from hardware to data formats, of the data ecosystem. What innovations, down to the hardware level, will stack to lead to significantly better performance for analytics workloads in the coming years? To dig deeper on the Apache Arrow ecosystem, check out replays from their recent conference at https://thedatathread.com. For full show notes and to read 7+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Minimum Viable Experimentation

2022-12-16
podcast_episode
Tristan , Julia , Vijaye Raji (Statsig) , Sean Taylor (Motif Analytics)

Product experimentation is full of potholes for companies of any size, given the number of pieces (tooling, culture, process, persistence) that need to come together to be successful. Vijaye Raji (currently Statsig, formerly Facebook + Microsoft) and Sean Taylor (currently Motif Analytics, formerly Facebook + Lyft) have navigated these failure modes, and are here to help you (hopefully) do the same. This convo with Tristan + Julia is light on tooling + heavy on process: how to watch out for spillover effects in experiments, avoiding bias, how to run an experiment review, and why experiment throughput is a better indicator of success than individual experiment results. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

The Data Generalist's Vision Quest (LIVE w/ Stephen Bailey)

2022-12-02
podcast_episode
Stephen Bailey (Immuta) , Tristan

The first LIVE IRL episode!   Stephen Bailey, data engineer at Whatnot and writer of an incredibly entertaining data substack, joins Tristan for a follow-up conversation to Stephen's Coalesce talk, "Excel at nothing: how to be an effective generalist." You can read Stephen's writing at https://stkbailey.substack.com/. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Why You'll Need Data Contracts (w/ Chad Sanderson + Prukalpa)

2022-11-18
podcast_episode
Chad Sanderson (Gable.ai) , Prukalpa Sankar (Atlan)

WARNING: This episode contains detailed discussion of data contracts. The modern data stack introduces challenges in terms of collaboration between data producers and consumers. How might we solve them to ultimately build trust in data quality? Chad Sanderson leads the data platform team at Convoy, a late-stage series-E freight technology startup. He manages everything from instrumentation and data ingestion to ETL, in addition to the metrics layer, experimentation software and ML.  Prukalpa Sankar is a co-founder of Atlan, where she develops products that enable improved collaboration between diverse users like businesses, analysts, and engineers, creating higher efficiency and agility in data projects.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

How Does Data Drive Growth in Practice? (w/ Abhi Sivasailam)

2022-11-04
podcast_episode
Tristan , Julia , Abhi Sivasailam (Flexport)

Abhi is a growth and data leader, and an excellent Twitter follow. Most recently, he was Head of Growth and Analytics at Flexport, where he helped the company grow 10x over the past 3 years. Previously, Abhi led growth and data teams at Keap, Hustle, and Honeybook. In this conversation with Tristan and Julia, Abhi explains his methodology for setting up a new growth data organization, and how you might be falling victim to the dreaded "arbitrary uniqueness" bug. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Katie Bauer: Data Scientists Are Not Pizza

2022-07-29
podcast_episode

Katie was a founding member of Reddit's data science team and, currently, as Twitter's Data Science Manager, she leads the company's infrastructure data science and analytics organization. In this conversation with Tristan and Julia, Katie explores how, as a manager, to help data people (especially those new to the field!) do their best work. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Data Activation Everywhere (w/ Julie Beynon of Clearbit)

2022-07-15
podcast_episode
Tristan , Julia , Julie Beynon (Clearbit)

As Head of Analytics at Clearbit, Julie serves as a data team of one in a 200+ person company (wow!). In this conversation with Tristan and Julia, Julie dives into how she's helped Clearbit implement data activation throughout the business, and realize the glorious dream of self-serve analytics. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Making Sense of the Last 2 Years in Data

2022-06-17
podcast_episode
Jennifer Li , Matt Bornstein (a16z) , Tristan Handy (dbt Labs) , Martin Casado (a16z)

Matt Bornstein and Jennifer Li (and their co-author Martin Casado) of a16z have compiled arguably the most nuanced diagram of the data ecosystem ever made.  They recently refreshed their classic 2020 post, "Emerging Architectures for Modern Data Infrastructure" and in this conversation, Tristan attempts to pin down: what does all of this innovation in tooling mean for data people + the work we're capable of doing? When will the glorious future come to our laptops? For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.