talk-data.com talk-data.com

Event

The Analytics Engineering Podcast

2021-07-01 – 2025-11-23 Podcasts Visit website ↗

Activities tracked

49

Tristan Handy has been curating the Analytics Engineering Roundup newsletter since 2015, pulling together the internet's best data science & analytics articles.

Tristan and co-host Julia Schottenstein now bring the Roundup to real life, hosting biweekly conversations with data practitioners inventing the future of analytics engineering.

You can view full episode summaries and read back issues of the Roundup newsletter at https://roundup.getdbt.com.

The podcast is sponsored by dbt labs, makers of the data transformation framework dbt. To reach our team, drop a note to [email protected].

Filtering by: Tristan ×

Sessions & talks

Showing 26–49 of 49 · Newest first

Search within this event →

What Does Apache Arrow Unlock for Analytics? (w/ Wes McKinney)

2023-01-06 Listen
podcast_episode

Wes McKinney is the creator of pandas, co-creator of Apache Arrow, and now Co-founder/CTO at Voltron Data. In this conversation with Tristan and Julia, Wes takes us on a tour of the underlying guts, from hardware to data formats, of the data ecosystem. What innovations, down to the hardware level, will stack to lead to significantly better performance for analytics workloads in the coming years? To dig deeper on the Apache Arrow ecosystem, check out replays from their recent conference at https://thedatathread.com. For full show notes and to read 7+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Minimum Viable Experimentation

2022-12-16 Listen
podcast_episode
Tristan , Julia , Vijaye Raji (Statsig) , Sean Taylor (Motif Analytics)

Product experimentation is full of potholes for companies of any size, given the number of pieces (tooling, culture, process, persistence) that need to come together to be successful. Vijaye Raji (currently Statsig, formerly Facebook + Microsoft) and Sean Taylor (currently Motif Analytics, formerly Facebook + Lyft) have navigated these failure modes, and are here to help you (hopefully) do the same. This convo with Tristan + Julia is light on tooling + heavy on process: how to watch out for spillover effects in experiments, avoiding bias, how to run an experiment review, and why experiment throughput is a better indicator of success than individual experiment results. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

The Data Generalist's Vision Quest (LIVE w/ Stephen Bailey)

2022-12-02 Listen
podcast_episode
Stephen Bailey (Immuta) , Tristan

The first LIVE IRL episode!   Stephen Bailey, data engineer at Whatnot and writer of an incredibly entertaining data substack, joins Tristan for a follow-up conversation to Stephen's Coalesce talk, "Excel at nothing: how to be an effective generalist." You can read Stephen's writing at https://stkbailey.substack.com/. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

How Does Data Drive Growth in Practice? (w/ Abhi Sivasailam)

2022-11-04 Listen
podcast_episode
Tristan , Julia , Abhi Sivasailam (Flexport)

Abhi is a growth and data leader, and an excellent Twitter follow. Most recently, he was Head of Growth and Analytics at Flexport, where he helped the company to grow 10x over the past 3 years. Previously, Abhi led growth and data teams at Keap, Hustle, and Honeybook. In this conversation with Tristan and Julia, Abhi explains his methodology for setting up a new growth data organization, and how you might be falling victim to the dreaded "arbitrary uniqueness" bug. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs

Katie Bauer: Data Scientists Are Not Pizza

2022-07-29 Listen
podcast_episode

Katie was a founding member of Reddit's data science team and, currently, as Twitter's Data Science Manager, she leads the company's infrastructure data science and analytics organization. In this conversation with Tristan and Julia, Katie explores how, as a manager, to help data people (especially those new to the field!) do their best work. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Data Activation Everywhere (w/ Julie Beynon of Clearbit)

2022-07-15 Listen
podcast_episode
Tristan , Julia , Julie Beynon (Clearbit)

As Head of Analytics at Clearbit, Julie serves as a data team of one in a 200+ person company (wow!). In this conversation with Tristan and Julia, Julie dives into how she's helped Clearbit implement data activation throughout the business, and realize the glorious dream of self-serve analytics. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

The Personal Data Warehouse (w/ Jordan Tigani of MotherDuck)

2022-07-01 Listen
podcast_episode
Tristan , Julia , Jordan Tigani (Motherduck)

Jordan Tigani is an expert in large-scale data processing, having spent a decade+ in the development and growth of BigQuery, and later SingleStore. Today, Jordan and his team at MotherDuck are in the early days of working on commercial applications for the open source DuckDB OLAP database. In this conversation with Tristan and Julia, Jordan dives into the origin story of BigQuery, why he thinks we should do away with the concept of working in files, and how truly performant "data apps" will require bringing data to an end user's machine (rather than requiring them to query a warehouse directly).

Building an Open Source Company (w/ Aaron Katz of ClickHouse)

2022-06-03 Listen
podcast_episode
Tristan , Julia , Aaron Katz (ClickHouse)

ClickHouse, the lightning-fast open source OLAP database, was initially released in 2016 as an open source project out of Yandex, the Russian search giant. In 2021, Aaron Katz helped form a group to spin it out of Yandex as an independent company, dedicated to the development + commercialization of the open source project. In this conversation with Tristan and Julia, Aaron gets into why he believes open source, independent software companies are the future. And of course, this conversation wouldn't be complete without a riff on the classic "one database to rule all workloads" thread. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

"To Move, or Not to Move" (Data). That is the Question.

2022-05-20 Listen
podcast_episode
Tristan , Justin Borgman (Starburst Data) , Julia

Justin Borgman is the co-founder, Chairman and CEO of Starburst, and has almost a decade spent in senior executive roles building new businesses in the data warehousing and analytics space.  In this conversation with Tristan and Julia, Justin dives into the nuts and bolts of Trino, the open source distributed query engine, and explores how teams are adopting a data mesh architecture without making a mess.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

What's The Role Of AI in BI?

2022-05-06 Listen
podcast_episode
Tristan , Julia , Amit Prakash (ThoughtSpot)

Amit Prakash is Co-founder and CTO at ThoughtSpot. He has a deep background in search, having previously led the AdSense engineering team at Google and served on the early Bing team at Microsoft. In this conversation with Tristan and Julia, Amit gets real about the promise of AI in data: which applications are being widely used today, and which are still a few years out? For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Automating Away Your Work w/ Configuration-as-Code (w/ Sarah Krasnik)

2022-04-22 Listen
podcast_episode
Tristan , Julia , Sarah Krasnik (Perpay)

Most recently leading a data engineering team at Perpay, Sarah has built and managed data platforms end to end by working closely with internal engineering, product, and operational teams. She recently left her role to pursue a wide variety of endeavors, including writing on her Substack (https://sarahsnewsletter.substack.com/). In this conversation with Tristan and Julia, Sarah dives into how configuration-as-code can automate away data work, why you might want to consider adding a data lake to your architecture, and how those looking to build a self-serve data culture can look to self-serve frozen yogurt shops for inspiration. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

The Hard Problems™️ of Data Observability w/ Kevin Hu of Metaplane

2022-04-08 Listen
podcast_episode
Tristan , Julia , Kevin Hu (Metaplane)

As a PhD candidate at MIT, Kevin (and friends) published Sherlock, a data type detection engine (a surprisingly bedeviling problem) for data cleaning + data discovery. Now as co-founder and CEO of Metaplane, a data observability startup, Kevin applies these same automated data discovery methods to help data teams keep their data healthy. In this conversation with Tristan & Julia, Kevin wins the coveted award for "most crystal-clear explanations of complex technical concepts through physics analogy."   For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.

Ashley Sherwood (AE @ Hubspot): Permissionless Innovation for Data Teams

2022-02-25 Listen
podcast_episode
Ashley Sherwood (HubSpot) , Tristan , Julia

Ashley is a Principal Analytics Engineer at Hubspot, and has helped lead their implementation of dbt. Ashley makes unique connections in her writing and work. On her Substack, "syntax error at or near ❤️," Ashley might be found comparing growing companies to butterflies, or going deep on how to accommodate sensitive people in the workplace. In this conversation with Tristan & Julia, Ashley dives into the nuts and bolts of her trajectory pushing data innovation forward at Hubspot. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

DeVaris Brown: Bringing Streaming Data to Analysts

2021-12-02 Listen
podcast_episode
Tristan , Julia , DeVaris Brown (Meroxa)

As a product leader at companies like Heroku and Zendesk, DeVaris specialized in building infrastructure-grade products. Currently, as the CEO of Meroxa, he enables teams to build real-time data infrastructure with the same ease as we now take for granted in batch. In this romp of an episode, Tristan, Julia and DeVaris flow from his experience in tech mentorship, into the nuts and bolts of Change Data Capture (CDC), and how streaming data infrastructure can help data teams provide better end user experiences. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

David Jayatillake: Should Great Data People Become Managers or Not?

2021-11-18 Listen
podcast_episode

David is Sr. Director of Data at Lyst, and as leader of their analytics + data science teams he has followed the evolution of data roles closely over the past decade. David spends a lot of time thinking about career progression + data team structure, and in this conversation with Tristan + Julia they dive into the classic individual contributor vs manager conundrum, migrating between warehouses, and reactive vs proactive data workflows. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Julien Le Dem: Why Data Lineage Matters

2021-11-04 Listen
podcast_episode

Julien has a unique history of building open frameworks that make data platforms interoperable. He's contributed in various ways to Apache Arrow, Apache Iceberg, Apache Parquet, and Marquez, and is currently leading OpenLineage, an open framework for data lineage collection and analysis. In this episode, Tristan & Julia dive into how open source projects grow to become standards, and why data lineage in particular is in need of an open standard. They also cover into some of the compelling use cases for this data lineage metadata, and where you might be able to deploy it in your work. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Seth Rosen: On Becoming a Full-stack Data Analyst

2021-10-07 Listen
podcast_episode
Tristan , Julia , Seth Rosen (HashPath)

Seth Rosen has broken data Twitter many times, and in his early-fatherhood sleep deprivation developed a wonderful Twitter persona as the battle-tested data analyst. IRL though Seth is a serious data practitioner, and as Founder at the data consultancy HashPath has helped dozens of companies get into the modern data stack + build public-facing data apps.  Now, as the founder of TopCoat, he's empowering analysts to build + publish those same public-facing data apps. In this episode, Tristan, Julia & Seth graciously dive into spicy debates around data mesh + "dashboard factories", and explore a future where data analysts become full-stack application developers. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com.  The Analytics Engineering Podcast is sponsored by dbt Labs.

Brittany Bennett: Training the Next Generation of 'Data for Good' Practitioners @ Sunrise Movement

2021-09-23 Listen
podcast_episode
Tristan , Julia , Brittany Bennett (Sunrise Movement)

Brittany Bennett is Data Director at Sunrise Movement, the youth climate movement that numbers tens of thousands of members throughout every US state.  Given how quickly our industry moves, developing junior data talent is hard, but Brittany's team at Sunrise makes it look easy. And that's no accident—because Sunrise hires for mission alignment rather than technical background, they dedicate significant resources to training + mentorship. In this conversation, Tristan, Julia & Brittany dive deep into the opportunity of developing junior data practitioners. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Caitlin Colgrove (CTO @ Hex): Notebooks for the Rest of Us

2021-09-09 Listen
podcast_episode

Caitlin Colgrove is Co-founder & CTO at Hex, a data workspace that allows teams to collaborate in both SQL and Python to publish interactive data apps. In this conversation, Tristan, Julia and Caitlin dive into the possibilities that real-time collaborative notebooks unlock for data teams — what if our collaboration style looked more like Google Docs than a Git workflow? For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Erik Bernhardsson: The missing tool in the data team's toolbox

2021-08-26 Listen
podcast_episode
Tristan , Julia , Erik Bernhardsson (Spotify; Better.com (former CTO))

Erik Bernhardsson spent six years at Spotify, where he contributed to the first version of the music recommendation system. After a stint as CTO at Better.com, he's now working on building new infrastructure tooling for data teams. In this wide-ranging conversation with Tristan & Julia, Erik dives into the nuts and bolts of Spotify's recommendation algorithm, (paradoxically) why you should rarely need to use ML, and the fundamental infrastructure challenges that drag down the productivity of data teams. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Co-Host Julia on the Hot Seat

2021-08-12 Listen
podcast_episode

In this episode, we're going to do something a little different, and turn the spotlight on co-host Julia Schottenstein. In this conversation with Tristan, you'll get to know Julia a bit—from her early childhood ambitions of becoming a "computer tycoon" (adorable!), to working in venture at NEA and now as a Product Manager at dbt Labs. They also dive into Julia's opinions on key trends shaping the future of the data industry (the phrase oligopoly makes an appearance). For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Brian Amadio: The Practice of Experimentation @ Stitch Fix

2021-07-29 Listen
podcast_episode
Tristan , Julia , Brian Amadio (Stitch Fix)

Brian Amadio is a Data Platform Engineer at Stitch Fix, where experimentation underpins everything they do across merchandising, planning, forecasting, operations and more.  In this conversation with Tristan, Julia, and Brian you'll get into the weeds of executing multi-armed bandit experiments and learn how you can perform experiments even with limited data.  For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Venkat Venkataramani: The Future is Real-time

2021-07-15 Listen
podcast_episode

Step with Venkat into a world where data is always fresh, queries run in 1ms, and analytics engineers build web-scale, real-time data apps. As Engineering Director at Facebook, Venkat helped build the RocksDB real-time database that powered growth to 5 billion queries per second(!)—and now with his colleagues at Rockset, he's bringing that real-time database infrastructure to the rest of us. In this conversation, Tristan, Julia and Venkat explore the fundamental technological advances that are empowering analytics engineers to enter the real-time future. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Robert Chang: Building the Minerva Metrics Store @ Airbnb

2021-07-01 Listen
podcast_episode
Tristan , Julia , Robert Chang (Airbnb)

Robert Chang is a product manager for the data platform at Airbnb, where he helped build and roll out Minerva, Airbnb's internal metrics store. They use Minerva to track over 12,000(!) metrics and 4,000(!) dimensions with consistency across the organization. In this conversation with Tristan and Julia, Robert dives into why they built it, what it took to get it done—and crucially, what you should do if your company doesn't have the resources to build your own internal metrics store. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.