talk-data.com

Topic

Iceberg

Apache Iceberg

table_format data_lake schema_evolution file_format storage open_table_format



Top Events

Data Engineering Podcast: 65
Data + AI Summit 2025: 23
Big Data LDN 2025: 13
dbt Coalesce 2025: 9
O'Reilly Data Engineering Books: 9
Databricks DATA + AI Summit 2023: 6
Big Data & AI Paris 2025: 5
AWS re:Invent 2024: 5
Snowflake World Tour Berlin: 5
Google Cloud Next '25: 4
The Analytics Engineering Podcast: 4
Big Data LDN 2024: 4

Top Speakers

Tobias Macey: 65
Yingjun Wu (RisingWave Labs): 5
Tom Scott (Streambased): 5
Tristan Handy (dbt Labs): 4
Ryan Blue (Tabular): 4
Adi Polak (Treeverse): 3
Dipti Borkar (Microsoft): 3
Alex Merced (Dremio): 3
Holly Smith (Databricks): 3
Julien Le Dem (Astronomer): 3
Jean-Baptiste Onofre (Apache Software Foundation): 2
Melvyn Peignon (ClickHouse): 2

Activities

Showing filtered results


Filtering by: Tristan Handy

Under the hood of Apache Iceberg (w/ Christian Thiel)

2025-08-24 · The Analytics Engineering Podcast

podcast_episode

by Christian Thiel (Lakekeeper) , Tristan Handy (dbt Labs)

Analytics Analytics Engineering dbt

Tristan digs deep into the world of Apache Iceberg. There's a lot happening beneath the surface: multiple catalog interfaces, evolving REST specs, and competing implementations across open source, proprietary, and academic contexts. Christian Thiel, co-founder of Lakekeeper, one of the most widely used Iceberg catalogs, joins to walk through the state of the Iceberg ecosystem. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

How Amazon S3 works (w/ Andy Warfield)

2025-07-20 · The Analytics Engineering Podcast

podcast_episode

by Tristan Handy (dbt Labs) , Andy Warfield (Amazon)

Analytics Analytics Engineering AWS Cloud Computing dbt S3

In this season of the Analytics Engineering podcast, Tristan is deep into the world of developer tools and databases. If you're following us here, you've almost definitely used Amazon S3 and its blob storage siblings. They form the foundation for nearly all data work in the cloud. In many ways, the innovations that happened inside S3 unlocked all of the progress in cloud data over the last decade. In this episode, Tristan talks with Andy Warfield, VP and senior principal engineer at AWS, where he focuses primarily on storage. They go deep on S3, how it works, and what it unlocks. They close out talking about Iceberg, S3 table buckets, and what this all suggests about the outlines of the S3 product roadmap moving forward. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Data engineering at Snowflake (w/ Rahul Jain)

2025-01-12 · The Analytics Engineering Podcast

podcast_episode

by Rahul Jain (Mentoring Club) , Tristan Handy (dbt Labs)

Analytics Analytics Engineering Data Engineering dbt Snowflake Data Streaming

A look inside the data work happening at a company making some of the most advanced technologies in the industry. Rahul Jain, data engineering manager at Snowflake, joins Tristan to discuss Iceberg, streaming, and all things Snowflake. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.

Julien Le Dem: Why Data Lineage Matters

2021-11-04 · The Analytics Engineering Podcast

podcast_episode

by Tristan Handy (dbt Labs) , Julia Schottenstein (dbt Labs) , Julien Le Dem (Astronomer)

Analytics Analytics Engineering Arrow dbt Parquet

Julien has a unique history of building open frameworks that make data platforms interoperable. He's contributed in various ways to Apache Arrow, Apache Iceberg, Apache Parquet, and Marquez, and is currently leading OpenLineage, an open framework for data lineage collection and analysis. In this episode, Tristan & Julia dive into how open source projects grow to become standards, and why data lineage in particular is in need of an open standard. They also cover some of the compelling use cases for this data lineage metadata, and where you might be able to deploy it in your work. For full show notes and to read 6+ years of back issues of the podcast's companion newsletter, head to https://roundup.getdbt.com. The Analytics Engineering Podcast is sponsored by dbt Labs.