Speaker

Tyler Croy

Activities

2

talks

Valued Employee Scribd, Inc.

At Scribd Tyler created the delta-rs project and has presented a number of times on Scribd's large scale data and ML workloads combining Rust, Delta Lake, and the Databricks platform. He has written extensively on using Delta Lake from Python and Rust, joining the talented authors of the award-winning #1 best selling book "Delta Lake: The Definitive Guide", by contributing Chapter 6. Tyler is also a Databricks MVP and a generally tall guy.

Bio from: Data + AI Summit 2025

Filter by Event / Source

Data + AI Summit 2025 2

Talks & appearances

2 activities · Newest first

Search activities →

Rust and Lakehouse Format — Ask Us Anything

2025-06-12 · Data + AI Summit 2025

lightning_talk

with Robert Pack (Databricks) , Denny Lee (Databricks) , Tyler Croy (Scribd, Inc.)

Data Lakehouse Delta Iceberg Rust

Join us for an in-depth Ask Me Anything (AMA) on how Rust is revolutionizing Lakehouse formats like Delta Lake and Apache Iceberg through projects like delta-rs and iceberg-rs! Discover how Rust’s memory safety, zero-cost abstractions and fearless concurrency unlock faster development and higher-performance data operations. Whether you’re a data engineer, Rustacean or Lakehouse enthusiast, bring your questions on how Rust is shaping the future of open table formats!

Let's Save Tons of Money With Cloud-Native Data Ingestion!

2025-06-10 · Data + AI Summit 2025 Watch

talk

Airbyte AWS Aurora Kinesis Azure Cloud Computing

Delta Lake is a fantastic technology for quickly querying massive data sets, but first you need those massive data sets! In this session we will dive into the cloud-native architecture Scribd has adopted to ingest data from AWS Aurora, SQS, Kinesis Data Firehose and more. By using off-the-shelf open source tools like kafka-delta-ingest, oxbow and Airbyte, Scribd has redefined its ingestion architecture to be more event-driven, reliable, and most importantly: cheaper. No jobs needed! Attendees will learn how to use third-party tools in concert with a Databricks and Unity Catalog environment to provide a highly efficient and available data platform. This architecture will be presented in the context of AWS but can be adapted for Azure, Google Cloud Platform or even on-premise environments.