Join us for an in-depth Ask Me Anything (AMA) on how Rust is revolutionizing Lakehouse formats like Delta Lake and Apache Iceberg through projects like delta-rs and iceberg-rs! Discover how Rust’s memory safety, zero-cost abstractions and fearless concurrency unlock faster development and higher-performance data operations. Whether you’re a data engineer, Rustacean or Lakehouse enthusiast, bring your questions on how Rust is shaping the future of open table formats!
talk-data.com
Speaker
Tyler Croy
2
talks
At Scribd Tyler created the delta-rs project and has presented a number of times on Scribd's large scale data and ML workloads combining Rust, Delta Lake, and the Databricks platform. He has written extensively on using Delta Lake from Python and Rust, joining the talented authors of the award-winning #1 best selling book "Delta Lake: The Definitive Guide", by contributing Chapter 6. Tyler is also a Databricks MVP and a generally tall guy.
Bio from: Data + AI Summit 2025
Filter by Event / Source
Talks & appearances
2 activities · Newest first
Delta Lake is a fantastic technology for quickly querying massive data sets, but first you need those massive data sets! In this session we will dive into the cloud-native architecture Scribd has adopted to ingest data from AWS Aurora, SQS, Kinesis Data Firehose and more. By using off-the-shelf open source tools like kafka-delta-ingest, oxbow and Airbyte, Scribd has redefined its ingestion architecture to be more event-driven, reliable, and most importantly: cheaper. No jobs needed! Attendees will learn how to use third-party tools in concert with a Databricks and Unity Catalog environment to provide a highly efficient and available data platform. This architecture will be presented in the context of AWS but can be adapted for Azure, Google Cloud Platform or even on-premise environments.