talk-data.com talk-data.com

T

Speaker

Tathagata Das

2

talks

Sr. Staff Software Engineer Databricks

Tathagata Das is a Staff Software Engineer at Databricks and has been one of the core developers of Apache Spark (especially Structured Streaming) and Delta Lake. He is a member of Apache Spark PMC, and a Delta Lake committer. He is also one of the authors of Learning Spark: Lighting-fast Data Analytics (2nd edition). Previously, he was a grad student in the UC Berkeley at AMPLab where he conducted research about data-center processing frameworks and networks with Scott Shenker and Ion Stoica.

Bio from: Databricks DATA + AI Summit 2023

Frequent Collaborators

Filtering by: Data + AI Summit 2025 ×

Filter by Event / Source

Talks & appearances

Showing 2 of 5 activities

Search activities →
Extending the Lakehouse: Power Interoperable Compute With Unity Catalog Open APIs

The lakehouse is built for storage flexibility, but what about compute? In this session, we’ll explore how Unity Catalog enables you to connect and govern multiple compute engines across your data ecosystem. With open APIs and support for the Iceberg REST Catalog, UC lets you extend access to engines like Trino, DuckDB, and Flink while maintaining centralized security, lineage, and interoperability. We will show how you can get started today working with engines like Apache Spark and Starburst to read and write to UC managed tables with some exciting demos. Learn how to bring flexibility to your compute layer—without compromising control.