talk-data.com talk-data.com

S

Speaker

Sandip Agarwala

1

talks

Staff Software Engineer Databricks

Sandip is a Staff Software Engineer at Databricks, where he specializes in designing and building scalable, efficient data ingestion technologies. Prior to joining Databricks, he was the founding engineer and technical lead at Springpath—a scale-out storage startup acquired by Cisco—where he architected and developed a petabyte-scale distributed file system from the ground up for private cloud environments.

Bio from: Data + AI Summit 2025

Filter by Event / Source

Talks & appearances

1 activities · Newest first

Search activities →
Lakeflow Connect: Smarter, Simpler File Ingestion With the Next Generation of Auto Loader

Auto Loader is the definitive tool for ingesting data from cloud storage into your lakehouse. In this session, we’ll unveil new features and best practices that simplify every aspect of cloud storage ingestion. We’ll demo out-of-the-box observability for pipeline health and data quality, walk through improvements for schema management, introduce a series of new data formats and unveil recent strides in Auto Loader performance. Along the way, we’ll provide examples and best practices for optimizing cost and performance. Finally, we’ll introduce a preview of what’s coming next — including a REST API for pushing files directly to Delta, a UI for creating cloud storage pipelines and more. Join us to help shape the future of file ingestion on Databricks.