Data is the backbone of modern decision-making, but centralizing it is only the tip of the iceberg. Entitlements, secure sharing and just-in-time availability are critical challenges for any large-scale platform. Join Goldman Sachs as we reveal how our Legend Lakehouse, coupled with Databricks, overcomes these hurdles to deliver high-quality, governed data at scale. By leveraging an open table format (Apache Iceberg) and an open catalog (Unity Catalog), we ensure platform interoperability and vendor neutrality. Databricks Unity Catalog then provides a robust entitlement system that aligns with our data contracts, ensuring consistent access control across producer and consumer workspaces. Finally, Legend functions, integrated with Databricks user-defined functions (UDFs), offer real-time data enrichment and secure transformations without exposing raw datasets. Discover how these components unite to streamline analytics, bolster governance and power innovation.
talk-data.com
Speaker
George Wu
Vice President
Goldman Sachs
George Wu oversees the orchestration and data availability of the firm’s cloud-based Legend Lakehouse platform. He led the integration of Databricks into the platform, enhancing data analytics and processing. As the firm’s primary Databricks liaison, he manages vendor strategy and partnerships. George also leads a team maintaining the firm’s on-premises Hadoop-based Data Lake, ensuring scalable, reliable, and efficient data infrastructure.
Bio from: Data + AI Summit 2025
Talks & appearances