talk-data.com talk-data.com

G

Speaker

George Wu

1

talks

Vice President Goldman Sachs

George Wu oversees the orchestration and data availability of the firm's cloud-based Legend Lakehouse platform. He led the integration of Databricks into the platform, enhancing data analytics and processing. As the firm’s primary Databricks liaison, he manages vendor strategy and partnerships. George also leads a team maintaining the firm’s on-premise Hadoop-based Data Lake, ensuring scalable, reliable, and efficient data infrastructure.

Bio from: Data + AI Summit 2025

Filter by Event / Source

Talks & appearances

1 activities · Newest first

Search activities →

Data is the backbone of modern decision-making, but centralizing it is only the tip of the iceberg. Entitlements, secure sharing and just-in-time availability are critical challenges to any large-scale platform. Join Goldman Sachs as we reveal how our Legend Lakehouse, coupled with Databricks, overcomes these hurdles to deliver high-quality, governed data at scale. By leveraging an open table format (Apache Iceberg) and open catalog format (Unity Catalog), we ensure platform interoperability and vendor neutrality. Databricks Unity Catalog then provides a robust entitlement system that aligns with our data contracts, ensuring consistent access control across producer and consumer workspaces. Finally, Legend functions, integrating with Databricks User Defined Functions (UDF), offer real-time data enrichment and secure transformations without exposing raw datasets. Discover how these components unite to streamline analytics, bolster governance and power innovation.