talk-data.com talk-data.com

Google Cloud Next session 2025-04-09 at 20:15

Overcoming data privacy obstacles: Data de-identification in data lakes and lakehouses

Description

Datalakes and lakehouses are becoming more popular for storing sensitive data. This session will explore the challenges of implementing effective de-identification and potential Google Cloud based solutions.

Data de-identification is complex due to the diversity, scale, and variety of formats. Ensuring compliance and maintaining data utility are key challenges. Careful evaluation of techniques like tokenization and masking is essential, along with robust monitoring and auditing.