talk-data.com talk-data.com

David Tempelmann

Speaker

David Tempelmann

1

talks

Staff Data Scientist Databricks

David is a Senior Resident Solution Architect and Data Scientist at Databricks. He works with customers to architect and deploy production-ready data science solutions. David has extensive experience implementing data and AI projects across various industries including retail, finance, and audit, both at Databricks and in previous roles.

Bio from: Databricks DATA + AI Summit 2023

Filtering by: Databricks DATA + AI Summit 2023 ×

Filter by Event / Source

Talks & appearances

Showing 1 of 2 activities

Search activities →
Map Your Lakehouse Content with DiscoverX

An enterprise lakehouse contains many different datasets which are related to different sources and might belong to different business units. These datasets can span across hundreds of tables, and each table has a different schema, and those schemas evolve over time. The cyber security domain is a good example where datasets come from many different source systems and land in the lakehouse. With such a complex dataset ecosystem, answers to simple questions like “Have we ever detected this IP address?” or “Which columns contain IP addresses?” can become impractical and expensive.

DiscoverX can automate the discovery of all columns that might contain specific patterns, (e.g., IP addresses, MAC addresses, fully qualified domain names, etc.) and automatically generate search and indexing queries that span across multiple tables and columns.

Talk by: Erni Durdevic and David Tempelmann

Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksinc