talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (97 results)

See all 97 →

Companies (1 result)

Collibra 7 speakers
Executive Vice President, Global Solutions and Services Field CTO Founder and Chief Data Citizen

Activities & events

Title & Speakers Event
Richard Vasseur – Head of IA & Data Governance, FDJ United , Franck Chrétien – Area Vice President Sales Southern Europe, Collibra

La transformation digitale exige une agilité et une efficacité sans précédent dans la gestion des données. Confronté aux limites de son architecture existante, FDJ United a adopté une approche novatrice pour relever ce défi : celle du Data Product.

Dans ce retour d’expérience live, Collibra et FDJ United vous montreront comment la mise en place d’un Data Supermarket a permis de créer un environnement de gouvernance des données efficace. Nous aborderons les bénéfices concrets de cette démarche, du concept de Data Product aux perspectives d'avenir, pour vous inspirer et faire progresser vos propres projets d’innovation.

Collibra
Big Data & AI Paris 2025
Event Big Data LDN 2025 2025-09-25
Reece Griffiths – Field CTO @ Collibra

Abstract:AI promises transformation, but most enterprises are still held back by a blind spot: unstructured data. The documents, conversations, and content that hold the richest business insights remain locked away—unusable, ungoverned, and invisible to AI. This untapped treasure delays projects, derails accuracy, and creates governance risks.

In this exclusive launch session, Teresa Rojas, Senior Director of Product Management at Collibra, and Reece Griffiths, co-founder of Deasy Labs (now part of Collibra), will unveil Collibra Unstructured AI. They’ll show how breakthrough technology, combined with Collibra’s unified governance platform, enables organizations to finally move beyond the black box and unlock the hidden wealth in their data to power trustworthy, scalable AI.

AI/ML Collibra
James Finlason – Partner & Data Governance Lead @ John Lewis Partnership , Dylan Saxby – Metadata Product Manager @ John Lewis Partnership , Stijn Christiaens – Founder and Chief Data Citizen @ Collibra

The John Lewis Partnership is building the foundation for AI success by creating a centralized, self-service data hub powered by Collibra. Through a collaborative governance framework, John Lewis Partnership is delivering trusted data products at scale, enabling faster, more confident decisions and strengthening oversight of AI initiatives.

In this session, you’ll learn:

• How JLP is overcoming fragmented, unreliable data with a single source of truth

• What drove adoption and business alignment for Collibra

• How trusted data products are accelerating AI readiness and governance

AI/ML Collibra
Behnaz Derakhshani – Specialist Data Engineer @ Diconium

Expect a hands-on journey showing how modern data lake tools and governance platforms connect the dots - making your data discoverable, governed, and productized for real-world use.

AWS Collibra data lake tools governance platforms
Yingjun Wu – Speaker @ RisingWave Labs

Stream processing systems have traditionally relied on local storage engines such as RocksDB to achieve low latency. While effective in single-node setups, this model doesn't scale well in the cloud, where elasticity and separation of compute and storage are essential. In this talk, we'll explore how RisingWave rethinks the architecture by building directly on top of S3 while still delivering sub-100 ms latency. At the core is Hummock, a log-structured state engine designed for object storage. Hummock organizes state into a three-tier hierarchy: in-memory cache for the hottest keys, disk cache managed by Foyer for warm data, and S3 as the persistent cold tier. This approach ensures queries never directly hit S3, avoiding its variable performance. We'll also examine how remote compaction offloads heavy maintenance tasks from query nodes, eliminating interference between user queries and background operations. Combined with fine-grained caching policies and eviction strategies, this architecture enables both consistent query performance and cloud-native elasticity. Attendees will walk away with a deeper understanding of how to design streaming systems that balance durability, scalability, and low latency in an S3-based environment.

S3 Data Streaming hummock object storage log-structured state engine
Erik Schmiegelow – CEO @ Hivemind Technologies

Successful gen AI projects strike the balance between impact, accuracy and cost - in this talk, we cover how to create agentic data applications effectively, choosing when and how to integrate them in data streams and keep response quality issues and costs in check.

genai data streaming ai in data pipelines

Dear data-loving community, we can't wait to present to you our new Meetup event: This time, it will be a collaboration with RisingWave, a platform for real-time streaming data management and analysis. Yingjun Wu, Founder and CEO at RisingWave Labs, will share his experience in a techy talk, as well as Behnaz Derakhshani, who works as a Specialist Data Engineer at Diconium's data department. Additionally, we're going to welcome external guest speaker Erik Schmiegelow, CEO at Hivemind Technologies. Exciting line-up, right? :D

Join us on September 16th in Berlin and bring all your questions! Here are the topics you can expect:

Yingjun Wu: Achieving Sub‑100 ms Real‑Time Stream Processing with an S3‑Native Architecture

Stream processing systems have traditionally relied on local storage engines such as RocksDB to achieve low latency. While effective in single-node setups, this model doesn't scale well in the cloud, where elasticity and separation of compute and storage are essential. In this talk, we'll explore how RisingWave rethinks the architecture by building directly on top of S3 while still delivering sub-100 ms latency. At the core is Hummock, a log-structured state engine designed for object storage. Hummock organizes state into a three-tier hierarchy: in-memory cache for the hottest keys, disk cache managed by Foyer for warm data, and S3 as the persistent cold tier. This approach ensures queries never directly hit S3, avoiding its variable performance. We'll also examine how remote compaction offloads heavy maintenance tasks from query nodes, eliminating interference between user queries and background operations. Combined with fine-grained caching policies and eviction strategies, this architecture enables both consistent query performance and cloud-native elasticity. Attendees will walk away with a deeper understanding of how to design streaming systems that balance durability, scalability, and low latency in an S3-based environment.

Behnaz Derakhshani: From Raw Data to Trusted Assets: A Practical Walkthrough with AWS services and Collibra

Expect a hands-on journey of Behnaz showing how modern data lake tools and governance platforms connect the dots, making your data discoverable, governed, and productized for real-world use.

Erik Schmiegelow: Effective Agentic GenAI in Data Streaming

Successful genAI projects strike the balance between impact, accuracy, and cost. In this talk, Erik will cover how to create agentic data applications effectively, choosing when and how to integrate them in data streams and keep response quality issues and costs in check.

What you can expect:

  • 3 expert talks
  • Interactive Q&A
  • Networking opportunities
  • Pizza & drinks (indoor or at our terrace)

Timetable:

  • 18:00 - Event admission
  • 18:30 - Welcome & introduction
  • 18:35 - Keynote by Yingjun Wu & Q&A
  • 19:05 - Short break
  • 19:15 - Keynote by Behnaz Derakhshani & Q&A
  • 19:45 - Keynote by Erik Schmiegelow & Q&A
  • 20:15 - Snacks, drinks & networking
  • 21:30 - End *

Our goal is to form a local data-loving community, so join us and let's talk data together!

-> Our event page, where you can also contact us if you want to present in the future at our Meetup: Data Engineering MeetUp Berlin - applydata

--- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here.

Data Builders’ Evening: Architecture, Engineering & Beyond | Berlin, Sep. 16th

Air France KLM partnered with Datashift to tackle data silos, regulatory complexity, and fragmented access by building a central data marketplace. Using Collibra and Google Cloud, they now empower the organization with trusted, governed, and self-service data to support critical operations and decision-making.

Cloud Computing Collibra GCP
Data Expo NL 2025
Simon Bennett – Group Product Manager @ Schroders

Data teams are faced with increased demand for trusted data assets, but struggle to quickly and safely grant access to the most validated data. In this session, Schroders will share how they've enhanced discoverability and governance of data across the enterprise using Collibra's unified governance for data and AI to enable data consumers to easily find and access the trusted data they need. Learn how building a curated marketplace of data products has helped foster greater collaboration and increased the safe usage of data for greater advantage.

AI/ML Collibra
Axel Frank @ Siemens AG , Stan Christiaens @ Collibra , Kevin Falkenstein @ Siemens AG

Join Kevin Falkenstein, Service Manager of Data Governance Platform, and Axel Frank, Senior Solution Architect, from Siemens as they share Siemens' transformative journey in scaling Data & AI governance. Discover how Siemens is paving the way for the future by managing Data & AI models across departments, ensuring compliance, and building a foundation for long-term success by unifying Data and AI strategies with Collibra.

AI/ML Collibra Data Governance
Jobin George – Partner Engineering Lead, Data Analytics @ Google Cloud , Stephen Gilderdale – Executive Vice President, Global Solutions and Services @ Collibra , John Tucker – Director, Enterprise Data Governance @ McDonald's

McDonald’s is advancing its AI capabilities and achieving transformative business outcomes with Collibra and Google Cloud Gemini. By integrating a multi-cloud, multi-data platform with unified data governance for AI, McDonald’s is reducing governance fragmentation, activating unused data, and addressing AI deployment risks. This session examines how McDonald's utilizes Collibra and Google Cloud's data and AI technologies to quickly implement innovative solutions and improve customer, restaurant team, and employee experiences.

Google Cloud Next '25
Dan Bruckner – co-founder and CTO @ Tamr , Tobias Macey – host

Summary In this episode of the Data Engineering Podcast Dan Bruckner, co-founder and CTO of Tamr, talks about the application of machine learning (ML) and artificial intelligence (AI) in master data management (MDM). Dan shares his journey from working at CERN to becoming a data expert and discusses the challenges of reconciling large-scale organizational data. He explains how data silos arise from independent teams and highlights the importance of combining traditional techniques with modern AI to address the nuances of data reconciliation. Dan emphasizes the transformative potential of large language models (LLMs) in creating more natural user experiences, improving trust in AI-driven data solutions, and simplifying complex data management processes. He also discusses the balance between using AI for complex data problems and the necessity of human oversight to ensure accuracy and trust.

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data managementData migrations are brutal. They drag on for months—sometimes years—burning through resources and crushing team morale. Datafold's AI-powered Migration Agent changes all that. Their unique combination of AI code translation and automated data validation has helped companies complete migrations up to 10 times faster than manual approaches. And they're so confident in their solution, they'll actually guarantee your timeline in writing. Ready to turn your year-long migration into weeks? Visit dataengineeringpodcast.com/datafold today for the details. As a listener of the Data Engineering Podcast you clearly care about data and how it affects your organization and the world. For even more perspective on the ways that data impacts everything around us don't miss Data Citizens® Dialogues, the forward-thinking podcast brought to you by Collibra. You'll get further insights from industry leaders, innovators, and executives in the world's largest companies on the topics that are top of mind for everyone. In every episode of Data Citizens® Dialogues, industry leaders unpack data’s impact on the world; like in their episode “The Secret Sauce Behind McDonald’s Data Strategy”, which digs into how AI-driven tools can be used to support crew efficiency and customer interactions. In particular I appreciate the ability to hear about the challenges that enterprise scale businesses are tackling in this fast-moving field. The Data Citizens Dialogues podcast is bringing the data conversation to you, so start listening now! Follow Data Citizens Dialogues on Apple, Spotify, YouTube, or wherever you get your podcasts.Your host is Tobias Macey and today I'm interviewing Dan Bruckner about the application of ML and AI techniques to the challenge of reconciling data at the scale of businessInterview IntroductionHow did you get involved in the area of data management?Can you start by giving an overview of the different ways that organizational data becomes unwieldy and needs to be consolidated and reconciled?How does that reconciliation relate to the practice of "master data management"What are the scaling challenges with the current set of practices for reconciling data?ML has been applied to data cleaning for a long time in the form of entity resolution, etc. How has the landscape evolved or matured in recent years?What (if any) transformative capabilities do LLMs introduce?What are the missing pieces/improvements that are necessary to make current AI systems usable out-of-the-box for data cleaning?What are the strategic decisions that need to be addressed when implementing ML/AI techniques in the data cleaning/reconciliation process?What are the risks involved in bringing ML to bear on data cleaning for inexperienced teams?What are the most interesting, innovative, or unexpected ways that you have seen ML techniques used in data resolution?What are the most interesting, unexpected, or challenging lessons that you have learned while working on using ML/AI in master data management?When is ML/AI the wrong choice for data cleaning/reconciliation?What are your hopes/predictions for the future of ML/AI applications in MDM and data cleaning?Contact Info LinkedInParting Question From your perspective, what is the biggest gap in the tooling or technology for data management today?Closing Announcements Thank you for listening! Don't forget to check out our other shows. Podcast.init covers the Python language, its community, and the innovative ways it is being used. The AI Engineering Podcast is your guide to the fast-moving world of building AI systems.Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.Links TamrMaster Data ManagementCERNLHCMichael StonebrakerConway's LawExpert SystemsInformation RetrievalActive LearningThe intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

AI/ML Collibra Data Engineering Data Management Datafold LLM Master Data Management Python
Hugo Lu – Founder , Tobias Macey – host

Summary The core task of data engineering is managing the flows of data through an organization. In order to ensure those flows are executing on schedule and without error is the role of the data orchestrator. Which orchestration engine you choose impacts the ways that you architect the rest of your data platform. In this episode Hugo Lu shares his thoughts as the founder of an orchestration company on how to think about data orchestration and data platform design as we navigate the current era of data engineering.

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data managementIt’s 2024, why are we still doing data migrations by hand? Teams spend months—sometimes years—manually converting queries and validating data, burning resources and crushing morale. Datafold's AI-powered Migration Agent brings migrations into the modern era. Their unique combination of AI code translation and automated data validation has helped companies complete migrations up to 10 times faster than manual approaches. And they're so confident in their solution, they'll actually guarantee your timeline in writing. Ready to turn your year-long migration into weeks? Visit dataengineeringpodcast.com/datafold today to learn how Datafold can automate your migration and ensure source to target parity. As a listener of the Data Engineering Podcast you clearly care about data and how it affects your organization and the world. For even more perspective on the ways that data impacts everything around us don't miss Data Citizens® Dialogues, the forward-thinking podcast brought to you by Collibra. You'll get further insights from industry leaders, innovators, and executives in the world's largest companies on the topics that are top of mind for everyone. In every episode of Data Citizens® Dialogues, industry leaders unpack data’s impact on the world, from big picture questions like AI governance and data sharing to more nuanced questions like, how do we balance offense and defense in data management? In particular I appreciate the ability to hear about the challenges that enterprise scale businesses are tackling in this fast-moving field. The Data Citizens Dialogues podcast is bringing the data conversation to you, so start listening now! Follow Data Citizens Dialogues on Apple, Spotify, YouTube, or wherever you get your podcasts.Your host is Tobias Macey and today I'm interviewing Hugo Lu about the data platform and orchestration ecosystem and how to navigate the available optionsInterview IntroductionHow did you get involved in building data platforms?Can you describe what an orchestrator is in the context of data platforms?There are many other contexts in which orchestration is necessary. What are some examples of how orchestrators have adapted (or failed to adapt) to the times?What are the core features that are necessary for an orchestrator to have when dealing with data-oriented workflows?Beyond the bare necessities, what are some of the other features and design considerations that go into building a first-class dat platform or orchestration system?There have been several generations of orchestration engines over the past several years. How would you characterize the different coarse groupings of orchestration engines across those generational boundaries?How do the characteristics of a data orchestrator influence the overarching architecture of an organization's data platform/data operations?What about the reverse?How have the cycles of ML and AI workflow requirements impacted the design requirements for data orchestrators?What are the most interesting, innovative, or unexpected ways that you have seen data orchestrators used?What are the most interesting, unexpected, or challenging lessons that you have learned while working on data orchestration?When is an orchestrator the wrong choice?What are your predictions and/or hopes for the future of data orchestration?Contact Info MediumLinkedInParting Question From your perspective, what is the biggest thing data teams are missing in the technology today?Closing Announcements Thank you for listening! Don't forget to check out our other shows. Podcast.init covers the Python language, its community, and the innovative ways it is being used. The AI Engineering Podcast is your guide to the fast-moving world of building AI systems.Visit the site to subscribe to the show, sign up for the mailing list, and read the show notes.If you've learned something or tried out a project from the show then tell us about it! Email [email protected] with your story.Links OrchestraPrevious Episode: Overview Of The State Of Data OrchestrationCronArgoCDDAGKubernetesData MeshAirflowSSIS == SQL Server Integration ServicesPentahoKettleDataVoloNiFiPodcast EpisodeDagstergRPCCoalescePodcast EpisodedbtDataHubPalantirThe intro and outro music is from The Hug by The Freak Fandango Orchestra / CC BY-SA

AI/ML Collibra Data Engineering Data Management Datafold Python SQL
Anna Geller – guest , Tobias Macey – host

Summary In this episode of the Data Engineering Podcast, Anna Geller talks about the integration of code and UI-driven interfaces for data orchestration. Anna defines data orchestration as automating the coordination of workflow nodes that interact with data across various business functions, discussing how it goes beyond ETL and analytics to enable real-time data processing across different internal systems. She explores the challenges of using existing scheduling tools for data-specific workflows, highlighting limitations and anti-patterns, and discusses Kestra's solution, a low-code orchestration platform that combines code-driven flexibility with UI-driven simplicity. Anna delves into Kestra's architectural design, API-first approach, and pluggable infrastructure, and shares insights on balancing UI and code-driven workflows, the challenges of open-core business models, and innovative user applications of Kestra's platform.

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data managementData migrations are brutal. They drag on for months—sometimes years—burning through resources and crushing team morale. Datafold's AI-powered Migration Agent changes all that. Their unique combination of AI code translation and automated data validation has helped companies complete migrations up to 10 times faster than manual approaches. And they're so confident in their solution, they'll actually guarantee your timeline in writing. Ready to turn your year-long migration into weeks? Visit dataengineeringpodcast.com/datafold today for the details.As a listener of the Data Engineering Podcast you clearly care about data and how it affects your organization and the world. For even more perspective on the ways that data impacts everything around us you should listen to Data Citizens® Dialogues, the forward-thinking podcast from the folks at Collibra. You'll get further insights from industry leaders, innovators, and executives in the world's largest companies on the topics that are top of mind for everyone. They address questions around AI governance, data sharing, and working at global scale. In particular I appreciate the ability to hear about the challenges that enterprise scale businesses are tackling in this fast-moving field. While data is shaping our world, Data Citizens Dialogues is shaping the conversation. Subscribe to Data Citizens Dialogues on Apple, Spotify, Youtube, or wherever you get your podcasts.Your host is Tobias Macey and today I'm interviewing Anna Geller about incorporating both code and UI driven interfaces for data orchestrationInterview IntroductionHow did you get involved in the area of data management?Can you start by sharing a definition of what constitutes "data orchestration"?There are many orchestration and scheduling systems that exist in other contexts (e.g. CI/CD systems, Kubernetes, etc.). Those are often adapted to data workflows because they already exist in the organizational context. What are the anti-patterns and limitations that approach introduces in data workflows?What are the problems that exist in the opposite direction of using data orchestrators for CI/CD, etc.?Data orchestrators have been around for decades, with many different generations and opinions about how and by whom they are used. What do you see as the main motivation for UI vs. code-driven workflows?What are the benefits of combining code-driven and UI-driven capabilities in a single orchestrator?What constraints does it necessitate to allow for interoperability between those modalities?Data Orchestrators need to integrate with many external systems. How does Kestra approach building integrations and ensure governance for all their underlying configurations?Managing workflows at scale across teams can be challenging in terms of providing structure and visibility of dependencies across workflows and teams. What features does Kestra offer so that all pipelines and teams stay organised?What are

AI/ML Analytics API CI/CD Collibra Data Engineering Data Management Datafold ETL/ELT Kestra Kubernetes
Lalith Suresh – guest , Mihai Budiu – guest , Leonid Ryzhyk – guest , Tobias Macey – host

Summary In this episode of the Data Engineering Podcast, the creators of Feldera talk about their incremental compute engine designed for continuous computation of data, machine learning, and AI workloads. The discussion covers the concept of incremental computation, the origins of Feldera, and its unique ability to handle both streaming and batch data seamlessly. The guests explore Feldera's architecture, applications in real-time machine learning and AI, and challenges in educating users about incremental computation. They also discuss the balance between open-source and enterprise offerings, and the broader implications of incremental computation for the future of data management, predicting a shift towards unified systems that handle both batch and streaming data efficiently.

Announcements Hello and welcome to the Data Engineering Podcast, the show about modern data managementImagine catching data issues before they snowball into bigger problems. That’s what Datafold’s new Monitors do. With automatic monitoring for cross-database data diffs, schema changes, key metrics, and custom data tests, you can catch discrepancies and anomalies in real time, right at the source. Whether it’s maintaining data integrity or preventing costly mistakes, Datafold Monitors give you the visibility and control you need to keep your entire data stack running smoothly. Want to stop issues before they hit production? Learn more at dataengineeringpodcast.com/datafold today!As a listener of the Data Engineering Podcast you clearly care about data and how it affects your organization and the world. For even more perspective on the ways that data impacts everything around us you should listen to Data Citizens® Dialogues, the forward-thinking podcast from the folks at Collibra. You'll get further insights from industry leaders, innovators, and executives in the world's largest companies on the topics that are top of mind for everyone. They address questions around AI governance, data sharing, and working at global scale. In particular I appreciate the ability to hear about the challenges that enterprise scale businesses are tackling in this fast-moving field. While data is shaping our world, Data Citizens Dialogues is shaping the conversation. Subscribe to Data Citizens Dialogues on Apple, Spotify, Youtube, or wherever you get your podcasts.Your host is Tobias Macey and today I'm interviewing Leonid Ryzhyk, Lalith Suresh, and Mihai Budiu about Feldera, an incremental compute engine for continous computation of data, ML, and AI workloadsInterview IntroductionCan you describe what Feldera is and the story behind it?DBSP (the theory behind Feldera) has won multiple awards from the database research community. Can you explain what it is and how it solves the incremental computation problem?Depending on which angle you look at it, Feldera has attributes of data warehouses, federated query engines, and stream processors. What are the unique use cases that Feldera is designed to address?In what situations would you replace another technology with Feldera?When is it an additive technology?Can you describe the architecture of Feldera?How have the design and scope evolved since you first started working on it?What are the state storage interfaces available in Feldera?What are the opportunities for integrating with or building on top of open table formats like Iceberg, Lance, Hudi, etc.?Can you describe a typical workflow for an engineer building with Feldera?You advertise Feldera's utility in ML and AI use cases in addition to data management. What are the features that make it conducive to those applications?What is your philosophy toward the community growth and engagement with the open source aspects of Feldera and how you're balancing that with sustainability of the project and business?What are the most interesting, innovative, or unexpected ways that you have seen Feldera used?What are the most interesting, unexpected, or challenging lessons that

AI/ML Collibra Data Engineering Data Management Datafold Hudi Iceberg Lance Data Streaming
Fede Frumento – Senior Data Governance Manager - Vodafone , Stijn 'Stan' Christiaens – Co-founder @ Collibra

As the hype for AI grows, organizations are still wrestling with the fundamentals of data governance. The ambitions of executives and boardrooms to implement next-gen AI use cases hinges on a solid data foundation including cataloging, ownership, and data quality. Join Collibra’s Chief Data Citizen, Stijn Christiaens and Vodafone’s Sr. Data Governance Manager, Fede Frumento, to learn how Vodafone has used data governance fundamentals to increase the scalability and collaboration of GenAI use cases.

AI/ML Collibra Data Governance Data Quality GenAI
Big Data LDN 2024

Hicham Amchaar, Artisan Cloud/Data Architect Indépendant nous présentra son parcours pour transformer une équipe d'infrastructure cloud en une entité axée sur les données.

Nous commencerons par examiner le contexte métier et les défis qui ont conduit à cette décision, en mettant en lumière notamment la mise en place d'un système de gestion des "Landing zones".

Ensuite, nous aborderons les méthodes, les technologies (logiciels, services et outils) et les procédures mises en œuvre pour concrétiser cette transition, mettant en avant des concepts tels que l'Event-Driven, le CDC, l'Architecture Hexagonale, les Données de référence, DataStream et Collibra.

Enfin, nous conclurons en tirant des leçons de cette transformation inspirante.

Bshare: Évolution vers une approche Data-Driven au sein d'une équipe CloudOps
Richie – host @ DataCamp , Bart Vandekerckhove – Co-founder and CEO @ Raito

Data used to be the exhaust of our work activities, until we started seeing the value it can provide. Today, data is a strategic asset, used to gain a competitive advantage and well guarded from those that might use it to harm others. With this change in attitude, how we access and safeguard our data has improved massively. However, data breaches are not a thing of the past, and with the advent of AI, many new techniques for maliciously accessing data are being created. With the extra importance of data security, it is always pertinent to iterate on how we keep our data safe, and how we manage who has access to it.  Bart Vandekerckhove is the co-founder and CEO at Raito. Raito is on a mission to bring back balance in data democratization and data security. Bart helps data teams save time on data access management, so they can focus on innovation. As the former PM Privacy at Collibra, Bart has seen first hand how slow data access management processes can harm progress.  In the full episode, Richie and Bart explore the importance of data access management, the roles involved in data access including senior management’s role in data access, data security and privacy tools, the impact of AI on data security, how culture feeds into data security, the challenges of a creating a good data access management culture, common mistakes organizations make, advice for improving data security and much more.  Links Mentioned in the Show: RaitoCapital One Data BreachOptus Data BreachIAMCourse: Introduction to Data Privacy

AI/ML Collibra Cyber Security
DataFramed

A data lake is only as good as its governance. Understanding what data you have, performing classification, defining/applying security policies and auding how it's used is the data governance lifecycle. Unity Catalog with its rich ecosystem of supported tools simplifies all stages of the data governance lifecycle. Learn how metadata can be hydrated, into Collibra directly from Unity Catalog. Once the metadata is available in Collibra we will demonstrate classification, defining security policies on the data and pushing those policies into Databricks. All access and usage of data is automatically audited with real time lineage provided in the data explorer as well as system tables.

Talk by: Leon Eller and Antonio Castelo

Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksin

Collibra Data Governance Data Lake Databricks Cyber Security
Rick Wear , Sarah Pollitt – Group Product Manager for ETL @ Matillion

In this presentation, Matillion’s Sarah Pollitt, Group Product Manager for ETL, will discuss how you can use Matillion to load data from popular data sources such as Salesforce, SAP, and over a hundred out-of-the-box connectors into your data lakehouse. You can quickly transform this data using powerful tools like Matillion or dbt, or your own custom notebooks, to derive valuable insights. She will also explore how you can run streaming pipelines to ensure real-time data processing, and how you can extract and manage this data using popular governance tools such as Alation or Collibra, ensuring compliance and data quality. Finally, Sarah will showcase how you can seamlessly integrate this data into your analytics tools of choice, such as Thoughtspot, PowerBI, or any other analytics tool that fits your organization's needs.

Talk by: Rick Wear

Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksinc

Alation Analytics Collibra Data Lakehouse Data Quality Databricks dbt ETL/ELT Matillion Power BI SAP Data Streaming Thoughtspot