talk-data.com talk-data.com

Mandy Chessell

Speaker

Mandy Chessell

2

talks

Mandy Chessell CBE FREng CEng FBCS is a trusted advisor to executives from large organisations, working with them to develop their strategy and architecture relating to the governance, integration and management of information. Mandy worked for IBM for 35 years, the last 15 as an IBM Distinguished Engineer. She is now one of the founders of Pragmatic Data Research Ltd, dedicated to improving the transparency, security and efficiency of digital operations and data management. Mandy is also the honorary president of the Institution of Engineering Designers (IED).

Mandy has been developing integration software throughout her career. Her focus has always been on using and supporting open standards to achieve heterogeneous-interoperability. Today Mandy is the leader and top contributor to the Egeria Open Source project (https://egeria-project.org) which is part of the LF AI & Data Foundation. Egeria is focused on providing an open metadata and governance technology that is able to exchange, integrate and correlate metadata from different tools, engines and platforms.

Mandy is a Fellow of the Royal Academy of Engineering. In 2015 she received a CBE for services to software engineering. In 2000, she was identified as one of MIT Technology Review's hundred young people most likely

Bio from: Big Data LDN 2025

Filtering by: Data Driven LDN Conference ×

Filter by Event / Source

Talks & appearances

Showing 2 of 5 activities

Search activities →

Following on from the Building consumable data products keynote, we will dive deeper into the interactions around the data product catalog, to show how the network effect of explicit data sharing relationships starts to pay dividends to the participants. Such as:

For the product consumer:

• Searching for products, understanding content, costs, terms and conditions, licenses, quality certifications etc

• Inspecting sample data, choosing preferred data format, setting up a secure subscription, and seeing data provisioned into a database from the product catalog.

• Providing feedback and requesting help

• Reviewing own active subscriptions

• Understanding the lineage behind each product along with outstanding exceptions and future plans

For the product manager/owner:

• Setting up a new product, creating a new release of an existing product and issuing a data correction/restatement

• Reviewing a product’s active subscriptions and feedback/requests from consumers

• Interacting with the technical teams on pipeline implementations along with issues and proposed enhancements

• For the data governance team

• Viewing the network of dependencies between data products (the data mesh) to understand the data value chains and risk concentrations

• Reviewing a dashboard of metrics around the data products including popularity, errors/exceptions, subscriptions, interaction

• Show traceability from a governance policy relating to, say data sovereignty or data privacy to the product implementations.

• Building trust profiles for producers and consumers

The aim of the demonstrations and discussions is to explore the principles and patterns relating to data products, rather than push a particular implementation approach.

Having said that, all of the software used in the demonstrations is open source. Principally this is Egeria, Open Lineage and Unity Catalog from the Linux Foundation, plus Apache Airflow, Apache Kafka and Apache SuperSet from the Apache Software Foundation.  

Videos of the demonstrations will be available on YouTube after the conference and the complete demo software can be downloaded and run on a laptop so you can share your experiences with your teams after the event.

When data users choose to circumvent official sources of data, it increases the risks to the organisation. How do you encourage teams to share data widely, effectively and legally? Are data products the answer? We explore the data sharing user journey to highlight the key features of a data product strategy that significantly improves the effectiveness of your data.