talk-data.com

Topic

Data Contracts

data_governance data_quality data_engineering

2

tagged

Activities

Filtering by: Shirshanka Das

Shift-left governance for your dbt centered stack: Data contracts and more! - Coalesce 2023

Data contracts have been much discussed in the community of late, with a lot of curiosity around how to approach this concept in practice and how it might enable shift-left, developer-first governance and data quality. For organizations adopting dbt while also dealing with non-dbt data that is upstream of the warehouse, it can be challenging to understand how to apply data contracts uniformly across a fragmented stack. Doing so calls for a harmonizing layer, which we are calling the Control Plane for Data, powered by the common thread across these systems: metadata.

In this talk, Shirshanka Das, CTO of Acryl Data and founder of the DataHub Project, describes how you can use data contracts and DataHub to make your dbt-centered stack more reliable, as well as other use cases that can help build a simpler, more flexible data stack.

Speaker: Shirshanka Das, CTO, Acryl Data

Register for Coalesce at https://coalesce.getdbt.com

Data contracts have been much discussed in the community of late, with a lot of curiosity around how to approach this concept in practice. We believe data contracts need a harmonizing layer to manage data quality in a uniform manner across a fragmented stack. We are calling this harmonizing layer the Control Plane for Data, powered by the common thread across these systems: metadata. For teams already orchestrating pipelines with Airflow, data contracts can be an effective way to ensure that pipelines process only data that meets preset quality standards. With a control plane as a connecting layer, producers can build data contracts that consumers can rely on, ensuring DAGs run only when a contract is valid. Producers can govern how workflows should behave, and consumers receive the tooling they need to opt into only high-quality data. Learn how to use data contracts and DataHub to make your Airflow pipelines more reliable, as well as other use cases that can help build a simpler, more flexible data stack.
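
The idea of running a DAG only when a contract is valid can be illustrated with a minimal sketch. This is not the talk's implementation and does not use the DataHub API: `DataContract`, `DatasetMetadata`, `contract_violations`, and `should_run` are hypothetical names, and the contract here is assumed to cover only required columns and freshness.

```python
from dataclasses import dataclass


@dataclass
class DataContract:
    """Hypothetical contract: required columns plus a freshness limit."""
    dataset: str
    required_columns: set[str]
    max_staleness_hours: float


@dataclass
class DatasetMetadata:
    """Hypothetical metadata snapshot, as a control plane might report it."""
    columns: set[str]
    hours_since_update: float


def contract_violations(contract: DataContract, meta: DatasetMetadata) -> list[str]:
    """Return a human-readable list of violations; empty means the contract holds."""
    violations = []
    missing = contract.required_columns - meta.columns
    if missing:
        violations.append(f"missing columns: {sorted(missing)}")
    if meta.hours_since_update > contract.max_staleness_hours:
        violations.append(
            f"stale data: {meta.hours_since_update}h > {contract.max_staleness_hours}h"
        )
    return violations


def should_run(contract: DataContract, meta: DatasetMetadata) -> bool:
    """Gate for downstream work: True only when the contract is satisfied."""
    return not contract_violations(contract, meta)
```

In an Airflow DAG, a callable like `should_run` could be supplied as the `python_callable` of a `ShortCircuitOperator`, which skips downstream tasks when the callable returns a falsy value. How the metadata is fetched from the control plane is left out here because the abstract does not specify it.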