Airflow Asset originated in data lineage and has since evolved into a scheduling concept (data-aware, event-based scheduling). It has even more potential. This talk discusses how other parts of Airflow, namely Connection and Object Storage, contain concepts related to Asset, and how we can tie them all together to make task authoring flow even more naturally. Planned topics: a brief history of Asset and related constructs; the current state of Asset concepts; inlets, anyone?; finding inspiration from Pydantic et al.; my next step for Asset.
talk-data.com
Topic
Pydantic
schemas
python
data_modeling
data_validation
Filtering by: Airflow Summit 2025
This session explores how to bring unit testing to SQL pipelines using Airflow. I’ll walk through the development of a SQL testing library that allows isolated testing of SQL logic by injecting mock data into base tables. To support this, we built a type system for AWS Glue tables using Pydantic, enabling schema validation and mock data generation. Over time, this type system also powered production data quality checks via a custom Airflow operator. Learn how this approach improves reliability, accelerates development, and scales testing across data workflows.
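As a rough illustration of the idea described above, and not the speaker's actual library, the sketch below uses a Pydantic model as the schema for a base table, generates mock rows from it, loads them into an isolated table, and runs the SQL under test. Everything named here is an assumption made for the example: the `OrderRow` model, the `orders` table, the `REVENUE_SQL` query, and the helper functions are hypothetical, and an in-memory SQLite database stands in for whatever engine queries the AWS Glue tables in the real setup.

```python
import sqlite3
from datetime import date

from pydantic import BaseModel, Field, ValidationError


class OrderRow(BaseModel):
    """Hypothetical schema for a base table called `orders`."""

    order_id: int
    customer: str
    amount: float = Field(ge=0)  # quality rule: amounts must not be negative
    order_date: date


def mock_orders(n: int) -> list[OrderRow]:
    """Build n deterministic mock rows that satisfy the schema."""
    return [
        OrderRow(
            order_id=i,
            customer=f"customer_{i % 2}",
            amount=10.0 * (i + 1),
            order_date=date(2025, 1, 1),
        )
        for i in range(n)
    ]


# Hypothetical SQL under test: total revenue per customer.
REVENUE_SQL = """
    SELECT customer, SUM(amount) AS revenue
    FROM orders
    GROUP BY customer
    ORDER BY customer
"""


def run_with_mock_data(sql: str, rows: list[OrderRow]) -> list[tuple]:
    """Inject mock rows into an isolated in-memory table, then run the SQL."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE orders "
            "(order_id INTEGER, customer TEXT, amount REAL, order_date TEXT)"
        )
        conn.executemany(
            "INSERT INTO orders VALUES (?, ?, ?, ?)",
            [
                (r.order_id, r.customer, r.amount, r.order_date.isoformat())
                for r in rows
            ],
        )
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


def is_valid_row(raw: dict) -> bool:
    """Reuse the same model as a data quality check on a raw record."""
    try:
        OrderRow(**raw)
        return True
    except ValidationError:
        return False


result = run_with_mock_data(REVENUE_SQL, mock_orders(4))
```

In this sketch the same model serves both purposes mentioned in the abstract: it shapes the mock data for the isolated SQL test, and `is_valid_row` shows how it could back a data quality check on real records.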