talk-data.com

Kushal Thakkar

Speaker

Member of Technical Staff Anthropic

Event: Airflow Summit 2025

Talks & appearances

This session explores how to bring unit testing to SQL pipelines using Airflow. I’ll walk through the development of a SQL testing library that allows isolated testing of SQL logic by injecting mock data into base tables. To support this, we built a type system for AWS Glue tables using Pydantic, enabling schema validation and mock data generation. Over time, this type system also powered production data quality checks via a custom Airflow operator. Learn how this approach improves reliability, accelerates development, and scales testing across data workflows.
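To make the idea of isolated SQL testing concrete, here is a minimal sketch of the general pattern: declare a typed schema for a base table, validate mock rows against it, load them into a throwaway database, and assert on the output of the query under test. It is not the library presented in the talk. Everything named here (`Order`, `validate`, `load_mock_table`, `run_sql_test`, the `orders` table) is hypothetical, a standard-library dataclass stands in for the Pydantic models the abstract mentions, and in-memory SQLite stands in for the real query engine behind the AWS Glue tables.

```python
import sqlite3
from dataclasses import dataclass, fields


# Hypothetical schema for one base table. The talk describes Pydantic models
# for AWS Glue tables; a plain dataclass is swapped in to stay stdlib-only.
@dataclass
class Order:
    order_id: int
    customer: str
    amount: float


# Python type -> SQLite column type (an assumption of this sketch).
SQL_TYPES = {int: "INTEGER", str: "TEXT", float: "REAL"}


def validate(row):
    """Raise TypeError if a field value does not match its declared type."""
    for f in fields(row):
        value = getattr(row, f.name)
        if not isinstance(value, f.type):
            raise TypeError(
                f"{f.name}: expected {f.type.__name__}, got {type(value).__name__}"
            )


def load_mock_table(conn, table, rows):
    """Create `table` from the rows' schema and insert the validated mock rows."""
    schema = fields(rows[0])
    columns = ", ".join(f"{f.name} {SQL_TYPES[f.type]}" for f in schema)
    conn.execute(f"CREATE TABLE {table} ({columns})")
    placeholders = ", ".join("?" for _ in schema)
    for row in rows:
        validate(row)
        values = tuple(getattr(row, f.name) for f in schema)
        conn.execute(f"INSERT INTO {table} VALUES ({placeholders})", values)


def run_sql_test(sql, mock_tables):
    """Run `sql` against an in-memory database holding only the mock tables."""
    conn = sqlite3.connect(":memory:")
    try:
        for table, rows in mock_tables.items():
            load_mock_table(conn, table, rows)
        return conn.execute(sql).fetchall()
    finally:
        conn.close()


# The SQL logic under test: total order amount per customer.
TOTALS_SQL = """
SELECT customer, SUM(amount) AS total
FROM orders
GROUP BY customer
ORDER BY customer
"""

result = run_sql_test(
    TOTALS_SQL,
    {"orders": [Order(1, "ada", 10.0), Order(2, "ada", 5.5), Order(3, "bob", 2.0)]},
)
assert result == [("ada", 15.5), ("bob", 2.0)]
```

Because the query only ever sees the injected rows, the test is deterministic and needs no access to production data. The same schema objects could in principle drive checks on real rows, which is the direction the abstract describes for its data quality operator.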