Modern data pipelines are fast and expressive, but ensuring data quality is often less straightforward. This talk introduces Paguro, an open-source, feature-rich validation and metadata library built on top of the Polars DataFrame library. Paguro lets users validate both single Data(Lazy)Frames and collections of Data(Lazy)Frames together, and it provides clearly formatted terminal diagnostics that explain why and where validation failed. Attendees will learn how to integrate this lightweight, fast, and composable validation toolkit into their workflows, from exploration to production, using a familiar Polars-native syntax.
talk-data.com
Topic
Polars
data_manipulation
data_analysis
rust
Filtering by:
Bernardo Dionisi