talk-data.com talk-data.com

Dr. Jeroen Janssens

Speaker

Dr. Jeroen Janssens

4

talks

Head of DevRel Posit

Frequent Collaborators

Filter by Event / Source

Talks & appearances

4 activities · Newest first

Search activities →

Ever been burned by a mysterious slowdown in your data pipeline? In this session, we'll reveal how a stealthy performance regression in the Polars DataFrame library was hunted down and squashed. Using git bisect, Bash scripting, and uv, we automated commit compilation and benchmarking across two repos to pinpoint a commit that degraded multi-file Parquet loading. This led to challenging assumptions and rethinking performance monitoring for the Python data science library Polars.

The Importance and Elegance of Polars Expressions

Polars is known for its speed, but its elegance comes from its use of expressions. In this talk, we’ll explore how Polars expressions work and why they are key to efficient and elegant data manipulation. Through real-world examples, you’ll learn how to create, expand, and combine expressions in Polars to wrangle data more effectively.