talk-data.com talk-data.com

Event

PyData Amsterdam 2025

2025-09-24 โ€“ 2025-09-26 PyData

Activities tracked

1

Filtering by: Jeroen Janssens ×

Sessions & talks

Showing 1โ€“1 of 1 ยท Newest first

Search within this event →

Actionable Techniques for Finding Performance Regressions

2025-09-25
talk
Jeroen Janssens , Thijs Nieuwdorp (VodafoneZiggo)

Ever been burned by a mysterious slowdown in your data pipeline? In this session, we'll reveal how a stealthy performance regression in the Polars DataFrame library was hunted down and squashed. Using git bisect, Bash scripting, and uv, we automated commit compilation and benchmarking across two repos to pinpoint a commit that degraded multi-file Parquet loading. This led to challenging assumptions and rethinking performance monitoring for the Python data science library Polars.