Topic

Polars

data_manipulation data_analysis rust

Activities: 2 tagged

Activity Trend: peak of 13 per quarter (2020-Q1 to 2026-Q1)

Top Events

SciPy 2025 · 5
PyData Berlin 2025 · 3
O'Reilly Data Science Books · 3
Data Engineering Central Podcast · 3
PyData Paris 2025 · 2
PyData London 2025 · 2
DataTopics: All Things Data, AI & Tech · 2
PyData Seattle 2025 · 2
PyConDE & PyData Berlin 2023 · 2
PyData Amsterdam 2025 · 2
Databricks DATA + AI Summit 2023 · 2
O'Reilly Data Engineering Books · 1

Top Speakers

Marco Gorelli (Narwhals) · 4
Dr. Jeroen Janssens (Posit) · 3
Thijs Nieuwdorp (VodafoneZiggo) · 2
Daniel Beach · 2
Thomas Bierhance · 1
Bernardo Dionisi · 1
Brodie Vidrine · 1
Guen Prawiroatmodjo · 1
Vyas Ramasubramani · 1
Ritchie Vink (Polars) · 1
Oz Katz (Treeverse) · 1
Joris Bekkers · 1

Activities

Streamlining data pipeline development with Ordeq

2025-09-25 · PyData Amsterdam 2025

talk

by Niels Neerhoff, Simon Brugman

Airflow Cloud Computing GCP Matplotlib Spark

In this talk, we will introduce Ordeq, a cutting-edge data pipeline development framework used by data engineers, scientists, and analysts across ING. Ordeq helps you modularise pipeline logic and abstract IO, elevating projects from proofs of concept to maintainable production-level applications. We will demonstrate how Ordeq integrates seamlessly with popular data processing tools such as Spark, Polars, Matplotlib, and DSPy, and with orchestration tools such as Airflow. Additionally, we will showcase how you can use Ordeq on public cloud offerings such as GCP. Ordeq has zero dependencies and is available under the MIT license.
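To make "modularise pipeline logic and abstract IO" concrete, here is a minimal generic sketch of the idea in plain Python. It is not Ordeq's API: the names `IO`, `InMemory`, `Step`, and `total_by_key` are hypothetical and exist only for this illustration. The transformation is a pure function, and where the data comes from and goes to is hidden behind a small load/save interface.

```python
from dataclasses import dataclass
from typing import Any, Callable, Protocol


class IO(Protocol):
    """Hypothetical interface: anything that can load and save data."""

    def load(self) -> Any: ...

    def save(self, data: Any) -> None: ...


@dataclass
class InMemory:
    """Hypothetical IO backed by a Python object; a file or table IO would expose the same methods."""

    value: Any = None

    def load(self) -> Any:
        return self.value

    def save(self, data: Any) -> None:
        self.value = data


@dataclass
class Step:
    """Hypothetical pipeline step: binds a pure function to an input IO and an output IO."""

    func: Callable[[Any], Any]
    source: IO
    sink: IO

    def run(self) -> None:
        self.sink.save(self.func(self.source.load()))


def total_by_key(rows: list[dict]) -> dict:
    """Pure pipeline logic: sum `amount` per `key`, with no knowledge of storage."""
    totals: dict = {}
    for row in rows:
        totals[row["key"]] = totals.get(row["key"], 0) + row["amount"]
    return totals


raw = InMemory([
    {"key": "a", "amount": 1},
    {"key": "b", "amount": 2},
    {"key": "a", "amount": 3},
])
result = InMemory()
Step(total_by_key, raw, result).run()
```

Because `total_by_key` never touches storage, the same function can be unit-tested with in-memory data and later wired to a different IO without changes.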

Actionable Techniques for Finding Performance Regressions

2025-09-25 · PyData Amsterdam 2025

talk

by Thijs Nieuwdorp (VodafoneZiggo), Dr. Jeroen Janssens (Posit)

Bash Data Science Git Parquet Python

Ever been burned by a mysterious slowdown in your data pipeline? In this session, we'll reveal how a stealthy performance regression in the Polars DataFrame library was hunted down and squashed. Using git bisect, Bash scripting, and uv, we automated commit compilation and benchmarking across two repos to pinpoint a commit that degraded multi-file Parquet loading. This led to challenging assumptions and rethinking performance monitoring for the Python data science library Polars.
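The bisect-and-benchmark approach described in the abstract can be sketched as a small check script for `git bisect run`, which treats exit code 0 as "good", 1 as "bad", and 125 as "skip this commit". This is a hedged illustration, not the speakers' actual tooling: the workload, the 0.5 second threshold, and all function names are assumptions, and the placeholder workload stands in for the multi-file Parquet load that would be timed in the real case.

```python
import statistics
import time
from typing import Callable

# Exit codes that `git bisect run` interprets.
GOOD, BAD, SKIP = 0, 1, 125


def median_runtime(workload: Callable[[], object], repeats: int = 5) -> float:
    """Run the workload several times and return the median wall-clock time in seconds."""
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        workload()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def verdict(elapsed: float, threshold: float) -> int:
    """Map a measured runtime onto a bisect exit code."""
    return GOOD if elapsed <= threshold else BAD


def placeholder_workload() -> None:
    """Stand-in for the real benchmark (assumption: replace with the slow operation)."""
    sum(i * i for i in range(100_000))


def main(threshold: float = 0.5) -> int:
    """Benchmark the current checkout; skip the commit if the workload cannot run at all."""
    try:
        elapsed = median_runtime(placeholder_workload)
    except Exception:
        return SKIP
    return verdict(elapsed, threshold)

# When saved as a standalone script, end the file with: raise SystemExit(main())
```

With that final line added and the file saved as, say, `check.py` (a hypothetical name), the search would be started with `git bisect start <bad> <good>` followed by `git bisect run python check.py`. Using the median of several runs reduces the chance that a single noisy measurement mislabels a commit.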