Topic

Polars

data_manipulation data_analysis rust

Activities

2

tagged

Activity Trend

13 peak/qtr

2020-Q1 2026-Q1

Top Events

SciPy 2025 5 PyData Berlin 2025 3 O'Reilly Data Science Books 3 Data Engineering Central Podcast 3 PyData Paris 2025 2 PyData London 2025 2 DataTopics: All Things Data, AI & Tech 2 PyData Seattle 2025 2 PyConDE & PyData Berlin 2023 2 PyData Amsterdam 2025 2 Databricks DATA + AI Summit 2023 2 O'Reilly Data Engineering Books 1

Top Speakers

Marco Gorelli (Narwhals) 4 Dr. Jeroen Janssens (Posit) 3 Thijs Nieuwdorp (VodafoneZiggo) 2 Daniel Beach 2 Thomas Bierhance 1 Bernardo Dionisi 1 Brodie Vidrine 1 Guen Prawiroatmodjo 1 Vyas Ramasubramani 1 Ritchie Vink (Polars) 1 Oz Katz (Treeverse) 1 Joris Bekkers 1

Activities

Showing filtered results

All Video Podcast Book

Filtering by: PyConDE & PyData Berlin 2023 ×

Polars - make the switch to lightning-fast dataframes

2023-04-17 · PyConDE & PyData Berlin 2023

talk

by Thomas Bierhance

AI/ML Arrow Pandas Python Rust

In this talk, we will report on our experiences switching from Pandas to Polars in a real-world ML project. Polars is a new high-performance dataframe library for Python based on Apache Arrow and written in Rust. We will compare the performance of polars with the popular pandas library, and show how polars can provide significant speed improvements for data manipulation and analysis tasks. We will also discuss the unique features of polars, such as its ability to handle large datasets that do not fit into memory, and how it feels in practice to make the switch from Pandas. This talk is aimed at data scientists, analysts, and anyone interested in fast and efficient data processing in Python.

Raised by Pandas, striving for more: An opinionated introduction to Polars

2023-04-17 · PyConDE & PyData Berlin 2023

talk

by Nico Kreiling

Arrow Pandas Python Rust

Pandas is the de-facto standard for data manipulation in python, which I personally love for its flexible syntax and interoperability. But Pandas has well-known drawbacks such as memory in-efficiency, inconsistent missing data handling and lacking multicore-support. Multiple open-source projects aim to solve those issues, the most interesting is Polars.

Polars uses Rust and Apache Arrow to win in all kinds of performance-benchmarks and evolves fast. But is it already stable enough to migrate an existing Pandas' codebase? And does it meet the high-expectations on query language flexibility of long-time Pandas-lovers?

In this talk, I will explain, how Polars can be that fast, and present my insights on where Polars shines and in which scenarios I stay with pandas (at least for now!)