Topic

PySpark

big_data distributed_computing python

Activities

3

tagged

Activity Trend

14 peak/qtr

2020-Q1 2026-Q1

Top Events

O'Reilly Data Engineering Books 19 Databricks DATA + AI Summit 2023 16 Data + AI Summit 2025 13 Data Engineering Podcast 4 O'Reilly Data Science Books 2 PyData Berlin 2025 2 PyData Cardiff - July 2025 1 From a Fintech lens: MCP server live-coding & feature selection data hacks 1 dbt Coalesce 2025 1 PyData Seattle 2025 1 PyConDE & PyData Berlin 2023 1 SciPy 2025 1

Top Speakers

Tobias Macey 4 Marco Gorelli (Narwhals) 3 Denny Lee (Databricks) 3 Pramod Singh 3 Sundar Krishnan 2 Tomasz Drabas 2 Raju Kumar Mishra 2 Allison Wang (Databricks) 2 Ramcharan Kakarla 2 Xiao Li (Databricks) 2 Stuart Moncada (Google Cloud) 1 Benjamin Bengfort 1

Activities

Showing filtered results

All Video Podcast Book

Filtering by: Marco Gorelli ×

Narwhals: enabling universal dataframe support

2025-09-02 · PyData Berlin 2025 Watch

talk

by Marco Gorelli (Narwhals)

Data Science DuckDB Pandas Plotly Polars

Ever tried passing a Polars Dataframe to a data science library and found that it...just works? No errors, no panics, no noticeable overhead, just...results? This is becoming increasingly common in 2025, yet only 2 years ago, it was mostly unheard of. So, what changed? A large part of the answer is: Narwhals.

Narwhals is a lightweight compatibility layer between dataframe libraries which lets your code work seamlessly across Polars, pandas, PySpark, DuckDB, and more! And it's not just a theoretical possibility: with ~30 million monthly downloads and set as a required dependency of Altair, Bokeh, Marimo, Plotly, Shiny, and more, it's clear that it's reshaping the data science landscape. By the end of the talk, you'll understand why writing generic dataframe code was such a headache (and why it isn't anymore), how Narwhals works and how its community operates, and how you can use it in your projects today. The talk will be technical yet accessible and light-hearted.

Narwhals: A lightweight compatibility layer between dataframe libraries

2025-07-31 · PyData Cardiff - July 2025

talk

by Marco Gorelli (Narwhals)

DuckDB Pandas Polars narwhals

Narwhals is a lightweight compatibility layer between dataframe libraries which lets your code work seamlessly across Polars, Pandas, PySpark, DuckDB and more.

Polars, DuckDB, PySpark, PyArrow, pandas, cuDF: how Narwhals has brought them all together!

2025-06-08 · PyData London 2025 Watch

talk

by Marco Gorelli (Narwhals)

Data Science DuckDB Pandas Polars

Suppose you want to write a data science tool to do feature engineering. Your experience may go like this: - Expectation: you can focus on state-of-the art techniques for feature engineering. - Reality: you keep having to make you codebase more complex because a new dataframe library has come out and users are demanding support for it.

Or rather, it might have gone like that in the pre-Narwhals era. Because now, you can focus on solving the problems which your tool set out to do, and let Narwhals handle the subtle differences between different kinds of dataframe inputs!