talk-data.com

Topic

Rust

programming_language

Activity Trend: peak of 11 tagged activities per quarter (2020-Q1 to 2026-Q1)

Top Events

ADSP: Algorithms + Data Structures = Programs 33
DataTopics: All Things Data, AI & Tech 9
Databricks DATA + AI Summit 2023 8
PyConDE & PyData Berlin 2023 6
Data Engineering Podcast 5
The Pragmatic Engineer 5
Data + AI Summit 2025 5
LDN Talks September 2025 Community Showcase 3
Data Engineering Central Podcast 2
Big Data LDN 2025 2
DATA MINER Big Data Europe Conference 2020 2
Google Cloud Next '24 2

Top Speakers

Conor Hoekstra 33
Bryce Adelstein Lelbach (NVIDIA) 33
Tobias Macey 5
Sean Parent (Adobe) 5
Richard Feldman (Roc programming language) 4
Gergely Orosz 4
Jane Losare-Lusby (Futurewei Technologies) 3
Tristan Brindle (C++ London Uni) 3
Douglas Gregor (Apple) 3
Robert Pack (Databricks) 3
Frank McSherry (Materialize) 2
Barry Revzin (Jump Trading) 2

Activities

Filtering by: PyConDE & PyData Berlin 2023

Most of you don't need Spark. Large-scale data management on a budget with Python

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Guillem Borrell

Data Management Python Spark Data Streaming

The Python data ecosystem has matured during the last decade, and there are fewer and fewer reasons to rely only on large batch processes executed in a Spark cluster. But, as with every large ecosystem, putting together the key pieces of technology takes some effort. There are now better storage technologies, streaming execution engines, query planners, and low-level compute libraries, and modern hardware is far more powerful than you would probably expect. In this workshop we will explore some global-warming-reducing techniques to build more efficient data transformation pipelines in Python, and a little bit of Rust.

Pragmatic ways of using Rust in your data project

2023-04-18 · PyConDE & PyData Berlin 2023

talk

by Christopher Prohm

NumPy Pandas Python

Writing efficient data pipelines in Python can be tricky. The standard recommendation is to use vectorized functions implemented in NumPy, Pandas, or the like. But what should you do when the processing task does not fit these libraries? Using plain Python for processing can result in poor performance, in particular when handling large data sets.

Rust is a modern, performance-oriented programming language that is already widely used by the Python community. Augmenting data processing steps with Rust can result in substantial speedups. This talk presents strategies for using Rust in a larger Python data processing pipeline, with a particular focus on pragmatism and minimizing integration effort.

Rusty Python: A Case Study

2023-04-18 · PyConDE & PyData Berlin 2023

talk

by Robin Raymond

Python

Python is a very expressive and powerful language, but it is not always the fastest option for performance-critical parts of an application. Rust, on the other hand, is known for its lightning-fast runtime and low-level control, making it an attractive option for speeding up performance-sensitive portions of Python programs.

In this talk, we will present a case study of using Rust to speed up a critical component of a Python application. We will cover the following topics:

An overview of Rust and its benefits for Python developers
Profiling and identifying performance bottlenecks in a Python application
Implementing a solution in Rust and integrating it with the Python application using PyO3
Measuring the performance improvements and comparing them to other optimization techniques

Attendees will learn about the potential for using Rust to boost the performance of their Python programs and how to go about doing so in their own projects.
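The profiling step in the list above can be sketched with the standard library alone. This is a minimal illustration, not the case study from the talk: `slow_checksum`, `pipeline`, and `profile_call` are hypothetical names invented for the example, and the hot loop stands in for whatever component a real profile would point to as a candidate for a Rust rewrite.

```python
import cProfile
import io
import pstats


def slow_checksum(values):
    # Hypothetical hot loop: pure-Python arithmetic over every element.
    total = 0
    for v in values:
        total = (total + v * v) % 1_000_003
    return total


def pipeline(n):
    # Hypothetical pipeline: build the data, then run the slow step.
    data = list(range(n))
    return slow_checksum(data)


def profile_call(func, *args):
    """Run func under cProfile and return (result, text report)."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    # Sort by cumulative time and keep only the top five entries.
    stats.sort_stats("cumulative").print_stats(5)
    return result, buffer.getvalue()


if __name__ == "__main__":
    result, report = profile_call(pipeline, 200_000)
    print(report)
```

In the printed report, the function that dominates cumulative time is the one worth measuring again after any optimization, whether that is a Rust extension or another technique.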

Specifying behavior with Protocols, Typeclasses or Traits. Who wears it better (Python, Scala 3, Rust)?

2023-04-17 · PyConDE & PyData Berlin 2023

talk

by Kolja Maier

Python Scala

In this talk, we will explore the use of Python's typing.Protocol, Scala's Typeclasses, and Rust's Traits. They all offer a powerful and elegant mechanism for abstracting over various concepts (such as serialization) in a modular manner. We will compare and contrast the syntax and implementation of these constructs in each language and discuss their strengths and weaknesses. We will also look at real-world examples of how these features are used in each language to specify behavior, and consider differences in terms of type system expressiveness and effectiveness. By the end of the talk, attendees will have a better understanding of the differences and similarities between these three language features, and will be able to make informed decisions about which one is best suited to their needs.
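As a rough illustration of the Python side of this comparison, the sketch below uses typing.Protocol to abstract over serialization. The names `Serializable`, `User`, `Point`, and `dump_all` are hypothetical and are not taken from the talk. Note that conformance here is structural: neither class inherits from the protocol, whereas a Rust trait needs an explicit `impl` block and a Scala 3 typeclass needs a `given` instance.

```python
import json
from dataclasses import asdict, dataclass
from typing import Protocol, runtime_checkable


@runtime_checkable
class Serializable(Protocol):
    """Hypothetical protocol: any object with a matching serialize() conforms."""

    def serialize(self) -> str: ...


@dataclass
class User:
    name: str
    age: int

    # No inheritance from Serializable; having the method is enough.
    def serialize(self) -> str:
        return json.dumps(asdict(self), sort_keys=True)


class Point:
    def __init__(self, x: float, y: float) -> None:
        self.x, self.y = x, y

    def serialize(self) -> str:
        return f"{self.x},{self.y}"


def dump_all(items: list[Serializable]) -> list[str]:
    # A static type checker accepts any argument whose elements
    # structurally match the protocol.
    return [item.serialize() for item in items]


if __name__ == "__main__":
    print(dump_all([User("Ada", 36), Point(1.0, 2.0)]))
```

The `runtime_checkable` decorator additionally allows `isinstance(obj, Serializable)`, which only checks that the method exists, not its signature.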

Polars - make the switch to lightning-fast dataframes

2023-04-17 · PyConDE & PyData Berlin 2023

talk

by Thomas Bierhance

AI/ML Arrow Pandas Polars Python

In this talk, we will report on our experiences switching from Pandas to Polars in a real-world ML project. Polars is a new high-performance dataframe library for Python based on Apache Arrow and written in Rust. We will compare the performance of Polars with the popular Pandas library, and show how Polars can provide significant speed improvements for data manipulation and analysis tasks. We will also discuss the unique features of Polars, such as its ability to handle large datasets that do not fit into memory, and how it feels in practice to make the switch from Pandas. This talk is aimed at data scientists, analysts, and anyone interested in fast and efficient data processing in Python.

Raised by Pandas, striving for more: An opinionated introduction to Polars

2023-04-17 · PyConDE & PyData Berlin 2023

talk

by Nico Kreiling

Arrow Pandas Polars Python

Pandas is the de facto standard for data manipulation in Python, which I personally love for its flexible syntax and interoperability. But Pandas has well-known drawbacks such as memory inefficiency, inconsistent handling of missing data, and a lack of multicore support. Multiple open-source projects aim to solve those issues, the most interesting of which is Polars.

Polars uses Rust and Apache Arrow to win in all kinds of performance benchmarks, and it evolves fast. But is it already stable enough to migrate an existing Pandas codebase? And does it meet the high expectations of long-time Pandas lovers for query-language flexibility?

In this talk, I will explain how Polars can be that fast, and present my insights on where Polars shines and in which scenarios I stay with Pandas (at least for now!).