talk-data.com talk-data.com

Meetup talk 2025-11-20 at 08:30

Optimize first, parallelize second: a better path to faster data processing

Topics

Description

You’re processing a large amount of data with Python, and your code is too slow. One obvious way to getting faster results is adding multithreading or multiprocessing, so you can use multiple CPU cores. Unfortunately, switching straight to parallelism is almost always premature, often unnecessary, and sometimes impossible.