talk-data.com

Topic

Python

programming_language data_science web_development

Activities

tagged

Activity Trend

185 peak/qtr

2020-Q1 2026-Q1

Top Events

O'Reilly Data Science Books 220 Data Engineering Podcast 183 O'Reilly Data Engineering Books 151 SciPy 2025 67 PyConDE & PyData Berlin 2023 49 Data + AI Summit 2025 30 Databricks DATA + AI Summit 2023 29 O'Reilly AI & ML Books 27 PyData Seattle 2025 23 PyData Paris 2025 20 O'Reilly Data Visualization Books 20 PyData London 2025 20

Top Speakers

Tobias Macey 183 Bryce Adelstein Lelbach (NVIDIA) 18 Conor Hoekstra 17 Harpreet Sahota (Voxel51) 15 Dan Gural (Voxel51) 14 Avery Smith 13 Kyle Polich 8 Al Martin (IBM) 7 Dr. Yasin Ceran (KAIST) 7 Luca Massaron 7 Julie Hoyer 6 Gleb Mezhanskiy (Datafold) 6

Activities

Showing filtered results

All Video Podcast Book

Filtering by: PyConDE & PyData Berlin 2023 ×

Postmodern Architecture: The Python Powered Modern Data Stack

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by John Sandall

Analytics Analytics Engineering API Data Analytics Data Engineering dbt ETL/ELT Modern Data Stack Pandas

The Modern Data Stack has brought a lot of new buzzwords into the data engineering lexicon: "data mesh", "data observability", "reverse ETL", "data lineage", "analytics engineering". In this light-hearted talk we will demystify the evolving revolution that will define the future of data analytics & engineering teams.

Our journey begins with the PyData Stack: pandas pipelines powering ETL workflows...clean code, tested code, data validation, perfect for in-memory workflows. As demand for self-serve analytics grows, new data sources bring more APIs to model, more code to maintain, DAG workflow orchestration tools, new nuances to capture ("the tax team defines revenue differently"), more dashboards, more not-quite-bugs ("but my number says this...").

This data maturity journey is a well-trodden path with common pitfalls & opportunities. After dashboards comes predictive modelling ("what will happen"), prescriptive modelling ("what should we do?"), perhaps eventually automated decision making. Getting there is much easier with the advent of the Python Powered Modern Data Stack.

In this talk, we will cover the shift from ETL to ELT, the open-source Modern Data Stack tools you should know, with a focus on how dbt's new Python integration is changing how data pipelines are built, run, tested & maintained. By understanding the latest trends & buzzwords, attendees will gain a deeper insight into Python's role at the core of the future of data engineering.

Rethinking codes of conduct

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Tereza Iofciu

Did you know that the Python Software Foundation Code of Conduct is turning 10 years old in 2023? It was voted in as they felt they were “unbalanced and not seeing the true spectrum of the greater community”. Why is that a big thing? Come to my talk and find out!

Cloud Infrastructure From Python Code: How Far Could We Go?

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Asher Sterkin , Etzik Bega

Cloud Computing DevOps

Discover how Infrastructure From Code (IfC) can revolutionize Cloud DevOps automation by generating cloud deployment templates directly from Python code. Learn how this technology empowers Python developers to easily deploy and operate cost-effective, secure, reliable, and sustainable cloud software. Join us to explore the strategic potential of IfC.

Introduction to Async programming

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Dishant Sethi

API JavaScript

Asynchronous programming is a type of parallel programming in which a unit of work is allowed to run separately from the primary application thread. Post execution, it notifies the main thread about the completion or failure of the worker thread. There are numerous benefits to using it, such as improved application performance, enhanced responsiveness, and effective usage of CPU.

Asynchronicity seems to be a big reason why Node.js is so popular for server-side programming. Most of the code we write, especially in heavy IO applications like websites, depends on external resources. This could be anything from a remote database POST API call. As soon as you ask for any of these resources, your code is waiting around for process completion with nothing to do. With asynchronous programming, you allow your code to handle other tasks while waiting for these other resources to respond.

In this session, we are going to talk about asynchronous programming in Python. Its benefits and multiple ways to implement it.

The Beauty of Zarr

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Sanket Verma

GitHub HTML

In this talk, I’d be talking about Zarr, an open-source data format for storing chunked, compressed N-dimensional arrays. This talk presents a systematic approach to understanding and implementing Zarr by showing how it works, the need for using it, and a hands-on session at the end. Zarr is based on an open technical specification, making implementations across several languages possible. I’d mainly talk about Zarr’s Python implementation and show how it beautifully interoperates with the existing libraries in the PyData stack.

The Battle of Giants: Causality vs NLP => From Theory to Practice

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Aleksander Molak

Microsoft NLP

With an average of 3.2 new papers published on Arxiv every day in 2022, causal inference has exploded in popularity, attracting large amount of talent and interest from top researchers and institutions including industry giants like Amazon or Microsoft. Text data, with its high complexity, posits an exciting challenge for causal inference community. In the workshop, we'll review the latest advances in the field of Causal NLP and implement a causal Transformer model to demonstrate how to translate these developments into a practical solution that can bring real business value. All in Python!

Behind the Scenes of tox: The Journey of Rewriting a Python Tool with more than 10 Million Monthly Downloads

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Jürgen Gmach

tox is a widely-used tool for automating testing in Python. In this talk, we will go behind the scenes of the creation of tox 4, the latest version of the tool. We will discuss the motivations for the rewrite, the challenges and lessons learned during the development process. We will have a look at the new features and improvements introduced in tox 4. But most importantly, you will get to know the maintainers.

You've got trust issues, we've got solutions: Differential Privacy

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Vikram Waradpande , Sarthika Dhawan

Big Data

As we are in an era of big data where large groups of information are assimilated and analyzed, for insights into human behavior, data privacy has become a hot topic. Since there is a lot of private information which once leaked can be misused, all data cannot be released for research. This talk aims to discuss Differential Privacy, a cutting-edge technique of cybersecurity that claims to preserve an individual’s privacy, how it is employed to minimize the risks with private data, its applications in various domains, and how Python eases the task of employing it in our models with PyDP.

How to connect your application to the world (and avoid sleepless nights)

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Luis Fernando Alvarez

API

Let’s say you are the ruler of a remote island. For it to succeed and thrive you can’t expect it to be isolated from the world. You need to establish trade routes, offer your products to other islands, and import items from them. Doing this will certainly make your economy grow! We’re not going to talk about land masses or commerce, however, you should think of your application as an island that needs to connect to other applications to succeed. Unfortunately, the sea is treacherous and is not always very consistent, similar to the networks you use to connect your application to the world.

We will explore some techniques and libraries in the Python ecosystem used to make your life easier while dealing with external services. From asynchronicity, caching, testing, and building abstractions on top of the APIs you consume, you will definitely learn some strategies to build your connected application gracefully, and avoid those pesky 2 AM errors that keep you awake.

Streamlit meets WebAssembly - stlite

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Yuichiro Tachibana

Streamlit, a pure-Python data app framework, has been ported to Wasm as "stlite". See its power and convenience with many live examples and explore its internals from a technical perspective. You will learn to quickly create interactive in-browser apps using only Python.

Modern typed python: dive into a mature ecosystem from web dev to machine learning

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by samsja

AI/ML Pydantic

Typing is at the center of „modern Python“, and tools (mypy, beartype) and libraries (FastAPI, SQLModel, Pydantic, DocArray) based on it are slowly eating the Python world.

This talks explores the benefits of Python type hints, and shows how they are infiltrating the next big domain: Machine Learning

Prompt Engineering 101: Beginner intro to LangChain, the shovel of our ChatGPT gold rush."

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Lev Konstantinovskiy

AI/ML LLM

A modern AI start-up is a front-end developer plus a prompt engineer" is a popular joke on Twitter. This talk is about LangChain, a Python open-source tool for prompt engineering. You can use it with completely open-source language models or ChatGPT. I will show you how to create a prompt and get an answer from LLM. As an example application, I will show a demo of an intelligent agent using web search and generating Python code to answer questions about this conference.

What are you yield from?

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Maxim Danilov

Many developers avoid using generators. For example, many well-known python libraries use lists instead of generators. The generators themselves are slower than normal list loops, but their use in code greatly increases the speed of the application. Let’s discover why.

Most of you don't need Spark. Large-scale data management on a budget with Python

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Guillem Borrell

Data Management Rust Spark Data Streaming

The Python data ecosystem has matured during the last decade and there are less and less reasons to rely only large batch process executed in a Spark cluster, but with every large ecosystem, putting together the key pieces of technology takes some effort. There are now better storage technologies, streaming execution engines, query planners, and low level compute libraries. And modern hardware is way more powerful than what you'd probably expect. In this workshop we will explore some global-warming-reducing techniques to build more efficient data transformation pipelines in Python, and a little bit of Rust.

Teaching Neural Networks a Sense of Geometry

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Jens Agerberg

AI/ML PyTorch Scikit-learn

By taking neural networks back to the school bench and teaching them some elements of geometry and topology we can build algorithms that can reason about the shape of data. Surprisingly these methods can be useful not only for computer vision – to model input data such as images or point clouds through global, robust properties – but in a wide range of applications, such as evaluating and improving the learning of embeddings, or the distribution of samples originating from generative models. This is the promise of the emerging field of Topological Data Analysis (TDA) which we will introduce and review recent works at its intersection with machine learning. TDA can be seen as being part of the increasingly popular movement of Geometric Deep Learning which encourages us to go beyond seeing data only as vectors in Euclidean spaces and instead consider machine learning algorithms that encode other geometric priors. In the past couple of years TDA has started to take a step out of the academic bubble, to a large extent thanks to powerful Python libraries written as extensions to scikit-learn or PyTorch.

Accelerating Python Code

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Jens Nie

Cyber Security

Python is a beautiful language for fast prototyping and and sketching ideas quickly. People often struggle to get their code into production though for various reasons. Besides of all security and safety concerns that usually are not addressed from the very beginning when playing around with an algorithmic idea, performance concerns are quite frequently a reason for not taking the Python code to the next level.

We will look at the "missing performance" worries using a simple numerical problem and how to speed the corresponding Python code up to top notch performance.

Building Hexagonal Python Services

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Shahriyar Rzayev

The importance of enterprise architecture patterns is all well-known and applicable to varied types of tasks. Thinking about the architecture from the beginning of the journey is crucial to have a maintainable, therefore testable, and flexible code base. In We are going to explore the Ports and Adapters(Hexagonal) pattern by showing a simple web app using Repository, Unit of Work, and Services(Use Cases) patterns tied together with Dependency Injection. All those patterns are quite famous in other languages but they are relatively new for the Python ecosystem, which is a crucial missing part. As a web framework, we are going to use FastAPI which can be replaced with any framework in a matter of time because of the abstractions we have added.

Monorepos with Python

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by AbdealiLoKo

CI/CD

Working with python is fun. Managing python packaging, linters, tests, CI, etc. is not as fun.

Every maintainer needs to worry about consistent styling, quality, speed of tests, etc as the project grows.

Monorepos have been successful in other communities - how does it work in Python ?

The Spark of Big Data: An Introduction to Apache Spark

2023-04-19 · PyConDE & PyData Berlin 2023

talk

by Pasha Finkelshteyn (Bellsoft)

API Big Data PySpark Spark SQL

Get ready to level up your big data processing skills! Join us for an introductory talk on Apache Spark, the distributed computing system used by tech giants like Netflix and Amazon. We'll cover PySpark DataFrames and how to use them. Whether you're a Python developer new to big data or looking to explore new technologies, this talk is for you. You'll gain foundational knowledge about Apache Spark and its capabilities, and learn how to leverage DataFrames and SQL APIs to efficiently process large amounts of data. Don't miss out on this opportunity to up your big data game!

Neo4j graph databases for climate policy

2023-04-18 · PyConDE & PyData Berlin 2023

talk

by Marcus Tedesco

Neo4j

In this talk we walkthrough our experience using Neo4j and Python to model climate policy as a graph database. We discuss how we did it, some of the challenges we faced, and what we learnt along the way!

Page 1 of 3

1 2 3