talk-data.com

Topic

Python

programming_language data_science web_development

Activities

tagged

Activity Trend

185 peak/qtr

2020-Q1 2026-Q1

Top Events

O'Reilly Data Science Books 220 Data Engineering Podcast 183 O'Reilly Data Engineering Books 151 SciPy 2025 67 PyConDE & PyData Berlin 2023 49 Data + AI Summit 2025 30 Databricks DATA + AI Summit 2023 29 O'Reilly AI & ML Books 27 PyData Seattle 2025 23 PyData Paris 2025 20 O'Reilly Data Visualization Books 20 PyData London 2025 20

Top Speakers

Tobias Macey 183 Bryce Adelstein Lelbach (NVIDIA) 18 Conor Hoekstra 17 Harpreet Sahota (Voxel51) 15 Dan Gural (Voxel51) 14 Avery Smith 13 Kyle Polich 8 Al Martin (IBM) 7 Dr. Yasin Ceran (KAIST) 7 Luca Massaron 7 Julie Hoyer 6 Gleb Mezhanskiy (Datafold) 6

Activities

Showing filtered results

All Video Podcast Book

Filtering by: SciPy 2025 ×

Reproducible Machine Learning Workflows for Scientists with pixi

2025-07-07 · SciPy 2025

talk

by John Kirkham , Ruben Arts , Matthew Feickert

AI/ML Linux PyTorch

Scientific researchers need reproducible software environments for complex applications that can run across heterogeneous computing platforms. Modern open source tools, like pixi, provide automatic reproducibility solutions for all dependencies while providing a high level interface well suited for researchers.

This tutorial will provide a practical introduction to using pixi to easily create scientific and AI/ML environments that benefit from hardware acceleration, across multiple machines and platforms. The focus will be on applications using the PyTorch and JAX Python machine learning libraries with CUDA enabled, as well as deploying these environments to production settings in Linux container images.

The-Silmaril: Practice #ontology engineering with Python (and other languages).

2025-07-07 · SciPy 2025

talk

by Shaurya Agarwal

Pandas PySpark SciPy

Ontologies provide a powerful way to structure knowledge, enable reasoning, and support more meaningful queries compared to traditional data models. Recently, interest in ontologies has resurged, driven by advancements in language models, reasoning capabilities, and the growing adoption of platforms like Palantir Foundry.

In this hands-on tutorial, participants will explore ontology development across multiple domains using a variety of Python-based tools such as rdflib, Owlready2, PySpark, Pandas, and SciPy. They will learn how ontologies facilitate semantic reasoning, improve data interoperability, and enhance query capabilities.
Additionally, attendees will build a rudimentary reasoning engine to better understand inference mechanisms.
The tutorial emphasizes practical applications and comparisons with conventional data representations, making it ideal for researchers, data engineers, and developers interested in knowledge representation and reasoning.

All the SQL a Pythonista needs to know: an introduction to SQL and DataFrames with DuckDB

2025-07-07 · SciPy 2025

talk

by Guen Prawiroatmodjo , Jacob Matson (MotherDuck) , Alex Monahan (MotherDuck)

Cloud Computing DuckDB HTML Pandas Polars SQL

Structured Query Language (or SQL for short) is a programming language to manage data in a database system and an essential part of any data engineer’s tool kit. In this tutorial, you will learn how to use SQL to create databases, tables, insert data into them and extract, filter, join data or make calculations using queries. We will use DuckDB, a new open source embedded in-process database system that combines cutting edge database research with dataframe-inspired ease of use. DuckDB is only a pip install away (with zero dependencies), and runs right on your laptop. You will learn how to use DuckDB with your existing Python tools like Pandas, Polars, and Ibis to simplify and speed up your pipelines. Lastly, you will learn how to use SQL to create fast, interactive data visualizations, and how to teach your data how to fly and share it via the Cloud.

Building with LLMs Made Simple

2025-07-07 · SciPy 2025

talk

by Eric Ma

LLM

In this tutorial, you will learn how to integrate Large Language Models (LLMs) directly into Python programs as thoughtfully-designed core components of the program rather than bolt-on additions. This hands-on session teaches design principles and practical techniques for incorporating LLM outputs into program control flow. We will use LlamaBot, an open-source Python interface to LLMs, focusing on local execution with local and efficient models.

The Accelerated Python Developer's Toolbox

2025-07-07 · SciPy 2025

talk

by Dr. Katrina Riehl (NumFOCUS; Snowflake; Georgetown University)

As general purpose GPU programming has risen in popularity, many Python programmers have expressed a need to use this technology in their libraries and applications. They soon realize that the GPU landscape is vast and sometimes difficult to traverse for Python users.

In this talk, I will demystify the CUDA-enabled Accelerated Python landscape, focusing on the advantages and disadvantages of popular libraries, the common performance issues encountered, and the best practices to getting the most out of your GPU. Topics include CuPy, numba, nvmath-python, cuDF, and cuML.

This talk is beginner-friendly, but even the most seasoned programmer will gain insight into the Python GPU computing landscape.

Thinking in arrays

2025-07-07 · SciPy 2025

talk

by Peter Fackeldey , Jim Pivarski

NumPy

Despite its reputation for being slow, Python is the leading language of scientific computing, which generally needs large-scale (fast) computations. This is because most scientific problems can be split into "metadata bookkeeping" and "number crunching," where the latter is performed by array-oriented (vectorized) calls into precompiled routines.

This tutorial is an introduction to array-oriented programming. We'll focus on techniques that are equally useful in any array library, with a particular focus on NumPy and JAX. You'll work in groups on four class projects: Conway's Game of Life using arrays, iterative computations on arrays, just-in-time (JIT) compilation for the Mandelbrot set, and exploring data in ragged arrays.

Vega-Altair: A Structured Way to Build Interactive Charts

2025-07-07 · SciPy 2025

talk

by Jon Mease , Dylan Wootton

API DataViz

This tutorial is an introduction to data visualization using the popular Vega-Altair Python library. Vega-Altair provides a simple and expressive API, enabling authors to rapidly create a wide range of interactive charts.

Participants will explore the fundamentals of effective chart design and gain hands-on experience building a variety of visualizations using Vega-Altair's declarative API. Furthermore, this tutorial will introduce users to advanced topics such as data transformations and interaction design. We will finish off by covering practical workflows such as integrating Vega-Altair into dashboarding systems, publishing visualizations, and creating reusable, themed charting libraries. By the end of the session, attendees will have the skills to leverage Vega-Altair for both rapid prototyping and production-ready visualizations in diverse environments

Page 4 of 4

← Previous

1 2 3 4