Topic

AWS Glue

etl data_catalog aws

Activities

1

tagged

Activity Trend

10 peak/qtr

2020-Q1 2026-Q2

Top Events

AWS re:Invent 2024 15 O'Reilly Data Engineering Books 6 Data + AI Summit 2025 3 Data Engineering Podcast 2 O'Reilly Data Science Books 2 Databricks DATA + AI Summit 2023 2 Leaders of Analytics 1 Data Expo NL 2025 1 dbt Coalesce 2025 1 PyData Amsterdam 2025 1 Airflow Summit 2023 1 Experiencing Data w/ Brian T. O’Neill (AI & data product management leadership—powered by UX design) 1

Top Speakers

Noritaka Sekiyama (Amazon Web Services (AWS)) 3 Leonardo Gomez (AWS) 2 Tobias Macey 2 Viquar Khan 1 James T. McClave 1 Jason Williams 1 Trâm Ngọc Phạm 1 Gurmeet Saran (Anthropic) 1 Aaron Wishnick 1 Alvaro Videla 1 Luis Campos (AWS) 1 Navneet Srivastava (Amazon Web Services) 1

Activities

Showing filtered results

All Video Podcast Book

Filtering by: PyData Amsterdam 2025 ×

Composable Pipelines for ML: Automating Feature Engineering with Hopsworks’ Brewer

2025-09-26 · PyData Amsterdam 2025

talk

by Javier de la Rúa Martínez (Hopsworks)

AI/ML

Operationalizing ML isn’t just about models — it’s about moving and engineering data. At Hopsworks, we built a composable AI pipeline builder (Brewer) based on two principles: Tasks and Data Sources. This lets users define workflows that automatically analyse, clean, create and update feature groups, without glue code or brittle scheduling logic.

In this talk, we’ll show how Brewer drives the automation of feature engineering, enabling reproducible, declarative pipelines that respond to changes in upstream data. We’ll explore how this fits into broader ML workflows, from ingestion to feature materialization, and how it integrates with warehouses, streams, and file-based systems.