talk-data.com

Topic

AWS Glue

etl data_catalog aws

Filtering by: PyData Amsterdam 2025

Operationalizing ML isn’t just about models; it’s about moving and engineering data. At Hopsworks, we built a composable AI pipeline builder (Brewer) based on two principles: Tasks and Data Sources. This lets users define workflows that automatically analyse, clean, create, and update feature groups, without glue code or brittle scheduling logic.

In this talk, we’ll show how Brewer drives the automation of feature engineering, enabling reproducible, declarative pipelines that respond to changes in upstream data. We’ll explore how this fits into broader ML workflows, from ingestion to feature materialization, and how it integrates with warehouses, streams, and file-based systems.
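
To make the idea of a declarative pipeline that reacts to upstream changes concrete, here is a minimal Python sketch. It is not Brewer's API or the Hopsworks client: the names `DataSource`, `Task`, and `refresh`, and the version-counter approach to change detection, are all hypothetical and chosen only for illustration. It also assumes tasks are declared in dependency order.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class DataSource:
    """Hypothetical named dataset; `version` increases on every update."""
    name: str
    rows: List[dict] = field(default_factory=list)
    version: int = 0

    def update(self, rows: List[dict]) -> None:
        self.rows = rows
        self.version += 1


@dataclass
class Task:
    """Hypothetical declared step: read `inputs`, apply `fn`, write `output`."""
    name: str
    inputs: List[DataSource]
    output: DataSource
    fn: Callable[..., List[dict]]
    # Input versions observed at the last run, keyed by source name.
    seen: Dict[str, int] = field(default_factory=dict)

    def is_stale(self) -> bool:
        # Stale if any input changed (or was never seen) since the last run.
        return any(self.seen.get(s.name) != s.version for s in self.inputs)

    def run(self) -> None:
        self.output.update(self.fn(*(s.rows for s in self.inputs)))
        self.seen = {s.name: s.version for s in self.inputs}


def refresh(tasks: List[Task]) -> List[str]:
    """Run stale tasks in declaration order; return the names that ran."""
    ran = []
    for task in tasks:
        if task.is_stale():
            task.run()
            ran.append(task.name)
    return ran


def drop_missing(rows: List[dict]) -> List[dict]:
    # Cleaning step: discard rows without an amount.
    return [r for r in rows if r.get("amount") is not None]


def total_per_customer(rows: List[dict]) -> List[dict]:
    # Feature step: sum amounts per customer.
    totals: Dict[str, int] = {}
    for r in rows:
        totals[r["customer"]] = totals.get(r["customer"], 0) + r["amount"]
    return [{"customer": c, "total": t} for c, t in sorted(totals.items())]


raw = DataSource("raw_orders")
clean = DataSource("clean_orders")
features = DataSource("order_features")

tasks = [
    Task("clean", [raw], clean, drop_missing),
    Task("featurize", [clean], features, total_per_customer),
]

raw.update([
    {"customer": "a", "amount": 10},
    {"customer": "a", "amount": None},
    {"customer": "b", "amount": 5},
])
first = refresh(tasks)   # upstream data arrived, so both tasks run
second = refresh(tasks)  # nothing changed upstream, so nothing runs
```

In this sketch the user only declares what each task reads and writes; the decision of when to run falls out of comparing input versions, which stands in for the hand-written scheduling logic the abstract describes avoiding.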