Topic

TensorFlow

machine_learning deep_learning neural_networks

Activities

2

tagged

Activity Trend

10 peak/qtr

2020-Q1 to 2026-Q2

Top Events

O'Reilly AI & ML Books (9)
O'Reilly Data Engineering Books (8)
O'Reilly Data Science Books (6)
Data Engineering Podcast (5)
Databricks DATA + AI Summit 2023 (4)
Airflow Summit 2022 (3)
ADSP: Algorithms + Data Structures = Programs (2)
Google Cloud Next '25 (2)
DataTalks.Club (2)
Airflow Summit 2020 (1)
PyData Berlin 2025 (1)
Women in AI and Data Science Conference 2025 (1)

Top Speakers

Formateur expert (6)
Tobias Macey (5)
Bryce Adelstein Lelbach (NVIDIA) (2)
Francois Chollet (2)
David Cardozo (Updata) (2)
Conor Hoekstra (2)
Daniel Imberman (2)
Fabio Nelli (2)
Dave Abrahams (Adobe) (2)
Louis Lacombe (1)
ASHISH PATEL (1)
Kyle Polich (1)

Activities

Filtered by: Daniel Imberman

Apache Airflow and Ray: Orchestrating ML at Scale

2021-07-01 · Airflow Summit 2021

session

by Daniel Imberman

AI/ML Airflow Pandas

As the Apache Airflow project grows, we look both for ways to incorporate emerging technologies and for new ways to expose them to our users. Ray is one of the fastest-growing distributed computation systems on the market today. In this talk, we will introduce the Ray decorator and Ray backend. These features, built with the help of the Ray maintainers at Anyscale, will allow data scientists to natively integrate their distributed pandas, XGBoost, and TensorFlow jobs into their Airflow pipelines with a single decorator. By combining the orchestration of Airflow with the distributed computation of Ray, this integration opens up a whole host of new possibilities for Airflow users designing their pipelines.

Machine Learning with Apache Airflow

2020-07-01 · Airflow Summit 2020

session

by Daniel Imberman

AI/ML Airflow Cloud Computing Data Science GCP Cloud Functions Cyber Security Spark

As the field of data science grows in popularity, companies find themselves in need of a single common language that can connect their data science teams and data infrastructure teams. Data scientists want rapid iteration, infrastructure engineers want monitoring and security controls, and product owners want their solutions deployed in time for quarterly reports. This talk will discuss how to build an Airflow-based data platform that can take advantage of popular ML tools (Jupyter, TensorFlow, Spark) while creating an ecosystem that is easy for data infrastructure and support teams to manage and monitor. In this talk, we will take an idea from a single-machine Jupyter Notebook to a cross-service Spark + TensorFlow pipeline, and then to a canary-tested, production-ready model served on Google Cloud Functions. We will show how Apache Airflow can connect all layers of a data team to deliver rapid results.