Automatic Speech Recognition is a compute-intensive task that depends on complex deep learning models. To do this at scale, we leveraged the power of TensorFlow, Kubernetes, and Airflow. In this session, you will learn about our journey to tackle this problem, the main challenges we faced, and how Airflow made it possible to create a solution that is powerful, yet simple and flexible.
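A minimal sketch of the pattern this abstract describes, assuming Airflow acts purely as the scheduler and each ASR batch runs in its own Kubernetes pod via the KubernetesPodOperator. The image name, script, paths, and schedule are hypothetical placeholders, not details from the talk.

```python
# Sketch: Airflow schedules the work; Kubernetes runs the TensorFlow ASR job.
# Image, script, paths, and schedule below are hypothetical placeholders.
from datetime import datetime

from airflow import DAG
from airflow.providers.cncf.kubernetes.operators.pod import KubernetesPodOperator

with DAG(
    dag_id="asr_batch_transcription",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    transcribe = KubernetesPodOperator(
        task_id="transcribe_audio_batch",
        name="asr-worker",
        namespace="ml-jobs",
        image="example.registry/asr-tensorflow:latest",  # hypothetical image
        cmds=["python", "transcribe.py"],
        arguments=[
            "--input", "gs://bucket/audio/{{ ds }}/",
            "--output", "gs://bucket/transcripts/{{ ds }}/",
        ],
        get_logs=True,
    )
```

Note the import path for KubernetesPodOperator depends on the version of the cncf.kubernetes provider package; older releases expose it under `operators.kubernetes_pod`.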
Topic: TensorFlow
TensorFlow Extended (TFX) can run machine learning pipelines on Airflow, but by default every step runs in the same workers where the Airflow DAG is running. This can lead to excessive resource usage, and it breaks the assumption that Airflow is a scheduler: it also becomes the data processing platform. In this session, we will see how to use TFX with third-party services on top of Google Cloud Platform. The data processing steps can run in Dataflow, Spark, Flink, and other Apache Beam runners (parallelizing the processing of data and scaling up to petabytes), and the training steps can run in Vertex AI or other external services. After this workshop, you will have learned how to move any heavyweight TFX computation outside Airflow, while keeping Airflow as the orchestrator for your machine learning pipelines.
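A minimal sketch of this approach, assuming a TFX pipeline driven by Airflow: the `beam_pipeline_args` route the Beam-based steps to Dataflow instead of the Airflow workers. The project, bucket, and component choices are placeholders, and exact module paths vary across TFX versions.

```python
# Sketch: TFX pipeline orchestrated by Airflow, with Beam steps on Dataflow.
# Project, bucket, and component details are placeholders.
import datetime

from tfx.components import CsvExampleGen
from tfx.orchestration import pipeline
from tfx.orchestration.airflow.airflow_dag_runner import (
    AirflowDagRunner,
    AirflowPipelineConfig,
)

# Placeholder component; a real pipeline would chain Transform, Trainer, etc.
example_gen = CsvExampleGen(input_base="gs://my-bucket/data")

tfx_pipeline = pipeline.Pipeline(
    pipeline_name="tfx_on_dataflow",
    pipeline_root="gs://my-bucket/tfx-root",
    components=[example_gen],
    # Route the Beam-based data processing steps to Dataflow instead of
    # running them inside the Airflow workers.
    beam_pipeline_args=[
        "--runner=DataflowRunner",
        "--project=my-gcp-project",
        "--region=us-central1",
        "--temp_location=gs://my-bucket/tmp",
    ],
)

# AirflowDagRunner compiles the TFX pipeline into an Airflow DAG.
DAG = AirflowDagRunner(
    AirflowPipelineConfig(
        airflow_dag_config={
            "schedule_interval": None,
            "start_date": datetime.datetime(2024, 1, 1),
        }
    )
).run(tfx_pipeline)
```

Swapping `--runner=DataflowRunner` for `SparkRunner` or `FlinkRunner` (with the matching cluster options) moves the same steps onto those engines instead.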
At Credit Karma, we enable financial progress for more than 100 million members by recommending personalized financial products to them when they interact with our application. In this talk we introduce our machine learning platform for building interactive, production model-building workflows that serve relevant financial products to Credit Karma users. Vega, Credit Karma's machine learning platform, has three major components: 1) QueryProcessor for feature and training data generation, backed by Google BigQuery; 2) PipelineProcessor for feature transformations, offline scoring, and model analysis, backed by Apache Beam; and 3) ModelProcessor for running TensorFlow and scikit-learn models, backed by Google AI Platform, which gives data scientists the flexibility to explore different kinds of machine learning and deep learning models, ranging from gradient-boosted trees to neural networks with complex structures. Vega exposes a unified Python API for feature generation, modeling ETL, model training, and model analysis. Vega supports writing interactive notebooks and Python scripts to run these components in local mode with sampled data and in cloud mode for large-scale distributed computing. Vega lets data scientists chain these processors through Python code to define the entire workflow, then automatically generates the execution plan for deploying the workflow on Apache Airflow for running offline model experiments and refreshes. Overall, with the unified Python API and automated Airflow DAG generation, Vega has improved the efficiency of ML engineering. Using Airflow, we deploy more than 20K features and 100 models daily.
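Vega is Credit Karma's internal platform, so its API is not public; the following is only an illustrative sketch of the pattern the abstract describes (chain processors in Python, run them locally on sampled data, then compile the chain into an Airflow DAG). Every class and method name here is a hypothetical stand-in.

```python
# Illustrative sketch only: all names are hypothetical stand-ins for the
# chain-then-compile pattern described in the abstract, not Vega's real API.
from dataclasses import dataclass, field
from datetime import datetime
from typing import Callable, List


@dataclass
class Processor:
    name: str
    run: Callable[[], None]  # local-mode execution, e.g. on sampled data


@dataclass
class Workflow:
    processors: List[Processor] = field(default_factory=list)

    def chain(self, processor: Processor) -> "Workflow":
        self.processors.append(processor)
        return self

    def run_local(self) -> None:
        # Interactive/notebook mode: execute each step in-process.
        for p in self.processors:
            p.run()

    def to_airflow_dag(self, dag_id: str):
        # Cloud mode: emit one Airflow task per processor, chained in order.
        from airflow import DAG
        from airflow.operators.python import PythonOperator

        dag = DAG(dag_id=dag_id, start_date=datetime(2024, 1, 1), schedule=None)
        previous = None
        for p in self.processors:
            task = PythonOperator(task_id=p.name, python_callable=p.run, dag=dag)
            if previous is not None:
                previous >> task
            previous = task
        return dag


# Hypothetical usage mirroring Vega's three components:
workflow = (
    Workflow()
    .chain(Processor("query_processor", lambda: print("generate features")))
    .chain(Processor("pipeline_processor", lambda: print("transform & score")))
    .chain(Processor("model_processor", lambda: print("train & analyze")))
)
dag = workflow.to_airflow_dag("vega_model_refresh")
```

The appeal of this design is that the same Python chain drives both modes: `run_local()` for fast iteration on sampled data, and `to_airflow_dag()` for scheduled, large-scale runs.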