Topic

Scikit-learn

machine_learning data_science data_analysis

Activities: 3 tagged

Activity Trend: peak of 6 activities per quarter (2020-Q1 to 2026-Q2)

Top Events

O'Reilly Data Science Books (25)
O'Reilly AI & ML Books (6)
PyData Paris 2025 (5)
SciPy 2025 (3)
Data + AI Summit 2025 (3)
PyData Paris 2024 (3)
DataTalks.Club (3)
Databricks DATA + AI Summit 2023 (2)
Data Engineering Podcast (2)
O'Reilly Data Visualization Books (2)
PyConDE & PyData Berlin 2023 (2)
Big Data & AI Paris 2025 (2)

Top Speakers

Andrew Worsley (2)
Robert Thas John (2)
Robert Johansson (2)
Thomas Joseph (2)
Dr. Samuel Asare (2)
Daniel Y. Chen (2)
Stephen Klosterman (2)
Fabio Nelli (2)
Anthony So (2)
Jake VanderPlas (2)
Kirthi Raman (2)
Guillaume Lemaitre (scikit-learn) (2)

Activities

Filtered by: Data + AI Summit 2025

Machine Learning Model Deployment

2025-06-10 · Data + AI Summit 2025

talk

AI/ML Data Lakehouse Databricks Delta Python

This course introduces three primary machine learning deployment strategies and shows how to implement each of them on Databricks. After covering the fundamentals of model deployment, it turns to batch inference, with hands-on demonstrations and labs on using a model in batch inference scenarios and considerations for performance optimization. The second part of the course covers pipeline deployment, and the final segment focuses on real-time deployment: participants deploy models with Model Serving and use the serving endpoint for real-time inference. By working through deployment strategies for a variety of use cases, learners gain the practical skills needed to move machine learning models from experimentation to production, whether that means automating decisions in real time or integrating insights into data pipelines.

Pre-requisites: Familiarity with the Databricks workspace and notebooks; familiarity with Delta Lake and the Lakehouse; intermediate-level knowledge of Python (e.g. common Python libraries for DS/ML such as scikit-learn); awareness of model deployment strategies.

Labs: Yes

Certification Path: Databricks Certified Machine Learning Associate

Machine Learning Model Development

2025-06-09 · Data + AI Summit 2025

talk

AI/ML Data Lakehouse Databricks Delta Python

In this course, you’ll learn how to develop traditional machine learning models on Databricks. We’ll cover using popular ML libraries, executing common tasks efficiently with AutoML and MLflow, tracking model training with Databricks' capabilities, leveraging feature stores for model development, and implementing hyperparameter tuning. The course also covers AutoML for rapid, low-code model training, so that participants gain practical, real-world skills for streamlined machine learning model development in the Databricks environment.

Pre-requisites: Familiarity with the Databricks workspace and notebooks; familiarity with Delta Lake and the Lakehouse; intermediate-level knowledge of Python (e.g. common Python libraries for DS/ML such as scikit-learn, fundamental ML algorithms like regression and classification, model evaluation with common metrics).

Labs: Yes

Certification Path: Databricks Certified Machine Learning Associate

Data Preparation for Machine Learning

2025-06-09 · Data + AI Summit 2025

talk

AI/ML DataViz Databricks Matplotlib Pandas PySpark Python

In this course, you’ll learn the fundamentals of preparing data for machine learning using Databricks. We’ll cover exploring, cleaning, and organizing data for traditional machine learning applications, as well as data visualization, feature engineering, and feature storage strategies. By building a strong foundation in data preparation, this course equips you with the skills to create high-quality datasets that can power accurate and reliable machine learning and AI models, whether you're developing predictive models or enabling downstream AI applications.

Pre-requisites: Familiarity with the Databricks workspace, notebooks, and Unity Catalog; intermediate-level knowledge of Python (scikit-learn, Matplotlib), Pandas, and PySpark; familiarity with the concepts of exploratory data analysis, feature engineering, standardization, and imputation methods.

Labs: Yes

Certification Path: Databricks Certified Machine Learning Associate