talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (23 results)

See all 23 →
Showing 4 results

Activities & events

Title & Speakers Event

📢 PyData Milano is back with another exciting event, hosted at Amazon Italia! This time, we’re diving into two major topics shaping the future of data science and software development. 📍 Location: Amazon Italia, Viale Monte Grappa 3/5, 20124 Milano 🕕 Time: 18:30

Agenda

18:30 – Doors open 19:00 – Welcome from PyData Milano 19:10 – Talk 1: Developments in the Scikit-Learn Ecosystem – Marie Sacksick & Guillaume Lemaître (Probabl.) 19:50 – Talk 2: AI Coding Agents and How to Code Them – Alex Shershebnev (Zencoder) 20:30 – Networking & Aperitivo 🍹

Talk Details

Developments in the Scikit-Learn Ecosystem: Going Beyond .fit(X, y).predict(X)

Scikit-learn has long been the go-to library for predictive modeling, but data science is much more than just training models. In this talk, we’ll explore new tools that enhance the entire machine learning workflow—from data preparation with skrub, to experiment tracking with skore, to production-ready model deployment with skops. Expect demos, insights, and a fresh perspective on how the scikit-learn ecosystem is evolving. 👩‍💻 Speakers: Marie Sacksick (Product Engineer @ Probabl., WiMLDS Paris) Guillaume Lemaître (Open-Source Engineer @ Probabl.)

AI Coding Agents and How to Code Them

AI coding agents promise a new paradigm in software development, going beyond simple autocomplete to autonomous task execution. In this talk, we’ll break down what makes them different from traditional coding assistants, showcase live coding examples, and explore how they might shape the future of programming. By the end, you’ll be ready to integrate AI agents into your workflow—starting now. 🧑‍💻 Speaker: Alex Shershebnev (Head of ML/DevOps @ Zencoder) Join us for a deep dive into the future of machine learning and AI-powered coding, and don’t miss the chance to network over an aperitivo after the talks! 🔗 Register now and spread the word! 🚀

PyDataMilano #MachineLearning #AI #ScikitLearn #DataScience #AIAgents

Beyond .fit(): Scikit-Learn Innovations & The Rise of AI Coding Agents

In this podcast episode, we talked with Guillaume Lemaître about navigating scikit-learn and imbalanced-learn.

🔗 CONNECT WITH Guillaume Lemaître LinkedIn - https://www.linkedin.com/in/guillaume-lemaitre-b9404939/ Twitter - https://x.com/glemaitre58 Github - https://github.com/glemaitre Website - https://glemaitre.github.io/

🔗 CONNECT WITH DataTalksClub Join the community - https://datatalks-club.slack.com/join/shared_invite/zt-2hu0sjeic-ESN7uHt~aVWc8tD3PefSlA#/shared-invite/email Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/u/0/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ Check other upcoming events - https://lu.ma/dtc-events LinkedIn - https://www.linkedin.com/company/datatalks-club/ Twitter - https://twitter.com/DataTalksClub Website - https://datatalks.club/

🔗 CONNECT WITH ALEXEY Twitter - https://twitter.com/Al_Grigor Linkedin - https://www.linkedin.com/in/agrigorev/

🎙 ABOUT THE PODCAST At DataTalksClub, we organize live podcasts that feature a diverse range of guests from the data field. Each podcast is a free-form conversation guided by a prepared set of questions, designed to learn about the guests’ career trajectories, life experiences, and practical advice. These insightful discussions draw on the expertise of data practitioners from various backgrounds.

We stream the podcasts on YouTube, where each session is also recorded and published on our channel, complete with timestamps, a transcript, and important links.

You can access all the podcast episodes here - https://datatalks.club/podcast.html

📚Check our free online courses ML Engineering course - http://mlzoomcamp.com Data Engineering course - https://github.com/DataTalksClub/data-engineering-zoomcamp MLOps course - https://github.com/DataTalksClub/mlops-zoomcamp Analytics in Stock Markets - https://github.com/DataTalksClub/stock-markets-analytics-zoomcamp LLM course - https://github.com/DataTalksClub/llm-zoomcamp Read about all our courses in one place - https://datatalks.club/blog/guide-to-free-online-courses-at-datatalks-club.html

👋🏼 GET IN TOUCH If you want to support our community, use this link - https://github.com/sponsors/alexeygrigorev

If you're a company and want to support us, contact at [email protected]

AI/ML Analytics Data Engineering GitHub HTML LLM MLOps Scikit-learn
DataTalks.Club

Navigating scikit-learn and imbalanced-learn - Guillaume Lemaître

About the event

Outline:

  • Working as a core developer on scikit-learn
  • Working as a core developer on imbalance-learn

About the speaker

Guillaume is an open-source software engineer working at :probably. He is a core maintainer of the scikit-learn and imbalanced-learn libraries. He holds a PhD in medical imaging.

DataTalks.Club is the place to talk about data. Join our slack community!

Working as a Core Developer in the Scikit Learn Universe

Mark your calendar for the next session of the PyData Paris Meetup, on March 21st 2024. This Meetup will be hosted by Scaleway, Europe's empowering cloud provider at the Iliad group office, 16 rue de la ville l'evêque 75008 Paris. The speakers for this session, that will be dedicated to Taipy are Alexandre Sajus and Florian Jacta.

Schedule 7:00pm - 7:15pm: Community announcements & short address by Fred Bardolle, Lead Product Manager AI at Scaleway. 7:15pm - 7:45pm: Get the best from your scikit-learn classifier: trusted probabilities and optimal binary decision, Guillaume Lemaître 7:45pm - 8:30pm: Deploy your Data Project on the Web using only Python, Alexandre Sajus & Florian Jacta 8:30pm - 9:30pm: Buffet

Speakers

Alexandre Sajus is a customer success engineer at Taipy. He graduated with a Master's of Engineering from Centrale Paris. Florian Jacta is a data scientist and community manager at Taipy.

Guillaume Lemaitre is an open-source scientific software developer at :probabl. and a core developer of the scikit-learn project.

Abstracts

Deploy your Data Project on the Web using only Python, Alexandre Sajus & Florian Jacta In the Python ecosystem, many packages are available for running algorithms, training models, and visualizing data. Despite this, over 85% of data science projects stay at the proof-of-concept stage and never reach the production stage. With Taipy, Python developers can build great pilots as well as stunning production-ready web applications designed for end-users.

Get the best from your scikit-learn classifier: trusted probabilties and optimal binary decision, Guillaume Lemaitre When operating a classifier in a production setting (i.e. predictive phase), practitioners are interested in potentially two different outputs: a "hard" decision used to leverage a business decision or/and a "soft" decision to get a confidence score linked to each potential decision (e.g. usually related to class probabilities). Scikit-learn does not provide any flexibility to go from "soft" to "hard" predictions: it uses a cut-off point at a confidence score of 0.5 (or 0 when using decision_function) to get class labels. However, optimizing a classifier to get a confidence score close to the true probabilities (i.e. a calibrated classifier) does not guarantee to obtain accurate "hard" predictions using this heuristic. Reversely, training a classifier for an optimum "hard" prediction accuracy (with the cut-off constraint at 0.5) does not guarantee obtaining a calibrated classifier. In this talk, we will present a new scikit-learn meta-estimator allowing us to get the best of the two worlds: a calibrated classifier providing optimum "hard" predictions. This meta-estimator will land in a future version of scikit-learn: https://github.com/scikit-learn/scikit-learn/pull/26120. We will provide some insights regarding the way to obtain accurate probabilities and predictions and also illustrate how to use in practice this model on different use cases: cost-sensitive problems and imbalanced classification problems.

PyData Paris - March 2024 Meetup
Showing 4 results