talk-data.com
People (23 results)
See all 23 →Activities & events
| Title & Speakers | Event |
|---|---|
|
Beyond .fit(): Scikit-Learn Innovations & The Rise of AI Coding Agents
2025-03-19 · 17:30
📢 PyData Milano is back with another exciting event, hosted at Amazon Italia! This time, we’re diving into two major topics shaping the future of data science and software development. 📍 Location: Amazon Italia, Viale Monte Grappa 3/5, 20124 Milano 🕕 Time: 18:30 Agenda18:30 – Doors open 19:00 – Welcome from PyData Milano 19:10 – Talk 1: Developments in the Scikit-Learn Ecosystem – Marie Sacksick & Guillaume Lemaître (Probabl.) 19:50 – Talk 2: AI Coding Agents and How to Code Them – Alex Shershebnev (Zencoder) 20:30 – Networking & Aperitivo 🍹 Talk DetailsDevelopments in the Scikit-Learn Ecosystem: Going Beyond
|
Beyond .fit(): Scikit-Learn Innovations & The Rise of AI Coding Agents
|
|
Guillaume Lemaître
– guest
,
Alexey Grigorev
– guest
In this podcast episode, we talked with Guillaume Lemaître about navigating scikit-learn and imbalanced-learn. 🔗 CONNECT WITH Guillaume Lemaître LinkedIn - https://www.linkedin.com/in/guillaume-lemaitre-b9404939/ Twitter - https://x.com/glemaitre58 Github - https://github.com/glemaitre Website - https://glemaitre.github.io/ 🔗 CONNECT WITH DataTalksClub Join the community - https://datatalks-club.slack.com/join/shared_invite/zt-2hu0sjeic-ESN7uHt~aVWc8tD3PefSlA#/shared-invite/email Subscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/u/0/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQ Check other upcoming events - https://lu.ma/dtc-events LinkedIn - https://www.linkedin.com/company/datatalks-club/ Twitter - https://twitter.com/DataTalksClub Website - https://datatalks.club/ 🔗 CONNECT WITH ALEXEY Twitter - https://twitter.com/Al_Grigor Linkedin - https://www.linkedin.com/in/agrigorev/ 🎙 ABOUT THE PODCAST At DataTalksClub, we organize live podcasts that feature a diverse range of guests from the data field. Each podcast is a free-form conversation guided by a prepared set of questions, designed to learn about the guests’ career trajectories, life experiences, and practical advice. These insightful discussions draw on the expertise of data practitioners from various backgrounds. We stream the podcasts on YouTube, where each session is also recorded and published on our channel, complete with timestamps, a transcript, and important links. You can access all the podcast episodes here - https://datatalks.club/podcast.html 📚Check our free online courses ML Engineering course - http://mlzoomcamp.com Data Engineering course - https://github.com/DataTalksClub/data-engineering-zoomcamp MLOps course - https://github.com/DataTalksClub/mlops-zoomcamp Analytics in Stock Markets - https://github.com/DataTalksClub/stock-markets-analytics-zoomcamp LLM course - https://github.com/DataTalksClub/llm-zoomcamp Read about all our courses in one place - https://datatalks.club/blog/guide-to-free-online-courses-at-datatalks-club.html 👋🏼 GET IN TOUCH If you want to support our community, use this link - https://github.com/sponsors/alexeygrigorev If you're a company and want to support us, contact at [email protected] |
DataTalks.Club |
|
Working as a Core Developer in the Scikit Learn Universe
2024-06-27 · 10:30
Navigating scikit-learn and imbalanced-learn - Guillaume Lemaître About the event Outline:
About the speaker Guillaume is an open-source software engineer working at :probably. He is a core maintainer of the scikit-learn and imbalanced-learn libraries. He holds a PhD in medical imaging. DataTalks.Club is the place to talk about data. Join our slack community! |
Working as a Core Developer in the Scikit Learn Universe
|
|
PyData Paris - March 2024 Meetup
2024-03-21 · 18:00
Mark your calendar for the next session of the PyData Paris Meetup, on March 21st 2024. This Meetup will be hosted by Scaleway, Europe's empowering cloud provider at the Iliad group office, 16 rue de la ville l'evêque 75008 Paris. The speakers for this session, that will be dedicated to Taipy are Alexandre Sajus and Florian Jacta. Schedule 7:00pm - 7:15pm: Community announcements & short address by Fred Bardolle, Lead Product Manager AI at Scaleway. 7:15pm - 7:45pm: Get the best from your scikit-learn classifier: trusted probabilities and optimal binary decision, Guillaume Lemaître 7:45pm - 8:30pm: Deploy your Data Project on the Web using only Python, Alexandre Sajus & Florian Jacta 8:30pm - 9:30pm: Buffet Speakers Alexandre Sajus is a customer success engineer at Taipy. He graduated with a Master's of Engineering from Centrale Paris. Florian Jacta is a data scientist and community manager at Taipy. Guillaume Lemaitre is an open-source scientific software developer at :probabl. and a core developer of the scikit-learn project. Abstracts Deploy your Data Project on the Web using only Python, Alexandre Sajus & Florian Jacta In the Python ecosystem, many packages are available for running algorithms, training models, and visualizing data. Despite this, over 85% of data science projects stay at the proof-of-concept stage and never reach the production stage. With Taipy, Python developers can build great pilots as well as stunning production-ready web applications designed for end-users. Get the best from your scikit-learn classifier: trusted probabilties and optimal binary decision, Guillaume Lemaitre When operating a classifier in a production setting (i.e. predictive phase), practitioners are interested in potentially two different outputs: a "hard" decision used to leverage a business decision or/and a "soft" decision to get a confidence score linked to each potential decision (e.g. usually related to class probabilities). Scikit-learn does not provide any flexibility to go from "soft" to "hard" predictions: it uses a cut-off point at a confidence score of 0.5 (or 0 when using decision_function) to get class labels. However, optimizing a classifier to get a confidence score close to the true probabilities (i.e. a calibrated classifier) does not guarantee to obtain accurate "hard" predictions using this heuristic. Reversely, training a classifier for an optimum "hard" prediction accuracy (with the cut-off constraint at 0.5) does not guarantee obtaining a calibrated classifier. In this talk, we will present a new scikit-learn meta-estimator allowing us to get the best of the two worlds: a calibrated classifier providing optimum "hard" predictions. This meta-estimator will land in a future version of scikit-learn: https://github.com/scikit-learn/scikit-learn/pull/26120. We will provide some insights regarding the way to obtain accurate probabilities and predictions and also illustrate how to use in practice this model on different use cases: cost-sensitive problems and imbalanced classification problems. |
PyData Paris - March 2024 Meetup
|