talk-data.com talk-data.com

Filter by Source

Select conferences and events

Activities & events

Title & Speakers Event

Ciao Pythonistas! 👋 This month, we are doing something special. PyData and PyMI Milano are joining forces for the ultimate end-of-year celebration! 🎉 Join us for "Pynettone & Lightning Talks"—a night dedicated to community stories, quick insights, and, of course, a massive slice of traditional Panettone! 🇮🇹 ⚡ What is a Lightning Talk? Never given a talk before? This is the perfect format for you!

  • Duration: strictly 5 minutes. ⏱️
  • Format: Anything goes! Slides, live demos, just standing up and telling a story, or sharing a cool project you're working on.
  • Topic: If it’s interesting to you, it’s interesting to us. (Python, Data, ML, or just a great tech story).

We want YOU on stage! Unleash your creativity and share what you love. 👉 For attending click here! 👉 Submit your Lightning Talk topic here: One-click-away (You can also sign up at the event, but slots fill up fast!) Come for the code, stay for the Pynettone! See you there. 🎅🐍

IMPORTANT: to simplify thinks please subscribe at the PyMI version of this announcement. Eventual wait list of this event will not be considered.

🐍🎄 Pynettone & Lightning Talks: A PyMI x PyData Joint Special! 🎄⚡

PyDataMCR Code Night - November

Have a data project or some coursework? Want to get to talk it through with your data peers?

Come to our Code Night at Social Refuge! A night to work on your own project, with supportive peers available for advice. Ask for tips from others more experienced than yourself, or teach somebody else something great!

WHAT YOU WILL NEED - A laptop - A project (optional!\, feel free to join us for a chat or to provide others with advice).

HOW IT WILL WORK 1830-2030 Come down and join us to Social Refuge to work on some code together. You'll have an opportunity to share any problems you're working on with other attendees. After the event we'll head around the corner to Seven Brothers for some post-event socialising.

Location We'll be at Social Refuge: Ancoats, join us downstairs! The capacity is limited to 40 people, so sign up early for a spot!

EVENT GUIDELINES PyDataMCR is a strictly professional event, as such professional behaviour is expected. PyDataMCR is a chapter of PyData, an educational program of NumFOCUS and thus abides by the NumFOCUS Code of Conduct https://pydata.org/code-of-conduct.html Please take a moment to familiarise yourself with its contents.

ACCESSIBILITY Under 16s welcome with a responsible guardian. There is a quiet room available if needed. Toilets and venue are accessible.

SPONSORS Thank you to NUMFocus for sponsoring Meetup and further support. Thank you to Autotrader, Krakenflex and Horsefly Analytics for their ongoing support and sponsorship of PyDataMCR.

PyDataMCR Code Night - November

PyDataMCR Code Night - October

Have a data project or some coursework? Want to get to talk it through with your data peers?

Come to our Code Night at Social Refuge! A night to work on your own project, with supportive peers available for advice. Ask for tips from others more experienced than yourself, or teach somebody else something great!

WHAT YOU WILL NEED - A laptop - A project (optional!\, feel free to join us for a chat or to provide others with advice).

HOW IT WILL WORK 1830-2030 Come down and join us to Social Refuge to work on some code together. You'll have an opportunity to share any problems you're working on with other attendees. After the event we'll head around the corner to Seven Brothers for some post-event socialising.

Location We'll be at Social Refuge: Ancoats, join us downstairs! The capacity is limited to 40 people, so sign up early for a spot!

EVENT GUIDELINES PyDataMCR is a strictly professional event, as such professional behaviour is expected. PyDataMCR is a chapter of PyData, an educational program of NumFOCUS and thus abides by the NumFOCUS Code of Conduct https://pydata.org/code-of-conduct.html Please take a moment to familiarise yourself with its contents.

ACCESSIBILITY Under 16s welcome with a responsible guardian. There is a quiet room available if needed. Toilets and venue are accessible.

SPONSORS Thank you to NUMFocus for sponsoring Meetup and further support. Thank you to Autotrader, Krakenflex and Horsefly Analytics for their ongoing support and sponsorship of PyDataMCR.

PyDataMCR Code Night - October
Techie vs Comic: The sequel 2025-09-26 · 15:20

A data scientist by day and a standup comedian by night. This was how Arda described himself prior to his critically acclaimed performance about his two identities during PyData 2024, where they merged.

Now he doesn't even know.

After another year of stage performances, awkward LinkedIn interactions and mysterious cloud errors, Arda is back for another tale of absurdity. In this closing talk, he will illustrate the hilarity of his life as a data scientist in the age of LLMs and his non-existent comfort zone, proving good sequels can exist

Cloud Computing LLM
PyData Amsterdam 2025
PyDataMCR Code Night - July 2025-07-30 · 17:30

PyDataMCR Code Night - July

Have a data project or some coursework? Want to get to talk it through with your data peers?

Come to our Code Night at Social Refuge! A night to work on your own project, with supportive peers available for advice. Ask for tips from others more experienced than yourself, or teach somebody else something great!

WHAT YOU WILL NEED

- A laptop - A project (optional!\, feel free to join us for a chat or to provide others with advice).

HOW IT WILL WORK

1830-2030

Come down and join us to Social Refuge to work on some code together. You'll have an opportunity to share any problems you're working on with other attendees.

After the event we'll head around the corner to Seven Brothers for some post-event socialising.

Location We'll be at Social Refuge: Ancoats, join us downstairs! The capacity is limited to 40 people, so sign up early for a spot!

EVENT GUIDELINES

PyDataMCR is a strictly professional event, as such professional behaviour is expected.

PyDataMCR is a chapter of PyData, an educational program of NumFOCUS and thus abides by the NumFOCUS Code of Conduct

https://pydata.org/code-of-conduct.html

Please take a moment to familiarise yourself with its contents.

ACCESSIBILITY

Under 16s welcome with a responsible guardian. There is a quiet room available if needed. Toilets and venue are accessible.

SPONSORS

Thank you to NUMFocus for sponsoring Meetup and further support.

Thank you to Autotrader, Krakenflex and Horsefly Analytics for their ongoing support and sponsorship of PyDataMCR.

PyDataMCR Code Night - July

PyData Helsinki is Back! 🚀🐍

After a quiet period following five awesome online events in 2020–2021, we're rebooting PyData Helsinki!

With the recent sold-out, standing-room-only Helsinki Python events proving there's plenty of enthusiasm in our community, now is a great opportunity to meet and reconnect in person.

First Meetup:

  • 🗓 Wednesday, June 25th
  • 17:00
  • 📍 Kaisla (not Kaivopuisto)

Come along and let's enjoy a relaxed evening of networking and discussions. Meet old friends, make new connections, and let's talk about what we want PyData Helsinki to become!

Please register for the event so we know how many are coming.

We were planning a picnic but the weather forecast doesn't look promising, so let's go to a pub instead. Join the Helsinki Python Discord.

― ― ― ― ― ―

And we've already found a great host for August: Wonna is a hands-on software consulting company that is committed to working with the developer community. We're privileged to be able to have our next meetup there. Save the date:

  • 🗓 Tuesday, August 26th
  • ⏰ 18:00
  • 📍 Wonna, Elimäenkatu 17-19

We need some talks for this event. If you would like to give a talk, please reach out. We welcome talks of all levels from beginner to advanced. Also ⚡️ lightning talks of 5–10 minutes! Suitable subjects include the tools of the data trade, including but not limited to Python, and experiences using them. Think more "this is why Parquet is a great file format" or "I hated Cursor until I implemented these .cursorrules" than "AGI is coming, buy my product or be left behind" or "quantum-proof your digitalisation strategies with AI-driven design sprints".

Meetups always need venues—please ask your employer to host an event!

― ― ― ― ― ―

Looking forward to seeing everyone again, or in many cases for the first time,

The PyData Helsinki Team

PyData Helsinki Pub Night (in lieu of Picnic)

Join PyData Boston for a night of technical workshops and lightning talks sponsored by the Open Data Science Conference (ODSC)!

Sign up to give a lightning talk: https://forms.gle/XSTRmCTL2jPMRpXp7

Lightning talks are 5-10 minute talks on a topic of interest to you! For those that attended ODSC East this month, we'd love to hear what you learned and about the discussions you had.

For those that attended or are attending an upcoming conference, let us know about it!

NOTE TIME CHANGE - Do not arrive before 6:30! RSVP is REQUIRED to attend 7:00-7:15 - Networking 7:15-7:45 - Build your own Git Workshop 7:45-8:15 - Break + Networking 8:15-9:00 - Lightning talks + Networking 9:00-9:30 - Wrap up

Please fill out the registration form, this will make signing in to the Moderna office much faster. Everyone will get a card with their name + company on it! 🏠 Venue provided by Moderna 🍕 Pizza provided by ODSC This, and all NumFOCUS-affiliated events and spaces, both in-person and online are governed by a Code of Conduct. More at https://pydata.org/code-of-conduct/ This event will not be recorded or streamed. ⚡⚡Speak at PyData! ⚡⚡ We are always looking for speakers! Sign up here and we'll be in touch: https://forms.gle/kfFZ5hiqA9W57Ewg7 ⚡⚡Sponsor an event! ⚡⚡ PyData events are totally free and open to all! We have a broad reach to tech professionals of all kinds. We're always looking for sponsors and hosts for our events. Please get in touch if you're interested in supporting the community: [email protected]

May Meetup: Build your own Git + Lightning Talks sponsored by ODSC
PyData Boston March Meetup 2025-03-26 · 22:30

Join PyData Boston for a night of in-person networking and expert speakers! NOTE: RSVP is REQUIRED to attend 6:30-7:00 - Networking 7:00-7:45 - Isaac Slavitt (DrivenData) - Best practices for hiring data scientists Or: Data science hiring is broken—how can we fix it? 7:45-8:15 - Break + Networking 8:15-9:00 - Discussion Groups - Python Work Knowledge, DS/ML/GenAI Work Knowledge, Networking 9:00-9:30 - Wrap up

Speakers: Isaac Slavitt (Co-Founder, DrivenData) In this talk, I will share insights from interviews with 20 data science hiring managers at top organizations (e.g., FAANG, finance, startups) on the evolving challenges of hiring in an AI-augmented world. We’ll explore emerging best practices, discuss how to extract real signal from interviews, and offer some tactical strategies to improve data science hiring in 2025 and beyond.

Discussion Groups - Python Work Knowledge, DS/ML/GenAI Work Knowledge, Networking

RSVPs are required to attend! Please fill out the registration form, this will make signing in to the Moderna office much faster. Everyone will get a card with their name + company on it!

🏠 Venue provided by Moderna 🍕 Pizza provided by DrivenData This, and all NumFOCUS-affiliated events and spaces, both in-person and online are governed by a Code of Conduct. More at https://pydata.org/code-of-conduct/ This event will not be recorded or streamed. ⚡⚡Speak at PyData! ⚡⚡ We are always looking for speakers! Sign up here and we'll be in touch: https://forms.gle/kfFZ5hiqA9W57Ewg7 ⚡⚡Sponsor an event! ⚡⚡ PyData events are totally free and open to all! We have a broad reach to tech professionals of all kinds. We're always looking for sponsors and hosts for our events. Please get in touch if you're interested in supporting the community: [email protected]

PyData Boston March Meetup

PyData Pittsburgh is excited to host our first event of 2025: Machine Learning in Astronomy. Join us on Tuesday, February 25, as Ashod Khederlarian, a 4th-year Ph.D. student at the University of Pittsburgh, shares state-of-the-art Machine Learning techniques being used to analyze vast astronomical datasets.

We have an exciting venue for this event—the Allegheny Observatory has graciously agreed to not only host the talk but also offer a free private tour exclusively for the PyData Pittsburgh group after the presentation! Don’t miss this opportunity to learn about cutting-edge AI applications in astronomy while exploring one of Pittsburgh’s most fascinating scientific landmarks.

Note: Attendance for this event is limited. Please RSVP only if you are committed to attending. Thank you.

About the talk:

Astronomy is an observational science. To understand the history and evolution of our universe and everything in it, our only option is to observe the night sky and test our theories against the observations. Current and next-generation observatories, such as the Dark Energy Spectroscopic Instrument, the Rubin Observatory, the Roman Space Telescope, and the Euclid Space Telescope will collect light coming from billions of galaxies and stars, resulting in 10s of terabytes of data per night. Most of this complex, high-dimensional data will not be seen by the naked eye, making data science and Machine Learning (ML) tools essential for analyzing them.

In this talk, Ashod will highlight how state-of-the-art ML techniques are being used in Astronomy. Particularly, he will focus on his work at the University of Pittsburgh on using simple neural networks to add realistic properties to galaxy simulations, using deep convolutional neural networks to make 3D maps of the universe, and using dimensionality reduction techniques to visualize high-dimensional datasets.

About the observatory:

The Allegheny Observatory is one of the major historic astronomical research institutions of the world. A short presentation about the institution will be shown followed by a walking tour of the building finally ending up at the 13" Fitz-Clark refractor.

Times: 7pm, Doors Open 7:30pm, Machine Learning in Astronomy Talk 8:30pm, Observatory Tour

Getting to the observatory:

Address: 159 Riverview Ave, Pittsburgh, PA 15214

If you are coming up 279 from Pittsburgh, take exit 3, Hazlett St. Turn left on East street. Continue north on East St. DO NOT turn left on Milroy. Your mapping program will reroute you: Continue on and bear left to stay on East street at the 4th light. Make a sharp left turn onto Perrysville Ave. Continue on to make a right turn at Riverview Ave.

You can park on the righthand side of the one way road that loops around the observatory, or in the parking lot for the nearby dog park. Enter through the main doors and proceed to the event room.

If you arrive at the front door and it is closed, please knock or buzz the bell. Thanks!

To use a handicapped-accessible ramp, park in the back of the observatory, use the ramp to the back door and ring the doorbell to the left of the door.

Machine Learning in Astronomy

Join PyData NYC at 11 Times Square (Microsoft) on Feb 12th at 6:30 pm for a talk night with Tamer Abuelsaad (Emergence AI).

🍕 Pizza, drinks & venue sponsored by Emergence AI- thank you!

Agenda:

Navigating the Web: Lessons from Building Real-World Agents Abstract: How do you create agents that reliably move through websites and perform complex tasks? In this talk, we’ll share Emergence AI’s journey of building and refining web navigation agents using different techniques. This talk will cover what worked well, what didn’t, and the practical lessons learned along the way.

Networking Connect with fellow data enthusiasts, professionals, and community leaders. Build meaningful connections and forge collaborations. ---------------------------------------------------------------- Doors open @ 6 pm Doors close @ 7 pm Event @ 6:30 - 8:30 pm Venue provided by MSFT: 11 Times Square ---------------------------------------------------------------- The building requires a government-issued photo ID for entrance. This, and all PyData NYC events, is an all-level event. Newcomers and beginners are welcome. This and all NumFOCUS-affiliated events and spaces, both in-person and online, are governed by a Code of Conduct. ---------------------------------------------------------------- This event may be recorded.nd

Navigating the Web: Lessons from Building Real-World Agents

Join PyData Boston for a night of in-person networking and tech talks by the community and a featured speaker!

If you'd like to sign up to give a lightning talk (\~10 minutes), use this form: https://forms.gle/KJfDN2iiUa4x5GVn9

NOTE: We have limited space, so ONLY RSVP if you are fairly certain you will come. We do keep track of who attends!

6-6:45 - Networking 6:45-7:30 - Featured speaker - Isaac Godfried - Multimodal Deep Learning 7:30-8:00 - Break + Networking 8:00-8:30 - Lightning talks (sign up here: https://forms.gle/KJfDN2iiUa4x5GVn9) 8:30-8:45 - Wrap up

🏠 Venue provided by Microsoft 🍕 Pizza provided by Mattermost - thank you!

This, and all NumFOCUS-affiliated events and spaces, both in-person and online are governed by a Code of Conduct. More at https://pydata.org/code-of-conduct/ This event will not be recorded or streamed.

⚡⚡Speak at PyData! ⚡⚡ We are always looking for speakers! Sign up here and we'll be in touch: https://forms.gle/kfFZ5hiqA9W57Ewg7

⚡⚡Sponsor an event! ⚡⚡ PyData events are totally free and open to all! We have a broad reach to tech professionals of all kinds. We're always looking for sponsors and hosts for our events.

Please get in touch if you're interested in supporting the community: [email protected]

PyData Boston December Meetup

PyData Roma - 7th Meetup! 🎉 New location unlocked! Get ready for another great night about Python, data, and science!

Let's pick up where we left off and continue our mission to make Rome a fantastic place for software engineering and data science.

⚠️ Remember to RSVP using your full name for security reasons and bring a valid ID to show at the entrance. Otherwise, you will not be allowed to enter the premises! ⚠️

The presentations for this event will be announced soon. You could be the next speaker at this or a future event. If you have a presentation, some interesting code, or an open problem you'd like to discuss with the community, compile the form and let us know! (Proposals can be in English or Italian, whatever makes you comfortable.)

Location: Via di Vigna Murata 605 Date: November 22nd 2024

Schedule:

  • 18:00 🚪 Door Opening
  • 18:45 🎤 Talk 1 - "Conformal Prediction: quantificazione dell'incertezza per umanizzare i modelli" by Vincenzo Ventriglia (ML Engineer @ Istituto Nazionale di Geofisica e Vulcanologia)
  • 19.15 🎤 Talk 2 - "Advanced topics on RAG" by Federico Ricciuti (Data Scientist)
  • 19.45 🤝 Socializing

Here is a short description of the two presentations:

1. Conformal Prediction: quantificazione dell'incertezza per umanizzare i modelli L'identificazione delle incertezze nel Machine Learning è fondamentale per prendere decisioni solide, migliorare l'affidabilità dei modelli e valutarne i rischi. Quantificando e comprendendo l'incertezza, si possono costruire sistemi di AI più affidabili e degni di fiducia. Immaginiamo di avere un modello che predice se una TAC contiene o meno un tumore. Gli approcci tradizionali tendono a fornire previsioni binarie, non fornendo informazioni sul livello di fiducia del modello per ciascuna previsione. La Conformal Prediction (CP) è un framework per la quantificazione dell'incertezza che aggiunge una stima della fiducia nelle previsioni del modello: invece di fornire una risposta "puntuale", fornisce una serie di risultati possibili (set di previsioni), unitamente a una misura della fiducia in ciascun risultato. Questi set di previsione sono corredati da garanzia (matematica!) di copertura del risultato vero, assicurando che rileveranno almeno una percentuale pre-fissata di valori veri. CP, inoltre, è un paradigma agnostico rispetto al modello sottostante e non fa ipotesi sulla distribuzione dei dati. CP dunque offre una struttura robusta che consente agli stakeholder di prendere decisioni più informate, soprattutto in quei settori a elevato rischio come la sanità, la finanza e i sistemi autonomi.

2. Advanced topics on RAG Advanced topics about the construction of RAGs, starting from basic adaptations (e.g., reranking, answer refinement) to more advanced concepts related to metadata exploitation and filtering, semantic caching, cost and log monitoring, jailbreak and prompt injection detection, automatic evaluation of RAG solutions, and their integration in agentic systems. At the end, a quick demo will be shown.

Sign Up! Space is limited, so RSVP today to secure your spot!

Please note: Remember to RSVP using your full name for security reasons and bring a valid ID to show at the entrance. If you can't attend, please let us know at least 2 days in advance so you can free spots for people on the waiting list. We look forward to seeing you there! 🙌

PyData Rome, 7th Meeting, 22nd November 2024
PyData Talk Night ✨ 2024-11-13 · 23:30

Join PyData NYC at 11 Times Square (Microsoft) on November 13th at 6:30 pm for a talk night with Milan Janosov, Kelly Abuelsaad & Thanos Tatsios. Please bring your 💻 to code along and sign up with your government official name.

🍕 Pizza, drinks & venue sponsored by Microsoft Reactor - thank you!

Agenda: Connecting the Dots - From Network Science to Spatial Analytics Speaker: Milan Janosov (Founder of Geospatial Data Consulting)

Everything is connected - we have heard that many times. In my talk, I aim to outline how the science of connections - network science uncovers and makes these often invisible connections visible. I will touch base on topics like how to use graph analytics on subjects like the Game of Thrones or The Witcher, how to uncover the secret sauce of star DJs, and how the science of cities has recently unfolded, partly powered by networks - all done on a purely Python data stack.

Milan Janosov is a seasoned data scientist with a background in Physics, a PhD in Network and Data Science, and a current focus on Geospatial Data Science. Start-up co-founder, Forbes 30 under 30 entrepreneur, and public educator. Author of the #1 Amazon Best Seller Geospatial Data Science Essentials. His work has been widely featured in professional, scientific, and popular media, including Towards Data Science, Nature Social Science Research, GQ, New Scientist, New York Times, TechXplore, The Economic Times, Gamestar, and more.

Build a local AI co-pilot using open-source Granite Code, Ollama, and Continue Speaker: Kelly Abuelsaad & Thanos Tatsios (IBM) This session introduces the open-source Granite model and how you could use Granite as a co-pilot on your laptop.

Networking Connect with fellow data enthusiasts, professionals, and community leaders. Build meaningful connections and forge collaborations. ---------------------------------------------------------------- Doors open @ 6 pm Doors close @ 7 pm Event @ 6:30 - 8:30 pm Venue provided by MSFT: 11 Times Square ---------------------------------------------------------------- The building requires a government-issued photo ID for entrance. This, and all PyData NYC events, is an all-level event. Newcomers and beginners are welcome. This and all NumFOCUS-affiliated events and spaces, both in-person and online, are governed by a Code of Conduct. ---------------------------------------------------------------- This event may be recorded.

PyData Talk Night ✨

WARNING -> Sign up on TicketTailor here -> https://www.tickettailor.com/events/opendatamanchestercic/1417909

As Halloween approaches, Open Data Manchester, HER+Data MCR, PyData MCR and Rust Manchester invite you once again to our annual Data Horror Stories event on 30th October!

In today's world, algorithms predict our preferences, chatbots engage in surprisingly human-like conversations, and AI systems make crucial decisions affecting millions. But what happens when these technologies don't work as intended? Join us for an evening of real-world cautionary tales and data-driven insights

We're calling on data professionals, tech experts, anyone who's encountered tech troubles to share their most unsettling experiences. Has a coding error caused chaos in your organisation? Did a data breach leave you sleepless? Perhaps a digital campaign went horribly wrong? We want to hear about it.

  • Have a story to tell? Email sam[at]opendatamanchester.org.uk to secure a speaking slot.
  • Prefer to remain anonymous? Submit your story, and we may present it without revealing your identity.
  • Feeling inspired on the night? We'll have quick-fire slots available for impromptu speakers. Just let us know when you arrive.

Join us for an evening of eye-opening stories, thoughtful discussion, and practical solutions to address the challenges posed.

About

HER+Data MCR is a community working to connect, inspire, support and empower the NW UK’s Women in Data. It brings together anyone who identifies as a woman or non-binary and has a connection to data. We talk data science, analytics, research, visualisation, software, applications and experiences women share working in male dominated environments. Follow us on Meetup or Linkedin.

PyData MCR is the Manchester chapter of the International PyData Community. For Manchester based data people, to share and learn new things. All open data tooling welcome. Follow PyData MCR on LinkedIn or on Meetup.

Location Northcoders, M1 7ED

Data Horror Stories with Open Data Manchester, Her+Data MCR & Rust Manchester

Join PyData NYC at 11 Times Square (Microsoft) on October 9th at 6:30 pm for a talk night with Daniel Gural (Voxel51) and Olivier Poupeney (Head of Developer Relations at Orkes). Please bring your 💻 to code and sign up with your government official name.

🍕 Pizza, drinks & venue sponsored by Microsoft Reactor - thank you!

Agenda: Build Your Own Virtual World: 3D Reconstruction in FiftyOne Speaker: Daniel Gural, Machine Learning and DevRel at Voxel51

3D is one of the fastest-growing spaces in ML, and new models are coming out that can achieve incredible results. In this talk, you'll learn some of the methods used to create 3D reconstructions, the drawbacks of today's models, and what there is to be excited about on the horizon.

GenAI Orchestrations using Python Speaker: Olivier Poupeney, Head of Developer Relations at Orkes Orchestrating models and vector databases to automate insightful answers to LLM queries and seamlessly integrate GenAI capabilities to existing apps.

Olivier manages Orkes's Developer Relations program for Orkes Conductor's developer community and is member of FINOS's Technical Oversight Committee.

Networking Connect with fellow data enthusiasts, professionals, and community leaders. Build meaningful connections and forge collaborations. ---------------------------------------------------------------- Doors open @ 6 pm Doors close @ 7 pm Event @ 6:30 - 8:30 pm Venue provided by MSFT: 11 Times Square ---------------------------------------------------------------- The building requires a government-issued photo ID for entrance. This, and all PyData NYC events, is an all-level event. Newcomers and beginners are welcome.This and all NumFOCUS-affiliated events and spaces, both in-person and online, are governed by a Code of Conduct. ---------------------------------------------------------------- This event may be recorded.nd

Visual AI & GenAI Orchestration ⭐️

Join PyData Boston for a casual night of in-person networking and lightning talks by members of the community. By who?? By You!!

Doors @ 6 pm Event @ 6:30 - 8:30 pm

🏠 Venue provided by IBM 🍕 Pizza provided by Mattermost - thank you!

⚡⚡What is a Lightning Talk? ⚡⚡ A lightning talk... • ... is a 5-10 minute talk on any topic, technical or not • ... is often about something cool you just learned or some open source software you wrote. • ... can have slides but it's not required • ... doesn't need to be prepared, but could be • ... can be given by ANYONE! First-timers and first-time speakers are always welcome. Multiple talks per person are allowed (just make sure everyone else has a chance first!) Please no marketing pitches.

⚡⚡How do I give a talk ? ⚡⚡ Sign up sheet: https://forms.gle/yHPFAzFWC7epW5mm8

This, and all PyData Boston events, is an all-levels event. Newcomers and beginners are welcome. This, and all NumFOCUS-affiliated events and spaces, both in-person and online are governed by a Code of Conduct. More at https://pydata.org/code-of-conduct/ This event will not be recorded or streamed.

-- 🤔 🍕 Is your company willing to host or sponsor a future event? Get in touch on [email protected]

Launch Party and Lightning Talks, sponsored by Mattermost ⚡️

PyData Roma - 6th Meetup! 🎉 New location unlocked! Get ready for another great night about python, data and science!

Welcome back from the holidays! Let's pick up where we left off and continue our mission to make Rome a fantastic place for software engineering and data science.

The presentations of this event are ready and listed below. You can be the next speaker. If you have a presentation, some interesting code, or an open problem you'd like to discuss with the community compile the form and let us know! (proposals can be in English or Italian, whatever makes you comfortable).

Location: Via Sandro Sandri 81, Rome - Italy Date: September 23rd 2024

Schedule:

  • 18:00 🚪 Door Opening
  • 18:20 ELIS Innovation Hub, Marco Oreste Migliori (Technology Innovation Manager @ ELIS)
  • 18.35 🧊 PyData: what it is and why it matters, Luigi Selmi (CNR-IIA, PyData Rome Organizer)
  • 18:45 🎤 Talk 1 - Il GPS non funziona! Un modello può avvisarci prima che accada? by Vincenzo Ventriglia (ML Engineer @ Istituto Nazionale di Geofisica e Vulcanologia)
  • 19.15 🎤 Talk 2 - Dal Caos agli Insights: sfide e strategie per creare big data pipelines by Carmela Salandria & Pietro Di Giandomenico (Data Engineers @ Elis Innovation Hub)
  • 19.45 🤝 Socializing

Here is a short description of the two presentations:

1. Il GPS non funziona! Un modello può avvisarci prima che accada?

Ti sarà successo di usare il GPS sul tuo smartphone e accorgerti che non funziona correttamente? I responsabili potrebbero essere il Sole e le Large-Scale Travelling Ionospheric Disturbances (LSTIDs), fluttuazioni ionosferiche che giocano un ruolo cruciale nella dinamica dello Space Weather. Presenteremo un modello di previsione di LSTIDs che stiamo sviluppando all'Istituto Nazionale di Geofisica e Vulcanologia, che si basa su CatBoost e usa diversi driver fisici per fare predizioni. L'explainability è una caratteristica desiderabile in un modello, specialmente in contesti potenzialmente ad alto rischio come lo Space Weather. Noi useremo SHAP – un approccio mutuato dalla teoria dei giochi – per interpretare e spiegare l'output. Accenneremo infine al probabilistic forecasting e alla calibrazione del modello, seguendo il paradigma della conformal prediction.

2. Dal Caos agli Insights: sfide e strategie per creare big data pipelines È possibile ridurre da due settimane a un solo giorno il tempo necessario per ottenere insights di valore? Partendo da un caso d'uso reale sviluppato per un'azienda di trasporti, esploreremo come costruire una big data pipeline capace di semplificare e automatizzare i processi decisionali aziendali. Vi mostreremo l'importanza di PySpark nella gestione dei big data e come, adottando best practice, sia possibile trasformare dati grezzi in informazioni utili per data analyst e data scientist. Approfondiremo come il calcolo distribuito e parallelo possa drasticamente ridurre i tempi di elaborazione, e quando preferirlo rispetto a librerie più tradizionali come Pandas. Inoltre, approfondiremo il tema della scalabilità, confrontando i benefici di una big data pipeline in cloud rispetto a soluzioni on-premises.

Sign Up! Space is limited, so RSVP today to secure your spot!

Please note: Remember to RSVP using your full name for security reasons. If you can't attend, please let us know at least 2 days in advance so you can free spots for people on the waiting list. We look forward to seeing you there! 🙌

PyData Rome, 6th Meeting, 23rd September 2024

Join PyData NYC at 11 Times Square (Microsoft) on August 14th at 6:30 pm for a tutorial night with Yujian Tang (CEO of OSS4Al) and Zain Hasan (Senior ML Developer Relations Engineer at Weaviate). Please bring your 💻 to code and sign up with your government official name.

🍕 Pizza, drinks & venue sponsored by Microsoft Reactor - thank you!

Agenda: LLM Based Applications - Building Agentic RAG Workshop Speaker: Yujian Tang, CEO of OSS4Al

Yujian Tang started developing software professionally at the age of 16. In college, he studied computer science, neuroscience, and statistics and published machine learning papers to conferences like lEEE Big Data. After graduation, he worked on the AutoML system at Amazon before moving on to build his own companies including a data aggregation app, an NLP API, and his current company - OSS4Al, an organization aimed at providing all developers access to the resources to understand, use, and contribute to the direction and development of Al.

Scaling Vector Search in Production Without Breaking the Bank: Quantization and Adaptive Retrieval Speaker: Zain Hasan, Senior ML Developer Relations Engineer at Weaviate

Everybody loves vector search and enterprises now see its value thanks to the popularity of LLMs and RAG. The problem is that prod-level deployment of vector search requires boatloads of CPU, for search, and GPU, for inference, compute. The bottom line is that if deployed incorrectly vector search can be prohibitively expensive compared to classical alternatives.

The solution: quantizing vectors, leveraging hardware-accelerated optimizations and performing adaptive retrieval. These techniques allow you to scale applications into production by allowing you to balance and tune memory costs, latency performance, and retrieval accuracy very reliably.

I’ll talk about how you can perform real-time billion-scale vector searches on your laptop! This includes covering different quantization techniques, including product, binary, scalar and matryoshka quantization that can be used to compress vectors trading off memory requirements for accuracy. I’ll also introduce the concept of adaptive retrieval where you first perform cheap hardware-optimized low-accuracy search to identify retrieval candidates using compressed vectors followed by a slower, higher-accuracy search to rescore and correct. When used with well-thought-out adaptive retrieval, these quantization techniques can lead to a 32x reduction in memory cost requirements at the cost of \~ 5% loss in retrieval recall in your RAG stack.

Zain Hasan is a senior ML developer relations engineer at Weaviate. An engineer and data scientist by training, he pursued his undergraduate and graduate work at the University of Toronto building artificially intelligent assistive technologies, then founded his company, VinciLabs in the digital health-tech space. More recently he practiced as a consultant senior data scientist in Toronto. Zain is passionate about the fields of machine learning, education, and public speaking.

Networking Connect with fellow data enthusiasts, professionals, and community leaders. Build meaningful connections and forge collaborations.

---------------------------------------------------------------- RSVP is required; please note that walk-ins will not be accepted. Note: Per building policy, RSVPs will close at 12 pm on Aug 12th. Doors open @ 6 pm Doors close @ 7 pm Event @ 6:30 - 8:30 pm Venue provided by MSFT: 11 Times Square ----------------------------------------------------------------

The building requires a government-issued photo ID for entrance. This, and all PyData NYC events, is an all-level event. Newcomers and beginners are welcome.This and all NumFOCUS-affiliated events and spaces, both in-person and online, are governed by a Code of Conduct. ---------------------------------------------------------------- This event may be recorded.

Building Agentic RAG and Scaling Vector Search 🪩

Join PyData NYC at 11 Times Square (Microsoft) on July 17th at 6:30 pm for a talk night with Art Anderson (Aerospike, Sr Dev Experience Engineer) and Ilia Zlobin (Systems Architect). Please bring your 💻 to code and sign up with your government official name.

🍕 Pizza, drinks & venue sponsored by Microsoft Reactor - thank you!

Agenda: Supercharging Real-time Applications with Vector and Graph Speaker: Art Anderson, Sr Dev Experience Engineer (Aerospike) Embark on a journey connecting key-value, vector, and graph to build real-world applications showcasing Retrieval Augmented Generation (RAG) using semantic search along with recommendation engines, user profile stores, and more. Unlock the full potential of your real-time applications with multi-model databases and see how Python pulls this all together.

New Machine Learning Paradigm with DSPy: No Prompt Engineering Required Speaker: Ilia Zlobin, Systems Architect Prompt engineering has some limitations in that you have to tailor it to a particular use case and then adjust and support it as your application continues to evolve. With DSPy, you look at the problem from the traditional machine learning perspective where you operate with datasets orchestration, evaluation metrics and hyper-parameters tuning to improve performance on the task. DSPy gives you exactly that but for LLMs and with much lower requirements in terms of training data size, hardware resources needed. With a slight effort you could gain a great boost in performance and could seamlessly continue increasing your application complexity.

Networking Connect with fellow data enthusiasts, professionals, and community leaders. Build meaningful connections and forge collaborations.

---------------------------------------------------------------- RSVP is required; please note that walk-ins will not be accepted. Note: Per building policy, RSVPs will close at 12 pm on June 10th. Doors open @ 6 pm Doors close @ 7 pm Event @ 6:30 - 8:30 pm Venue provided by MSFT: 11 Times Square ----------------------------------------------------------------

The building requires a government-issued photo ID for entrance. This, and all PyData NYC events, is an all-level event. Newcomers and beginners are welcome.This and all NumFOCUS-affiliated events and spaces, both in-person and online, are governed by a Code of Conduct. ---------------------------------------------------------------- This event may be recorded.

Vector Databases and Machine Learning Paradigms ⭐️

PyDataMCR and Rust Manchester are teaming up for our June speakers night

THE TALK

Polars and Time Series: what it can do, and how to overcome any limitation - Marco Gorelli (he/him)

Time series analysis is ubiquitous in applied data science because of the value it delivers. In order to do effective time series analysis, you need to know your tools well. Polars has excellent built-in time series support, and it's also possible to extend it where necessary.

We will talk about: - Basic built-in time series operations with Polars (e.g. "what's the average number of sales per month?"). - numba/numpy/scipy interoperability for not-so-basic time series operations (e.g. non-linear interpolation\, or cumulative operations). - Advanced\, custom time series operations\, and how you can implement them as Polars plugins (e.g. business day arithmetic).

Q&A developing open source software in the Python ecosystem

Location We'll be at Krakenflex Manchester, who are kindly supplying catering. The capacity is limited to 100.

EVENT GUIDELINES

PyDataMCR is a strictly professional event, as such professional behaviour is expected.

PyDataMCR is a chapter of PyData, an educational program of NumFOCUS and thus abides by the NumFOCUS Code of Conduct

https://pydata.org/code-of-conduct.html

Please take a moment to familiarise yourself with its contents.

ACCESSIBILITY

Under 16s welcome with a responsible guardian. The venue and toilets are accessible with a lift from reception.

SPONSORS

Thank you to NUMFocus for sponsoring the PyData meetups.

Thank you to AutoTrader for sponsoring PyDataMCR.

Thank you to Krakenflex for an awesome venue and catering!

PyDataMCR & Rust Manchester - June Talks