Search – talk-data.com

Observe event-driven systems | Zendesk Customer Success Story | Cont. Profiling 2026-01-29 · 17:00

I'm excited to announce another meetup! If you're interested in give a talk, feel free to reach out to me at [email protected]

---

All talks will be presented in English, to ensure that as many people as possible can participate and engage at this event.

If you want to attend, please RSVP to secure your spot - this will make organizing easier. Thank you so much ♥️

Location: Spiced Academy, Ritterstrasse 12, 10969 Berlin, 2. Hof

------

🏠 18:00 - Arrival: Networking, Drinks & Snacks (30 min.) Grab yourself snacks & drinks and say hello to everybody else!

📅 18:30 - So you want to observe an Event Driven System? (45 min.) 🎙️Speaker: Roman Boiko, Enterprise Sales Engineer @ Datadog

Event-driven architectures offer powerful decoupling, but they often shatter traditional monitoring strategies. When a single business process jumps across multiple queues, buses, and microservices, tracing the line between cause and effect becomes a complex puzzle. This talk bridges that visibility gap by exploring practical observability strategies using OpenTelemetry. We will navigate the critical architectural decisions behind distributed tracing - specifically choosing between Span Linking and Parent-Child relationships - and master context propagation to ensure no transaction is lost in the noise. Join us to learn how to connect low-level technical data with high-level business metrics, turning asynchronous complexity into a clear, manageable system.

Roman Boiko is a Senior Enterprise Sales Engineer at Datadog, where he helps developers and architects build reliable, observable, and scalable systems. With a strong background in serverless and cloud-native architectures, he’s passionate about bridging the gap between engineering and operations to help teams move faster with confidence. Roman frequently speaks at tech events, sharing practical insights on observability, distributed systems, and modern application design. He enjoys exploring how developers can simplify complexity and deliver better software through data-driven decisions.

📅 19:15 - Datadog at Zendesk - a solid foundation for scalable, reliable and cost efficient systems (30 min.) 🎙️Speaker: Anatoly Mikhaylov, Principal Software Engineer & Datadog Ambassador @ Zendesk

Anatoly will share practical and valuable lessons what value Datadog brings and what role it plays at Zendesk. They customized and encorporated observability triad (APM, Logs and metrics) within a large volume of systems and applications. Zendesk made it work very well at scale and so that Datadog became a common language engineers and product teams can speak to one another. A single picture often helps to narrow down the issue, surface the problem, brings the solution and be confident remediation is applied successfully. Zendesk teams are fluent using complex Datadog tools and drive necessary changes.

📅 19:45 - Continuous Profiling with Datadog (30 min.) 🎙️Speaker: Felix Geisendörfer, Senior Staff Engineer @ Datadog

Description following

🥗 20:15 - Drinks, Food & Networking Enjoy refreshments while networking with community peers!

👋 21:00 - Goodbye, see you next time!

Observe event-driven systems | Zendesk Customer Success Story | Cont. Profiling

Santa’s Data Workshop 2025-12-18 · 17:30

The next meetups will be managed on Luma.

You can register for this event here. You can follow the Data Berlin calendar here.

Join us for the final meetup of the year, where we explore how modern teams use MCP to better support their analytics users — improving discoverability, governance, and the flow from data to insights.

Agenda

6:30 PM – 7:00 PM — Registration and Networking 7:00 PM – 7:10 PM — Welcome and Initial Remarks 7:10 PM – 7:25 PM — Talk 1: Scout24 Powered by Data Speaker:

Ali Izhar Ahmed, VP Data and AI @ Scout24

7:25 PM – 7:50 PM — Talk 2: Scout24 Intelligent Data Platform: Powering Analytics, Ensuring Governance Speakers:

Angelita Frozza Sanches, Engineering Manager @ Scout24
Kiran Paduvalli Gourish, Lead BI Engineer @ Scout24

7:50 PM – 8:00 PM — Break 8:00 PM – 8:25 PM — Talk 3: TBD Speaker:

Abdallah Dorra, Senior Software Engineer @ Vinted

8:25 PM – 9:30 PM — Closing Remarks and Extended Networking

About our host Scout24 is the company behind ImmoScout24, Germany’s leading online marketplace for real estate. Every day, millions of users rely on Scout24’s platforms to find homes, make informed decisions, and navigate the real estate market. Behind the scenes, Scout24’s data teams build the infrastructure and intelligence that power personalized experiences, trusted recommendations, and data-driven innovation across the organization.

Santa’s Data Workshop

Databricks Cost Optimization | Data Engineering Meetup | Berlin, Dec 9th 2025-12-09 · 17:30

We're celebrating 1 year applydata Meetups in Berlin! 🎉 Let’s kick things off for our last Meetup in 2025, this time focusing on Databricks Cost Optimization and featuring an interactive data engineering quiz. Join us on December 9th in Berlin and bring all your questions & curiosity!

Kaan Ara: "*Databricks Cost Optimization: A Multi-Layered Strategy for Performance and Efficiency"***

Kaan Ara, Senior Cloud Engineer at Diconium, about his talk: "Databricks cost optimization requires a multi-layered strategy that focuses on three pillars: efficient Compute, optimized Storage, and strict Governance. Efficiency is driven by leveraging technologies like Photon and Serverless SQL, while storage is optimized using Delta Lake features such as Z-ordering and aggressive vacuuming. Strict governance, enforced through cluster policies and auto-termination, ensures these technical gains translate into consistent budget predictability without sacrificing performance."

Who's the data expert in the room? Interactive data pub quiz

After the keynote, it’s your turn: we’ll fire up a quiz in pub-style. There’s no prep needed – everyone is welcome to join, no matter if you're a data engineering expert or a data newbie!

What to expect:

Expert talk
Interactive Q&A and quiz
Networking opportunities
Some snacks & drinks :)

Timetable:

18:30 - Event admission
18:50 - Welcome & Introduction
19:00 - Databricks Cost Optimization: A Multi-Layered Strategy for Performance and Efficiency - Kaan Ara, Senior Cloud Engineer at Diconium
19:30 - 5 minutes break
19:35 - Interactive data engineering pub quiz
20:00 - Snacks, drinks & networking
21:30 - End

More on the -> applydata data engineering meetup page.

Our goal is to form a local data-loving community, so join us and let's talk data together!

---

At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here.

Databricks Cost Optimization | Data Engineering Meetup | Berlin, Dec 9th

Data Engineering Meetup | Berlin, Oct 30th 2025-10-30 · 17:30

Let’s kick things off for another Meetup, this time focusing on the collaboration of data scientists and data engineers, as well as data streaming in the VW environment. Join us on October 30th in Berlin and bring all your questions!

Tom Kaltofen: "What Data Scientists Actually Need from Data Engineers: A ‘Data Producer’ Perspective"

Tom Kaltofen is an Engineer at DHL Data & AI and a Creator at mloda.ai. In his keynote, he'll explore how data engineers can better support data scientists, BI, software engineers, analysts and management by understanding their real needs and designing data products accordingly. He’ll share practical lessons from his own industry experience: what worked, what didn’t, and the trade-offs involved in real-world data workflows. Since data engineering often involves navigating competing approaches, we’ll also look at some of the pros and cons of different methods, but always with the different data user groups in mind.

Alex Kalinnikov: "Event-driven data streaming platform at VW Group"

Alex Kalinnikov is a Product Owner at CARIAD with over 10 years of experience in IT & Infrastructure. He will talk about how Cariad handles 180M telemetry messages per day with a modern data streaming architecture and how Cariad UDE Solution leverages Confluent Kafka, Apache Flink and Microsoft Azure to move terabytes of IoT data.

What to expect:

Two expert talks and Q&A
Networking opportunities in our great Creator Space
Some snacks & drinks :)

Timetable:

18:30 - Event admission
18:50 - Welcome & Introduction
19:00 - Tom Kaltofen: "What Data Scientists Actually Need from Data Engineers: A ‘Data Producer’ Perspective"
19:30 - 5 minutes break
19:35 - Alex Kalinnikov: "Event-driven data streaming platform at VW Group"
20:05 - Snacks, Drinks & Networking
21:30 - End

More on the -> applydata data engineering meetup page.

Our goal is to form a local data-loving community, so join us and let's talk data together! --- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here.

Data Engineering Meetup | Berlin, Oct 30th

Data Builders’ Evening: Architecture, Engineering & Beyond | Berlin, Sep. 16th 2025-09-16 · 16:00

Dear data-loving community, we can't wait to present to you our new Meetup event: This time, it will be a collaboration with RisingWave, a platform for real-time streaming data management and analysis. Yingjun Wu, Founder and CEO at RisingWave Labs, will share his experience in a techy talk, as well as Behnaz Derakhshani, who works as a Specialist Data Engineer at Diconium's data department. Additionally, we're going to welcome external guest speaker Erik Schmiegelow, CEO at Hivemind Technologies. Exciting line-up, right? :D

Join us on September 16th in Berlin and bring all your questions! Here are the topics you can expect:

Yingjun Wu: Achieving Sub‑100 ms Real‑Time Stream Processing with an S3‑Native Architecture

Stream processing systems have traditionally relied on local storage engines such as RocksDB to achieve low latency. While effective in single-node setups, this model doesn't scale well in the cloud, where elasticity and separation of compute and storage are essential. In this talk, we'll explore how RisingWave rethinks the architecture by building directly on top of S3 while still delivering sub-100 ms latency. At the core is Hummock, a log-structured state engine designed for object storage. Hummock organizes state into a three-tier hierarchy: in-memory cache for the hottest keys, disk cache managed by Foyer for warm data, and S3 as the persistent cold tier. This approach ensures queries never directly hit S3, avoiding its variable performance. We'll also examine how remote compaction offloads heavy maintenance tasks from query nodes, eliminating interference between user queries and background operations. Combined with fine-grained caching policies and eviction strategies, this architecture enables both consistent query performance and cloud-native elasticity. Attendees will walk away with a deeper understanding of how to design streaming systems that balance durability, scalability, and low latency in an S3-based environment.

Behnaz Derakhshani: From Raw Data to Trusted Assets: A Practical Walkthrough with AWS services and Collibra

Expect a hands-on journey of Behnaz showing how modern data lake tools and governance platforms connect the dots, making your data discoverable, governed, and productized for real-world use.

Erik Schmiegelow: Effective Agentic GenAI in Data Streaming

Successful genAI projects strike the balance between impact, accuracy, and cost. In this talk, Erik will cover how to create agentic data applications effectively, choosing when and how to integrate them in data streams and keep response quality issues and costs in check.

What you can expect:

3 expert talks
Interactive Q&A
Networking opportunities
Pizza & drinks (indoor or at our terrace)

Timetable:

18:00 - Event admission
18:30 - Welcome & introduction
18:35 - Keynote by Yingjun Wu & Q&A
19:05 - Short break
19:15 - Keynote by Behnaz Derakhshani & Q&A
19:45 - Keynote by Erik Schmiegelow & Q&A
20:15 - Snacks, drinks & networking
21:30 - End *

Our goal is to form a local data-loving community, so join us and let's talk data together!

-> Our event page, where you can also contact us if you want to present in the future at our Meetup: Data Engineering MeetUp Berlin - applydata

--- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here.

Data Builders’ Evening: Architecture, Engineering & Beyond | Berlin, Sep. 16th

Berlin Cybersecurity Social #18: AI & Cybersecurity Sessions 2025-07-31 · 15:00

Are you a cybersecurity professional looking to connect with like-minded professionals, share experiences, and make friends? Look no further! Join us for a special edition of the Berlin Cybersecurity Social hosted in collaboration with the Venture Café Berlin and the AI Ethics Action Hub for a fantastic evening of networking.

Agenda:

5:00 PM - 5:15 PM: Welcome
5:15 PM – 5:50 PM: Lightning Talk: AI Threat Modeling: how to bring the right mindset to detect and prevent AI risk - Iryna Schwindt In this talk, we'll explore the full spectrum of AI risks—not just security-related ones—and why understanding the application context is critical. You'll learn: - What are the types of AI risks (not only security risks) - Why AI application context matters - How to identify potential threats\, apply effective controls and guardrails - Cultivating the right mindset to detect and prevent AI risks *
5:50 PM - 6:30 PM: Panel: AI Meets Cybersecurity: Building Smarter, Safer Systems at Scale - Jose Quesada, Diana Waithanji, Ali Yazdani, Pranav Vattaparambil As AI rapidly integrates into every layer of digital infrastructure, the stakes for cybersecurity have never been higher. This panel brings together experts from across the security spectrum—ranging from DevSecOps and enterprise risk to cybersecurity strategy—to explore how AI is transforming threat detection, governance, and secure system design. We’ll dive into real-world use cases, emerging risks, and what it takes to build scalable, intelligent, and secure systems in an increasingly AI-driven world.
6:30 PM - 8:00 PM: Breakout Session: Cybersecurity in the Age of AI: Ethics & Human-Centered Future *Featuring Azer Aliyev (speakinprivate.com), Gunay Kazimzade (Mercedes-Benz Consulting), and Justin Shenk (AI Salon Berlin), this fast-paced session brings together innovators, researchers, and tech leaders to explore how to build AI systems that protect privacy, bolster trust, and keep humans at the heart of digital transformation.

*This session is organised by the AI Ethics Action Hub

About the Speakers:

Iryna Schwindt is a Cybersecurity engineer currently at Vodafone and a co-author at the OWASP AI Exchange (https://owaspai.org/) project, contributing to the EU AI Act security standard and AI Red Teaming.

Jose Quesada is the founder and director of Data Science Retreat (DSR), an advanced ML bootcamp that has helped over 300 professionals land data science roles. With a PhD and 20+ years in machine learning, Jose brings a unique blend of technical depth and creative flair—he’s also a former photorealism artist. He has advised on impactful projects ranging from malaria diagnostics to sustainability-focused robotics.

Diana Waithanji is a Cybersecurity Engineer at SAP SE, with experience working across Europe and Africa. She is an advocate for data privacy as a fundamental human right and serves on two technical committees at the Kenya Bureau of Standards. Diana is also a board member at Nivishe Foundation, where she supports youth mental health through safe spaces. Her work bridges global standards, social impact, and cutting-edge security practices.

Ali Yazdani is a seasoned security professional with over a decade of experience spanning offensive security and secure development practices. Starting his career as a penetration tester, he now specializes in building scalable DevSecOps programs and embedding security into engineering workflows. Ali brings deep technical knowledge and a pragmatic approach to security culture. His mission is to empower teams to build safer software at scale and is currently a founder at Scandog.io

Pranav Vattaparambil is Chief Security Officer at Unosecur (https://www.unosecur.com/) as well as a security and product strategist with deep expertise in fintech. Formerly VP of Cybersecurity at the EU’s largest Banking-as-a-Service company, he also advises multiple startups on navigating security, risk, and go-to-market strategy. Pranav bridges the gap between technical execution and business impact, especially in regulated industries like banking and crypto. His focus is on helping companies build secure, scalable products from day one.

About Venture Café Berlin: Venture Café Berlin connects a community of innovators and entrepreneurs with free high-impact programming and events. Venture Café is a part of the CIC network, whose mission is to fix the world through innovation.

About Berlin Cybersecurity Social: This meetup is open to cybersecurity professionals of all levels, from beginners to experts. Whether you're a seasoned pro or just starting your journey in the field, this event is the perfect opportunity to connect with others who share your passion for cybersecurity.

About the AI Ethics Action Hub: A global, interdisciplinary collective dedicated to advancing ethical, inclusive, and accountable AI. We believe technology should be designed to respecting human dignity, planetary well-being, and intergenerational justice.

Berlin Cybersecurity Social #18: AI & Cybersecurity Sessions

Databricks Notebooks in production | Data engineering meetup | Berlin, July 22nd 2025-07-22 · 16:30

Let’s kick things off for another meetup, this time focusing on Databricks Notebooks in production as well as an interactive data engineering discussion. Join us for an engaging Meetup on July 22nd in Berlin - bring all your questions!

Databricks Notebooks in Production: Best Practices

Using notebooks is convenient when you write code for Databricks. However, it is a common opinion that they should not be used to run a production code. In my talk, I would like to argue that having notebooks in production may actually be the best decision compared to other deployment options, especially in terms of convenience, maintainability, and efficiency.

Interactive data engineering discussion

After the keynote, it’s your turn: we’ll fire up a live Mentimeter to get your thoughts, opinions, and hot takes. Then we’ll look at the results together to discuss, agree, disagree (nicely!), or just raise an eyebrow. There’s no prep needed. Just bring your phone to scan the QR code, and you’re in. Quick, easy, and mildly addictive!

What to expect:

Expert talks
Interactive Q&A
Networking opportunities
Some snacks & drinks :)

Timetable:

18:30 - Event admission
18:50 - Welcome & Introduction
19:00 - Databricks Notebooks in Production: Best Practices - Aleksandr Kudriavtcev
19:30 - 5 minutes break
19:35 - Interactive data engineering discussion
20:05 - Snacks, Drinks & Networking
21:30 - End

More on the -> applydata data engineering meetup page.

Our goal is to form a local data-loving community, so join us and let's talk data together! --- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here.

Databricks Notebooks in production | Data engineering meetup | Berlin, July 22nd

[Online] Democratizing Bayesian Modeling with Insight Agents: A Case Study 2025-06-17 · 16:00

🎙️ Speaker: Andy Heusser\, Luca Fiaschi \| ⏰ Time: 4 PM UTC / 9 AM PT / 12 PM ET / 6 PM Berlin

Insight Agents are purpose‑built AI coworkers that transform demanding analytical workflows into push‑button tasks. Built on a modular blend of retrieval‑augmented generation (RAG), tool calling, and sandboxed code execution, each agent automates the full statistical pipeline—from data exploration and validation to model fitting and interpretation—without requiring deep technical expertise.

The session showcases our Marketing Mix Modeling (MMM) Insight Agent, which compresses weeks of Bayesian MMM work into minutes by delegating tasks to specialized sub‑agents. You’ll see how this architecture delivers secure, explainable, and scalable results that let marketers focus on strategy instead of code.

MMM is only the first stop. We plan to extend the same framework to prototype Insight Agents for customer life-time value, causal impact analysis and more. We’ll dig into the design principles, share implementation lessons, and outline the roadmap from today’s collaborative “copilots” to tomorrow’s autonomous digital coworkers that proactively surface insights and drive better business outcomes.

Read More:

📜 Outline of Talk / Agenda:

5 min: Intro to PyMC Labs and speakers
45 min: Presentation, panel discussion
10 min: Q&A

💼 About the speaker:

Andy Heusser, PhD (Principal Data Scientist at PyMC Labs) Andy is a data science leader with over 15 years of experience. He began his career in academia, leveraging computational models and brain imaging to study human memory, as well as developing AI-driven tutoring algorithms using language models. He then served as the Director of Data Science at a digital health startup, where he led teams in building machine learning and AI solutions for digital mental health products. Today, he runs a data science consulting company called Cognitive Insight Consulting and is a Principal Data Scientist at PyMC Labs.

🔗 Connect with Andy: 👉 Linkedin: https://www.linkedin.com/in/andrew-heusser-3b6587b1/ 👉Github: https://github.com/andrewheusser

Luca Fiaschi, PhD - Partner at PyMC Labs With over 15 years of leadership experience in AI, data science, and analytics, Luca has driven transformative growth in technology-first businesses, such as Chief Data & AI Officer at Mistplay (+$200M) where he led revenue growth through AI-powered personalization and dynamic pricing algorithm, and holding executive roles at global industry leaders such as HelloFresh ($8B), Stitch Fix ($1.2B), and Rocket Internet ($1B). His core competencies span machine learning, artificial intelligence, analytics, data engineering, and computer vision, which he has applied for consumer-facing products within CPG, gaming, e-commerce, food delivery, meal kits, and healthcare, working with both fast-paced startups and F100 companies. Luca is a partner at PyMC Labs, providing insights and guidance on Generative AI to Fortune 500 companies. He holds a PhD in AI and Computer Vision from Heidelberg University and has more than 450 citations on his research work.

🔗 Connect with Luca: 👉 Linkedin: https://www.linkedin.com/in/lfiaschi/

💼 About the Host:

Thomas Wiecki (Founder of PyMC Labs) Dr. Thomas Wiecki is an author of PyMC, the leading platform for statistical data science. To help businesses solve some of their trickiest data science problems, he assembled a world-class team of Bayesian modelers and founded PyMC Labs -- the Bayesian consultancy. He did his PhD at Brown University studying cognitive neuroscience. 🔗 Connect with Thomas: 👉 Linkedin: https://www.linkedin.com/in/twiecki/ 👉 Website: https://www.pymc-labs.com/, https://twiecki.io/ 👉 GitHub: https://github.com/twiecki 👉 Twitter: https://twitter.com/twiecki

📖 Code of Conduct: Please note that participants are expected to abide by PyMC's Code of Conduct.

🔗 Connecting with PyMC Labs: 🌐 Website: https://www.pymc-labs.com/ 👥 LinkedIn: https://www.linkedin.com/company/pymc-labs/ 🐦 Twitter: https://twitter.com/pymc_labs 🎥 YouTube: https://www.youtube.com/c/PyMCLabs 🤝 Meetup: https://www.meetup.com/pymc-labs-online-meetup/

[Online] Democratizing Bayesian Modeling with Insight Agents: A Case Study

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

May 22 - AI, ML and Computer Vision Meetup 2025-05-22 · 17:00

When and Where

May 22\, 2025 \| 10:00 AM Pacific
Virtual - Register for the Zoom

CountGD: Multi-Modal Open-World Counting

We propose CountGD, the first open-world counting model that can count any object specified by text only, visual examples only, or both together. CountGD extends the Grounding DINO architecture and adds components to enable specifying the object with visual examples. This new capability – being able to specify the target object by multi-modalites (text and exemplars) – lead to an improvement in counting accuracy. CountGD is powering multiple products and has been applied to problems across different domains including counting large populations of penguins to monitor the influence of climate change, counting buildings from satellite images, and counting seals for conservation.

About the Speaker

Niki Amini-Naieni is a DPhil student focusing on developing foundation model capabilities for visual understanding of the open world at the Visual Geometry Group (VGG), Oxford supervised by Andrew Zisserman. In the past, Niki has consulted with Amazon and other companies in robotics and computer vision, interned at SpaceX, and studied computer science and engineering at Cornell.

GorillaWatch: Advancing Gorilla Re-Identification and Population Monitoring with AI

Accurate monitoring of endangered gorilla populations is critical for conservation efforts in the field, where scientists currently rely on labor-intensive manual video labeling methods. The GorillaWatch project applies visual AI to provide robust re-identification of individual gorillas and generate local population estimates in wildlife encounters.

About the Speaker

Maximilian von Klinski is a Computer Science student at the Hasso-Plattner-Institut and is currently working on the GorillaWatch project alongside seven fellow students.

This Gets Under Your Skin – The Art of Skin Type Classification

Skin analysis is deceptively hard: inconsistent portrait quality, lighting variations, and the presence of sunscreen or makeup often obscure what’s truly “under the skin.” In this talk, I’ll share how we built an AI pipeline for skin type classification that tackles these real-world challenges with a combination of vision models. The architecture includes image quality control, facial segmentation, and a final classifier trained on curated dermatological features.

About the Speaker

Markus Hinsche is the co-founder and CTO of Thea Care, where he builds AI-powered skincare solutions at the intersection of health, beauty, and longevity. He holds a Master’s in Software Engineering from the Hasso Plattner Institute and brings a deep background in AI and product development.

A Spot Pattern Is like a Fingerprint: Jaguar Identification Project

The Jaguar Identification Project is a citizen science initiative actively engaging the public in conservation efforts in Porto Jofre, Brazil. This project increases awareness and provides an interesting and challenging dataset that requires the use of fine-grained visual classification algorithms. We use this rich dataset for dual purposes: teaching data-centric visual AI and directly contributing to conservation efforts for this vulnerable species.

Learn more: Jaguar Identification Project | Jaguar Conservation NGO in Brazil | Porto Jofre – Poconé, State of Mato Grosso, Brazil

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

May 22 - AI, ML and Computer Vision Meetup

Data engineering meetup | Snowflake, dbt, Dagster | May 22, Berlin 2025-05-22 · 16:30

Let’s kick things off for another meetup, this time focusing on building a scalable, decentralized data platform and business intelligence. Join us for an engaging Meetup on May 22nd in Berlin - bring all your questions!

Lessons from the Cloud: Building a Scalable, Decentralized Data Platform with Snowflake, dbt & S3

Fahad Hassan, data engineering team lead at Ratepay, will share practical patterns and lessons learned while designing a multi-warehouse data platform across analytics, risk, and finance — including decisions to simplify modeling, decentralize ownership, and optimize cost and governance in Snowflake. He’ll also highlight challenges faced with legacy systems and how Ratepay tackled them using dbt and clean lake structures.

Business Intelligence in practice: Dagster & DBT

Enabling the business to make informed decisions based on data requires a lot of developer work. From integrating external partner services to building data models, you need to make sure the data is fresh and can be relied upon to see the full scope of the problem. Emilija Dankevičiūtė, a data engineer from diconium data, will present an architecture that uses Dagster and dbt to build a central data processing platform.

What to expect:

Two expert talks
Interactive Q&A
Networking opportunities
Some snacks & drinks :)

Timetable:

18:30 - Event admission
18:50 - Welcome & Introduction
19:00 - Business Intelligence in practice: Dagster & DBT - Emilija Dankevičiūtė
19:30 - 5 minutes break
19:35 - Lessons from the Cloud: Building a Scalable, Decentralized Data Platform with Snowflake, dbt & S3 - Fahad H.
20:05 - Snacks, Drinks & Networking
21:30 - End

More on the -> applydata data engineering meetup page.

Our goal is to form a local data-loving community, so join us and let's talk data together!

--- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here.

Data engineering meetup | Snowflake, dbt, Dagster | May 22, Berlin

PyData Berlin 2025 May Meetup 2025-05-21 · 17:00

Welcome to the PyData Berlin May meetup!

We would like to welcome you all starting from 18:45. There will be food and drinks. The talks begin around 19.30 and the doors will close at 19:30. Make sure to arrive on time!

Please provide your first and last name for the registration because this is required for the venue's entry policy. If you cannot attend, please cancel your spot so others are able to join as the space is limited.

Host: Ecosia is excited to welcome you to this month's version of PyData.

Entrance is in Hof 4 - there will be signs - then up to the 3rd floor of the building.

**************************************************************************

The Lineup for the evening

Talk 1: Specializing Small Language Models With Less Data Abstract: I will present a practical, end-to-end solution for training SLMs using synthetic data, covering key aspects from data curation through training to model evaluation. You will leave with concrete strategies for building efficient, domain-specific language models for production environments. Most AI teams are exploring the possibilities of LLMs rather than being focused on margins, but soon, efficiency will become important. Small, specialized language models (SLMs) offer a promising alternative, but training them requires extensive manually-labeled datasets - a significant engineering bottleneck. In this talk, I will discuss how large language models can be used to help generate and curate the data needed for SLM training. Using extractive question answering as a case study, We'll examine how this approach can dramatically reduce data collection time while maintaining model performance.

Speaker: Jacek Golebiowski Bio: Jacek is the CTO of distil labs, building specialised AI agents that can be deployed on-device/on-prem with minimal data. Before that, he was a machine learning team lead at AWS, focused on Automated ML and natural language processing. He holds a PhD in Machine Learning for Quantum Mechanics from Imperial College London.

--- Talk 2: Exploring fairlearn and practical strategies for assessing and mitigating harm in AI systems Abstract: As AI becomes a more significant part of our everyday lives, ensuring these systems are fair is more important than ever. In this session, we’ll discuss how to define fairness and the potential harms our algorithms can have on people and society. We'll introduce fairlearn, a community-driven, open-source project that offers practical tools for assessing and mitigating harm in AI systems. We’ll also explore how to discuss bias, different types of harm, the idea of group fairness and how they all relate to fairlearn's toolkit. To make it all concrete, we’ll walk through a real-world example of assessing fairness and share some hands-on strategies you can use to mitigate harm in your own ML projects.

Speaker: Tamara Atanasoska Bio: Tamara is a software engineer, OSS contributor and maintainer and NLP researcher.

--- Lightning talks There will be slots for 2-3 Lightning Talks (3-5 Minutes for each). Kindly let us know if you would like to present something at the start of the meetup :)

*** NumFOCUS Code of Conduct

THE SHORT VERSION Be kind to others. Do not insult or put down others. Behave professionally. Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for NumFOCUS. All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery are not appropriate. NumFOCUS is dedicated to providing a harassment-free community for everyone, regardless of gender, sexual orientation, gender identity, and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of community members in any form. Thank you for helping make this a welcoming, friendly community for all.

If you haven't yet, please read the detailed version here: https://numfocus.org/code-of-conduct ***

PyData Berlin 2025 May Meetup

Berlin Cybersecurity Social #15 2025-04-25 · 17:00

Are you a cybersecurity professional looking to connect with like-minded professionals, share experiences, and make friends? Look no further! Join us for the Berlin Cybersecurity Social for a fantastic evening of networking.

Agenda:

7:00 PM - 7:15 PM: Welcome
7:15 PM - 7:30 PM: Lightning Talk: "No one does Security Alone: Building Security Champions Programs”

Cybersecurity isn’t just about tools and tech—it’s about people. In this honest and motivational talk, Felipe shares his personal journey advocating for security and building inclusive, impactful security communities within organizations of all sizes.

Drawing from real-world experience—including implementing Security Champions programs—Felipe will dive into the challenges and opportunities that come with navigating organizational dynamics, empowering others to speak up, and addressing hidden risks through authentic human connection.

Whether you’re a seasoned security leader, just starting out in the field, a founder seeking to embed security into your culture, or someone championing security from within, this talk offers fresh perspective, practical insights, and a powerful reminder: you don’t have to do security alone.

Who Should Attend:

Cybersecurity Professionals and Team Leads
Founders wanting to create a security culture in their organisation
IT professionals with an interest in security
Cybersecurity enthusiasts *
7:30 PM - 9:45 PM: Icebreaker & Networking

Mingle with fellow professionals from the cybersecurity industry. Share insights, discuss recent developments, and exchange ideas

About the Speaker: Felipe Olivera

Felipe Olivera is a seasoned cybersecurity professional with deep roots in secure software engineering, technical leadership, and application security. Currently at sennder, he’s spearheading the company’s security culture—running a robust Security Champions Program and scaling AppSec across 20+ teams.

Previously, Felipe held key security roles at Yokoy and NextRoll, where he shaped strategic security roadmaps, led vulnerability management initiatives, and worked closely with engineering to strengthen their software development lifecycles (SDLCs). Earlier in his career, he built a strong foundation in cyber risk consulting at Deloitte and also served as an instructor for Information Security and Secure Coding at Universidad Católica del Uruguay.

Driven by a passion for both technical depth and cross-functional collaboration, Felipe thrives on translating complex risks into actionable improvements—and making security something everyone in the org can understand and champion.

NB: c-base is a cash only venue. Soft drinks and beers are available to purchase.

Why Attend?

Connect with professionals in the cybersecurity field.
Share your experiences and insights.
Expand your professional network.
Discover collaboration opportunities.
Enjoy a relaxed evening and make new friends

This meetup is open to cybersecurity professionals of all levels, from beginners to experts. Whether you're a seasoned pro or just starting your journey in the field, this event is the perfect opportunity to connect with others who share your passion for cybersecurity.

About the Location: c-base e. V. is a non-profit association located in Berlin, Germany which has about 300 members. The purpose of this association is to increase knowledge and know-how regarding computer software, hardware and data networks. The premises of the association are also used by other initiatives in and around Berlin as an event location or as function rooms, for example the wireless community network freifunk.net, the Chaos Computer Club or the Wikipedia group in Berlin.

About Berlin Cybersecurity Social: This meetup is open to cybersecurity professionals of all levels, from beginners to experts. Whether you're a seasoned pro or just starting your journey in the field, this event is the perfect opportunity to connect with others who share your passion for cybersecurity.

Berlin Cybersecurity Social #15

April 25 - Berlin AI, Machine Learning and Computer Vision Meetup 2025-04-25 · 15:30

Register to reserve your spot

Date and Time

April 25, 2025 from 5:30 PM to 8:30 PM

Location

The Meetup will take place at MotionLab.Berlin, Bouchéstraße 12/Halle 20 in Berlin

Leaving No Pixels Behind: Deep Learning for Perfect Cutouts

Removing backgrounds from images is a challenging task, even for advanced deep learning models. The human eye is highly sensitive to minor imperfections, making high-quality outcomes crucial. In this talk, Imran Kocabiyik will demonstrate how withoutbg achieves clean, natural-looking image extractions while addressing the issues of costly training data and the need to handle diverse image types. Their approach effectively balances intelligent model design and meticulous data selection, resulting in impressive performance suited for real-world applications.

About the Speaker

Imran Kocabiyik is a technologist building AI-driven tools for marketing and creative automation. He is the founder of withoutbg and a Senior Data Scientist at Klarna.

AI on the Dance Floor: Multimodal Segmentation of Choreography Videos

Ever struggled to learn a dance routine by constantly rewinding YouTube videos? In this talk, Paras presents an approach based on temporal convolutional networks and pose estimation to automatically segment choreography videos into individual moves by leveraging both audio and visual modalities.

About the Speaker

Dr. Paras Mehta is a computer scientist and co-founder of sylby, where he leads AI engineering for pronunciation training and language learning applications.

When Images Look Alike: Intro to Dataset Curation

This talk introduces dataset curation in computer vision, focusing on visually similar images. We discuss use cases in vacation rental search and art recommendations. We demonstrate how Voxel51 helps identify image similarity, improving data quality and model reliability.

About the Speaker

Antonio Rueda-Toicen, an AI Engineer in Berlin, has extensive experience in deploying machine learning models and has taught over 300 professionals. He is currently a Research Scientist at the Hasso Plattner Institute. Since 2019, he has organized the Berlin Computer Vision Group and taught at Berlin’s Data Science Retreat. He specializes in computer vision, cloud technologies, and machine learning. Antonio is also a certified instructor of deep learning and diffusion models in NVIDIA’s Deep Learning Institute.

EnvisionHGdetector: A Framework for Detecting and Analyzing Hand Gestures During Speech

We present EnvisionHGdetector, a toolkit for studying hand movements during speech. It measures hand motion, compares gestures, and labels gesture segments using Mediapipe tracking and a custom neural network. Tested on over 8,000 gestures, it achieved approximately 75% accuracy. We also discuss plans to improve accessibility for gesture researchers.

About the Speaker

Sharjeel Shaikh is currently pursuing an MSc in Data Science at the University of Potsdam. He works at HPI on Gesture Detection and Data Masking.

April 25 - Berlin AI, Machine Learning and Computer Vision Meetup

PyData Berlin 2025 March Meetup 2025-03-19 · 18:00

Welcome to the PyData Berlin March meetup!

We would like to welcome you all starting from 18:45. There will be food and drinks. The talks begin around 19.30 and the doors will close at 19:30. Make sure to arrive on time!

*** Important!! *** Please keep in mind that there is a BVG strike on this day, affecting U-Bahn, trams, and buses. S-Bahn and regional trains will work.

Please provide your first and last name for the registration because this is required for the venue's entry policy. If you cannot attend, please cancel your spot so others are able to join as the space is limited.

Host: Bonial is excited to welcome you to this month's version of PyData. ************************************************************************** The Lineup for the evening

Talk 1: Extract structured product & deal information from PDFs on scale via LLM Abstract: Bonial shows hundreds of thousands of offers from local brick-and-mortar retailers on its platform, a subset of this content is retrieved from PDF files. In this talk I’ll explain how we leverage LLM to parse unstructured PDF files to create content on our platform.

Speaker: Philipp Johannis has been part of Bonial for 12 years. He established and leads the Data Department, which consists of multiple Analytics, Engineering & Data Science teams, and is currently serving as Head of Data. He focuses on improving the data platform and enabling and supporting the development of various data driven products such as personalisation and traffic management.

Talk 2: Airweave, an Open-Source Tool To Turn Any App Into Accessible Agent Knowledge Abstract: The talk will be an introduction to Airweave, which is an open-source Python tool that helps agent developers turn app data into accessible knowledge for AI agents. It connects to any app, database, URL, or API and structures the data for retrieval. Airweave automates authentication, ingestion, enrichment, mapping, and syncing to vector stores and graph databases of choice. It has a search layer for agents out-of-the-box and allows extension of the platform with minimal code. Developers can use Airweave via our web UI, REST API, or SDKs.

Speakers: Lennert Jansen and Rauf Akdemir are the creators of Airweave AI. Lennert is an AI Engineer & Researcher with a background in Applied Statistics and Deep Learning for NLP. Before Airweave, he worked on AI & Bayesian Statistics at Amazon, IBM, and the University of Amsterdam. Rauf is a CS graduate from Technical University of Delft, with strong engineering experience in productionising ML & data infrastructure in both start-ups and enterprise.

Lightning talks There will be slots for 2-3 Lightning Talks (3-5 Minutes for each). Kindly let us know if you would like to present something at the start of the meetup :)

*** NumFOCUS Code of Conduct THE SHORT VERSION Be kind to others. Do not insult or put down others. Behave professionally. Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for NumFOCUS. All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery are not appropriate. NumFOCUS is dedicated to providing a harassment-free community for everyone, regardless of gender, sexual orientation, gender identity, and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of community members in any form. Thank you for helping make this a welcoming, friendly community for all. If you haven't yet, please read the detailed version here: https://numfocus.org/code-of-conduct ***

PyData Berlin 2025 March Meetup

talk-data.com

People (556 results)

Companies (1 result)

Activities & events

Kaan Ara: "*Databricks Cost Optimization: A Multi-Layered Strategy for Performance and Efficiency"***

Who's the data expert in the room? Interactive data pub quiz

Databricks Notebooks in Production: Best Practices

Interactive data engineering discussion

People (556 results)

Companies (1 result)

Activities & events

Kaan Ara: "Databricks Cost Optimization: A Multi-Layered Strategy for Performance and Efficiency"**

Who's the data expert in the room? Interactive data pub quiz

Databricks Notebooks in Production: Best Practices

Interactive data engineering discussion

Kaan Ara: "*Databricks Cost Optimization: A Multi-Layered Strategy for Performance and Efficiency"***