talk-data.com
People (387 results)
See all 387 →Companies (2 results)
Activities & events
| Title & Speakers | Event |
|---|---|
|
(IN-PERSON) Fabric CI/CD & Intro to Fabric Data Pipelines
2025-12-18 · 17:00
This is an In-Person Evening Session Detailed information and registration can be found here : https://dataminds.be/fabric-cicd-intro-to-fabric-data-pipelines-in-person-ae/ Fabric CI/CDSpeaker: Oliver De Vijt In Microsoft Fabric, deploying all items efficiently across environments requires a robust CI/CD strategy. While Fabric’s built-in deployment pipelines offer a no-code approach for basic promotion between workspaces, they lack flexibility, automation capabilities, and testing. In this session, we’ll dive deep into Fabric CI/CD practices, leveraging Git integration, the Fabric REST APIs, and the fabric-cicd Python library to automate the deployment of Fabric items such as semantic models, notebooks, dataflows, and reports. You’ll see how to design YAML-based Azure DevOps pipelines that deploy from source control into your Fabric development, test, and production workspaces, and even include steps to run and validate your models using Tabular Editor. By the end of this session, you’ll understand how to implement your own end-to-end CI/CD framework for Microsoft Fabric, even in Power BI Pro environments, using entirely free and open-source tools. Intro to Fabric (Data) PipelinesSpeakers: Koen Verbeeck Fabric Pipelines, formerly known as Fabric Data Pipelines, is the low-code ELT (extract-load-transform) solution in Microsoft Fabric. With this tool, you can orchestrate various activities, such as reading data from various data sources, executing Spark notebooks, running SQL statements on your warehouse and so on. In the wider Azure Data Platform, Fabric Pipelines are a close sibling of Azure Data Factory and Synapse Pipelines. In this session, we’ll cover the following topics:
After the session, you’ll be able to start your own data engineering journey with Fabric Pipelines in Microsoft Fabric. Knowledge of general ETL concepts are a plus when attending this presentation. |
(IN-PERSON) Fabric CI/CD & Intro to Fabric Data Pipelines
|
|
Episode 261: 🇳🇱 C++ Under the Sea 🇳🇱 Bernhard, Koen & C++26 Reflection!
2025-11-21 · 13:00
Conor Hoekstra
– host
,
Bryce Adelstein Lelbach
– host
,
Koen
– Team Lead for HMI Software
@ NV Michel Van de Wiele
,
Bernhard
– Senior system software engineer
@ NVIDIA
,
Ben Deane
– guest
In this episode, Conor and Bryce record live from C++ Under the Sea! We interview Bernhard, Koen, talk about C++26 Reflection and more! Link to Episode 261 on WebsiteDiscuss this episode, leave a comment, or ask a question (on GitHub)Socials ADSP: The Podcast: TwitterConor Hoekstra: Twitter | BlueSky | MastodonBryce Adelstein Lelbach: TwitterAbout the Guests: Bernhard is a senior system software engineer at NVIDIA, where he extends, optimizes and maintains the CUDA Core Compute Libraries (CCCL). Previously, he worked as software engineer among physicists at CERN on real-time and embedded software for the Large Hadron Collider, as well as data layout abstractions for heterogeneous architectures, for which he received a PhD in High Performance Computing from the University of Dresden, Germany. Before, he implemented GPU accelerated simulations and 3D visualizations of industrial machining processes. Since 2022, Bernhard is a voting member of WG21 and his interests span geometry, 3D visualizations, optimization, SIMD, GPU computing, refactoring and teaching C++. Koen is an engineer specializing in high-quality software with a strong mathematical foundation. With a PhD in Computer Science from KU Leuven, his work bridges applied mathematics and performance-critical software engineering. As Team Lead for HMI Software at NV Michel Van de Wiele, he focuses on developing C++/Qt applications for textile production systems, optimizing performance, usability, and cloud integration. Passionate about elegant, efficient solutions, Koen brings deep expertise in numerical methods, system optimization, and software architecture. Show Notes Date Recorded: 2025-10-10 Date Released: 2025-11-21 Thrust DocsCUB LibraryC++26 Reflection ProposalADSP Episode 39: How Steve Jobs Saved Sean ParentParrotParrot on GitHubSean's C++ Under the Sea KeynoteParrot sumIntro Song Info Miss You by Sarah Jansen https://soundcloud.com/sarahjansenmusic Creative Commons — Attribution 3.0 Unported — CC BY 3.0 Free Download / Stream: http://bit.ly/l-miss-you Music promoted by Audio Library https://youtu.be/iYYxnasvfx8 |
ADSP: Algorithms + Data Structures = Programs |
|
La nueva era de los cálculos temporales: Time Intelligence basada en calendario + UDFs en acción
2025-11-20 · 19:10
Houssein Laftouh Zaidi
– Speaker
Explora paso a paso cómo funciona la Time Intelligence basada en calendario en Power BI y cómo combinarla con User Defined Functions para mejorar la eficiencia y la mantenibilidad de los modelos. Ideal para quienes quieren entender a fondo esta capa de inteligencia temporal y aplicarla a escenarios reales. |
|
|
La Ingeniería detrás del Modelo Semántico
2025-11-20 · 18:20
Daniel Pérez König
– Consultor de datos
@ isolutions
Esta sesión presenta la ingeniería detrás de un modelo semántico en Power BI y Microsoft Fabric. Incluye cómo estructurar un modelo estrella con más de un millón de filas, comparar conectores Import, DirectQuery y Composite con Performance Analyzer, y analizar el impacto en la latencia. También comparte estrategias prácticas para reducir la cardinalidad, usar agregaciones, materializar tablas en Lakehouse y diseñar perspectivas para facilitar la vida del usuario final. |
|
|
Indexing for Dummies - Koen Verbeeck
2025-09-25 · 16:00
🔹 Is your database a bit slow? Do you regularly go for a cup of coffee because that query in SSMS is taking too long? This means you might need some Indexing Magic™ in your life! With Indexing Magic®, you too can speed up queries like you've never seen before! In this session, we'll guide you through the wonderful world of clustered indexes, index seeks, columnstore indexes and so much more. 🔹 We'll explain why there important and what their impact can be on your database. We'll also explore how they are implemented in the various Microsoft SQL products, such as SQL Server, Azure SQL DB, Dedicated SQL Pool and Fabric. 🔹 At the end of the session, you'll have a solid understanding of indexes and how you can use them to improve query performance. Basic SQL knowledge is assumed. |
Indexing for Dummies - Koen Verbeeck
|
|
How to use CI/CD pipelines for dbt and deploy dbt Cloud using Infrastructure as Code + sneak peak into dbt Fusion
2025-07-01 · 19:15
Koen Graat
– Principal Data Engineer
@ Xebia Data
Are you tired of manual and failing dbt deployments? This talk explores how CI/CD and IaC can revolutionize your data transformation workflows, enhancing collaboration and data quality within your dbt projects. Learn the core concepts of CI/CD, including automated testing and deployment pipelines, I will guide you through building a CI/CD pipeline for dbt, triggering it with code changes and running comprehensive tests. Next to that we will dive into Infrastructure as Code (IaC) and how it automates dbt Cloud deployments using tools like Terraform. You will gain practical knowledge for automating dbt Cloud resources, projects, and environments. As a bonus we will do a sneak peek into the recently announced dbt Fusion engine. |
|
|
Armand Duijen
– Data Engineer
@ Studyportals
,
Katie Scheitzer
– Senior Analytics Engineer
@ Studyportals
We strive for our dbt project to be ready by 9am for our stakeholders. Should be easy, right? Except that our dbt project consists of around 450 dbt models and over 30 sources. Some of those sources are ready as early as midnight but some as late as 4am, and in total our project takes around 4 hours to run. Join as us we walk through the evolution of our dbt run setup, from one selector, to a set of parallel commands, to today's setup -- a dynamic lineage in Airflow which runs models when and only when the upstream source is ready. It's finished when the Tableau datasource is refreshed and our stakeholders can start their day with the latest data. |
|
|
Un agent IA, c’est quoi exactement ?
2025-05-27 · 17:00
AI/ML
|
|
|
Quand la R&D rencontre l’IA publique
2025-05-27 · 17:00
AI/ML
|
|
|
Les dernières news sur les IA génératives
2025-05-27 · 17:00
AI/ML
|
|
|
Episode 206: 🇳🇱 C++ Under the Sea Live 🇳🇱 Jason Turner, Inbal Levi & More!
2024-11-01 · 11:00
Conor Hoekstra
– host
,
Jason Turner
– Host of the YouTube channel C++Weekly; co-host emeritus of the podcast CppCast; author of C++ Best Practices
,
Bryce Adelstein Lelbach
– host
,
Koen Poppe
– guest
,
Inbal Levi
– Keynote speaker
,
Jonathan Müller
– guest
,
Jan Williams
– guest
In this episode, Conor and Bryce record live from C++ Under the Sea and interview both keynote speakers, Jason Turner and Inbal Levi as well as speak to Jan Williams, Koen Poppe and Jonathan Müller briefly! Link to Episode 206 on WebsiteDiscuss this episode, leave a comment, or ask a question (on GitHub)Twitter ADSP: The PodcastConor HoekstraBryce Adelstein LelbachGuests Interviewed Jan WilmansJason TurnerJonathan MüllerInbal LeviKoen PoppeShow Notes Date Recorded: 2024-10-11 Date Released: 2024-11-01 C++ Under the SeaVIMEChttps://nullptr.nl/C++ Under the Sea - Bryce Lelbach C++ Execution Model TalkC++ Under the Sea - Jason Turner KeynoteCppCastC++ Weekly YouTube Channelcode_report YouTube ChannelADSP Episode 103: Jason Turner from CppCast!C++ Under the Sea - Inbal Levi Closing KeynoteC++26 Reflection ProposalTheWholeDaisy TwitchP3045 Quantities and units libraryADSP Episode 195: 🇨🇦 CppNorth Live 🇨🇦 David Olsen & Pure Chaos!Vandewiele GroupC++ Under the Sea - Logging TalkMinimal Logging Framework in C++20 - Koen Poppe - Meeting C++ 2023C++ Flux Libraryflux::adjacent_mapIntro Song Info Miss You by Sarah Jansen https://soundcloud.com/sarahjansenmusic Creative Commons — Attribution 3.0 Unported — CC BY 3.0 Free Download / Stream: http://bit.ly/l-miss-you Music promoted by Audio Library https://youtu.be/iYYxnasvfx8 |
ADSP: Algorithms + Data Structures = Programs |
|
🛡️🤖 AI Assistants & ⚙️ Data Ops: 📈🐛➡🦋🚀 PyData Heilbronn #1 @ IPAI 🚀
2024-02-06 · 17:00
🚨 👉 NEW DATE 6 FEB 👈🚨 🎉 We are thrilled to announce that PyData Südwest is hosting its first-ever meetup at Innovation Park Artificial Intelligence (IPAI) in Heilbronn! 🎉 Agenda 18:00 Doors open 18:30 Welcome - Alexander CS Hendorf 18:45 Transform Your Business: A Roadmap to Being Data-Driven with Data Ops - Simon Pressler 19:15 Data Versioning with LakeFS and LakeFS-Spec - Max Mynter & Jan Willem Kleinrouweler 19:45 🍻 🍕 🕸️ Break: Networking with snacks and beverages 20:30 My Data Stays Here! Implementing AI Assistants in a Data-Saving Way - Hans-Peter Zorn 21:00 Networking with snacks and beverages 21:30 End Lightning Talks ⚡️😬 No LTs this time! How to sign up for on site It's important for us to make this meet up happen in a responsible way. We have limited seats available only. No limits to sign up remotely! How to join remotely Join the live stream on YouTube. The event is in English. This event was moved from 24 Jan due to the train strike. ---- Talk #1 📈🐛➡🦋🚀 Transform Your Business: A Roadmap to Being Data-Driven with Data Ops" Simon Pressler (Königsweg AI) Many organizations ask themselves: do we have a data treasure? - how can we find out and how do we start? In the current data-centric business environment, this question becomes increasingly relevant. Drawing from extensive experience in the field, this talk addresses those who are enthusiastic about data yet may find themselves at the outset of their journey, or uncertain about navigating their path to success. DataOps, an interdisciplinary field seamlessly integrates data analysis, software development, and system operations. This session aims to provide clarity and direction for unlocking the potential of your data assets, marking the beginning of a transformative expedition into effective data utilization. Simon is a Data Scientist holds a Master's Degree in Comparative and International Studies from ETH-Zürich and a Master's in Data Science from the University of Mannheim. Outside of his professional life, Simon is passionate about long-distance hiking, a pursuit that showcases his dedication and resilience Talk #2 ⚙️📈 Data Versioning with LakeFS and LakeFS-Spec Max Mynter & Jan Willem Kleinrouweler (appliedAI Institute for Europe) Data versioning, essential for reproducible Machine Learning, involves maintaining and accessing various versions of data sets over time. This talk highlights LakeFS, a tool enabling Git-like versioning in data lakes, and LakeFS-Spec, a Python library developed by appliedAI Institute for Europe. LakeFS-Spec, compatible with FSSpec-supported packages like Pandas, allows intuitive LakeFS interactions and offers advanced features like client-side caching and direct file system operations. Max is an MLOps engineer at appliedAI Institute for Europe. He has a background in Physics and Social Sciences, spent some time as a visiting scholar at UC Berkeley’s School of Information, and had previous stints as a Data Scientist at an energy-tech start-up and at Allianz Global Investors as a Quantitative Risk Analyst. Jan Willem is the Head of ML Engineering at appliedAI Institute for Europe. He received his PhD in Computer Science from the VU University Amsterdam. Before joining appliedAI, he worked as applied researcher and portfolio manager in the fields of media delivery, mobile networks, edge computing, and IoT. Talk#3 🛡️🤖 My Data Stays Here! Implementing AI Assistants in a Data-Saving way Hans-Peter Zorn (inovex) ChatGPT has become an integral part of many people's professional lives. Nevertheless, many companies are reluctant to provide their employees with enterprise chat solutions. In this talk, I will demonstrate various ways in which such a service can be implemented in a data-saving and data protection-compliant manner. Be it through a private endpoint in the Azure Cloud or open source models under your own control. Hans-Peter works as Head of AI and CTO at inovex to help customers overcome their challenges - sometimes with the help of AI. He studied computational linguistics in Heidelberg and computer science at KIT. He then spent a long time working on speech dialog systems, natural language processing and big data architectures. A big thank you to our sponsors:
Contact
|
🛡️🤖 AI Assistants & ⚙️ Data Ops: 📈🐛➡🦋🚀 PyData Heilbronn #1 @ IPAI 🚀
|
|
December lightning talks!
2023-12-12 · 16:00
The final meetup of 2023 will be a set of lightning talks from speakers all over the world. You can visit the schedule here: Be aware, the times are in UTC! Timezones are very hard, ask me how I keep learning that... thankful-field-057fcc110.4.azurestaticapps.net/Index.html The event kicks off at 16:00 UTC, 17:00 GMT+1, order of speakers: Deepthi Goguri Sarath Sasidharan Koen Verbeeck Roy Menger Javier Villegas Jon Vöge Hugo Kornelis John Miner |
December lightning talks!
|
|
Elasticsearch in Action, Second Edition
2023-12-10
Madhusudhan Konda
– author
Build powerful, production-ready search applications using the incredible features of Elasticsearch. In Elasticsearch in Action, Second Edition you will discover: Architecture, concepts, and fundamentals of Elasticsearch Installing, configuring, and running Elasticsearch and Kibana Creating an index with custom settings Data types, mapping fundamentals, and templates Fundamentals of text analysis and working with text analyzers Indexing, deleting, and updating documents Indexing data in bulk, and reindexing and aliasing operations Learning search concepts, relevancy scores, and similarity algorithms Elasticsearch in Action, Second Edition teaches you to build scalable search applications using Elasticsearch. This completely new edition explores Elasticsearch fundamentals from the ground up. You’ll deep dive into design principles, search architectures, and Elasticsearch’s essential APIs. Every chapter is clearly illustrated with diagrams and hands-on examples. You’ll even explore real-world use cases for full text search, data visualizations, and machine learning. Plus, its comprehensive nature means you’ll keep coming back to the book as a handy reference! About the Technology Create fully professional-grade search engines with Elasticsearch and Kibana! Rewritten for the latest version of Elasticsearch, this practical book explores Elasticsearch’s high-level architecture, reveals infrastructure patterns, and walks through the search and analytics capabilities of numerous Elasticsearch APIs. About the Book Elasticsearch in Action, Second Edition teaches you how to add modern search features to websites and applications using Elasticsearch 8. In it, you’ll quickly progress from the basics of installation and configuring clusters, to indexing documents, advanced aggregations, and putting your servers into production. You’ll especially appreciate the mix of technical detail with techniques for designing great search experiences. What's Inside Understanding search architecture Full text and term-level search queries Analytics and aggregations High-level visualizations in Kibana Configure, scale, and tune clusters About the Reader For application developers comfortable with scripting and command-line applications. About the Author Madhusudhan Konda is a full-stack lead engineer, architect, mentor, and conference speaker. He delivers live online training on Elasticsearch and the Elastic Stack. Quotes Madhu’s passion comes across in the depth and breadth of this book, the enthusiastic tone, and the hands-on examples. I hope you will take what you have read and put it ‘in action’. - From the Foreword by Shay Banon, Founder of Elasticsearch Practical and well-written. A great starting point for beginners and a comprehensive guide for more experienced professionals. - Simona Russo, Serendipity The author’s excitement is evident from the first few paragraphs. Couple that with extensive experience and technical prowess, and you have an instant classic. - Herodotos Koukkides and Semi Koen, Global Japanese Financial Institution |
O'Reilly Data Engineering Books
|
|
PyData Heidelberg #12: Convex Optimization with DSP & xAI with SHAP + Lime
2023-12-05 · 17:00
DataScience and AI: in person in Heidelberg and live on PyData.TV on YouTube Agenda 18:00 Doors open 18:30 Welcome 18:45 Philipp Schiele - Introducing Disciplined Saddle Programming (DSP): A New Paradigm in Convex Optimization 19:15 Break: Networking with snacks and beverages 20:00 Christophe Krech - Unveiling the Black Box: Exploring Explainable AI with SHAP and Lime for Tabular Data in Python 20:30 Lightning Talks 20:45 Networking with snacks and beverages 21:30 End Lightning Talks Join us by contributing a five-minute lightning talk! Fill out this form. How to sign up for on site It's important for us to make this meet up happen in a responsible way. We have limited seats available only. No limits to sign up remotely! How to join remotely Join the live stream on YouTube. Q&A Ask via Slido This event will be in English. ---- Talk #1 Philipp Schiele (Ludwig Maximilian University of Munich) Introducing Disciplined Saddle Programming (DSP): A New Paradigm in Convex Optimization Disciplined Saddle Programming (DSP), a new Python-based domain-specific language, significantly enhances the approach to convex-concave saddle problems, crucial in fields like game theory, machine learning, and finance. Hosted on GitHub, DSP extends the CVXPY framework, streamlining the dualization process in optimization. DSP focuses on robust optimization problems, providing an intuitive interface for problem specification and resolution. It builds upon the conic-representable saddle programs by Juditsky and Nemirovski, applying disciplined convex programming to saddle problems. DSP's introduction is a call to the broader scientific and engineering communities to explore its diverse applications. It simplifies complex optimization tasks, making them more accessible and manageable, and holds the potential to significantly impact various optimization-reliant fields. Philipp Schiele's educational background is in finance and economics and he is currently pursuing a PhD in financial econometrics at the Ludwig Maximilian University of Munich, where he taught various courses in statistics. He is a CVXPY maintainer and has presented a tutorial at SciPy 2022. Generally, he is enthusiastic about finance, optimization, and technology, especially open-source projects. Apart from that, he also conducts workshops at SciPy US on "Controlling Self-Landing Rockets Using CVXPY" 🚀 Talk #2 Christophe Krech - Unveiling the Black Box: Exploring Explainable AI with SHAP and Lime for Tabular Data in Python In the era of complex machine learning models, understanding and interpreting their decisions is crucial for fostering trust and transparency; and will become a regulatory requirement for many applications with the implementation of the EU AI act. Explainable AI (XAI) specifically tailored for tabular data can demystify the black box of machine learning. SHAP (SHapley Additive exPlanations) and Lime (Local Interpretable Model-agnostic Explanations) are to very powerful model-agnostic tools to enhance the understanding of model predictions, troubleshoot biases, and communicate machine learning insights effectively. Thanks to great open-source implementations, they can also be seamlessly integrated into existing Python workflows in many real-world applications. Christophe Krech is a senior data scientist at Experian. During his studies in Mannheim and Darmstadt, he already focused on the explainability of machine learning methods and the associated regulatory challenges. After completing his master’s degree in data science, he joined the global information service provider Experian in 2019. Since then, he has been supporting FinTechs, e-commerce retailers and banks in the successful use of machine learning for risk management. Explainability of machine learning models and their implementation in Python are an integral part of his work there. ---- Lightning Talks: 1. Your spot! Submit a talk here 2. Your spot! Submit a talk here 3. Your spot! Submit a talk here Acknowledgements Also a big thank you to our sponsors:
Contact If you have any questions or suggestions, please feel free to contact us via:
|
PyData Heidelberg #12: Convex Optimization with DSP & xAI with SHAP + Lime
|
|
DataScience and AI: in person in Karlsruhe and live on PyData.TV on YouTube Talks: 1. Johannes Bechberger - DIY Python Debugger 2. Jakob Ernst - Operating an AI/ML-Driven Product at Scale for 10+ Years: Successes\, Failures\, and Learnings Agenda 18:00 Doors open 18:30 Welcome 18:45 Johannes Bechberger - DIY Python Debugger 19:15 Break: Networking with snacks and beverages 20:00 Jakob Ernst - Operating an AI/ML-Driven Product at Scale for 10+ Years: Successes, Failures, and Learnings 20:30 Lightning Talks 20:45 Networking with snacks and beverages 21:30 End Lightning Talks Join us by contributing a five-minute lightning talk! Fill out this form. How to sign up It's important for us to make this meet up happen in a responsible way. We have limited seats available only. How to join remotely Join the live stream on YouTube. Q&A Ask via Slido This event will be in English. ---- Talk #1 Johannes Bechberger (SAP) DIY Python Debugger Debuggers are indispensable tools for all Python developers, empowering them to conquer bugs and unravel complex systems. But have you ever wondered how they work? Curious about the implementation of features like conditional breakpoints and single stepping? Join me for a talk in which we create our own debugger with conditional breakpoints, single stepping and a Python based debugging shell and learn a lot on debuggers along the way. Johannes Bechberger is a JVM developer working on profilers and their underlying technology in the SapMachine team at SAP. He started at SAP in 2022 after two years of research studies at the KIT in Java security analyses. His work today comprises many open-source contributions and his blog, where he writes regularly on in-depth profiling and debugging topics and works on his JEP Candidate 435 to add a new profiling API to the OpenJDK. He is an avid Python user for almost 10 years, with a special interest in type systems and debuggers. Since 2023 he's touring through the meet-ups and conferences of Europe, like JavaZone and Devoxx Belgium to speak on various topics. Talk #2 Jakob Ernst (Blue Yonder) Operating an AI/ML-Driven Product at Scale for 10+ Years: Successes, Failures, and Learnings In a rapidly evolving technology landscape, operating an AI/ML-driven product at scale presents unique challenges and opportunities. Jakob Ernst from Blue Yonder will provide an insightful dive into the journey of scaling, maintaining, and optimizing an AI product over a span of 10+ years. Jakob Ernst is a Senior Data Scientist at Blue Yonder. ---- Lightning Talks: 1. A Glimpse into Causal Reasoning in LLMs - Rustam Bekmamedov (Black Forest AI) 2. Python in Excel? Hell YES or Hell NO? - Alexander CS Hendorf 3. Your spot! Submit a talk here Acknowledgements Also a big thank you to our sponsors:
Contact If you have any questions or suggestions, please feel free to contact us via:
|
PyData Karlsruhe #8: 10+ years of AI/ML-Driven Products & DIY Python Debugger
|
|
Big PyData BBQ #5: Large Language Models
2023-09-21 · 16:00
If you like cool talks about 🧑🔬 Data Science, 🤖Artificial Intelligence, 🐍 coding or 🤗 community, the Big PyData BBQ is the place to be! 🔥 KÖNIGSWEG and hei_INNOVATION invite you to join the 5️⃣th edition of the annual big gathering of the PyData Südwest community. Besides talks there will be a lot of time for networking over a delicious 🥦+🍖 BBQ . This year's topic: Large Language Models 🔥🤩 Confirmed Speakers
The event will be live streamed and published on PyDataTV. 18:00 Welcome 👋 📺 18:20 Talk Ines Montani 📺 19:00 BBQ 🍖🥦 UPDATE: 20:30 🛋️ Panel: Alejandro, Alexander, Alina, Ines, Michael 📺 UPDATE: 21:30 ⚡ Lightning Talks. 📺 UPDATE: 21:30 Networking. 🍻 UPDATE: 22:00 End. 📺 = live stream 🍖🥦, 🍻 = locally, only About our speakers: Ines Montani, a renowned software developer, is a co-founder of Explosion AI, a digital laboratory specializing in artificial intelligence and machine learning. She is a lead developer of spaCy, a widely used open-source library for advanced natural language processing (NLP) in Python. Together with Matthew Honnibal, she also developed Prodigy, a machine learning annotation tool that aids in the efficient creation of training data. Montani is an advocate for OSS, working tirelessly to make the fields of AI and ML more accessible. Alejandro Saucedo is a technology entrepreneur and software engineer known for his work in machine learning and artificial intelligence. He is Director of Eng, Science, Product & Analytics at Zalando and the Chief Scientist at The Institute for Ethical AI & Machine Learning, a London-based research organization focused on developing best practices for machine learning and artificial intelligence, among others. Saucedo has a strong technical background and has worked in software development, ML and data science. He has spoken at numerous events and is promoting ethical practices in AI development Michael Gertz is a full professor at Heidelberg University where he heads the Database Systems Research Group at the faculty of Mathematics and Computer Science. He received his diploma in Computer Science from the TU Dortmund University, and his Dr. rer. nat. from the Leibniz University of Hannover in 1996. From 1997 until 2008 he was a faculty at Department of Computer Science at the University of California at Davis. His interdisciplinary research interests include text analytics, data mining, complex networks, and scientific data management, with applications in the medical sciences, law, physics, political sciences, and economics. Alina Lenhardt is a computational linguistat at Cerence and Program Committee Chair at PyConDE & PyData Berlin 2023. Alexander Hendorf is one of the organizers of PyData Südwest and is heavily involved in the Python & PyData community. For him, contributing to open source and the community means giving something back, as his company Königsweg uses open source to implement Data Science & AI for its customers. ⚡️ Lightning Talks (5 min. each) 1. Alessandro Angioi - Supercharge your language learning journey with Python 2. Bela Stoyan - Automatically transform complex python methods to polars expressions 3. Irina Smirnova-Pinchukova - Croshapes - using graph to design a toy 😍 A big thank you to our sponsors:
Contact If you have any questions or suggestions, please feel free to contact us via:
|
Big PyData BBQ #5: Large Language Models
|
|
PyData Heidelberg #11 - TimeSeries Forecasting & LLM Langchain
2023-08-22 · 16:00
TIMESERIES #FORECASTING #LLMS #LANGCHAINIn person in Heidelberg and live on PyData.TV on YouTube Agenda 18:00 Doors open 18:30 Welcome 18:45 Beyond Toy Datasets: Timeseries Forecasting for Real Business Problems - Robert Haase (AI Scientist @ paretos) 19:15 Break: Networking with snacks and beverages 20:00 How to Leverage the Full Potential of LLMs for Your Business with Langchain - Leon Ruddat (AI Research Specialist @ SNOCKS) 20:30 Lightning Talks 20:45 Networking with snacks and beverages 21:30 End Lightning Talks Join us by contributing a five-minute lightning talk! Fill out this form. How to sign up It's important for us to make this meet up happen in a responsible way. We have limited seats available only. How to join remotely Join the live stream on YouTube. Q&A This event will be in English. ---- About this meetup: Talk #1 Robert Haase (AI Scientist @ paretos) Beyond Toy Datasets: Timeseries Forecasting for Real Business Problems - Common Pitfalls and (Some) Solutions Real-life timeseries datasets e.g. from retailer or logistic companies often come with high complexity such as a high level of sparsity and heterogenity as well as most of the times it is highly imbalanced and driven by external factors. In this talk Robert will give an overview of what are the common pitfalls when working with real-life data in a timeseries context and provide a set of possible solutions to turn these projects into a success-story. Robert earned both his Bachelor's and Master's degrees in Physics from the University of Heidelberg, specializing in Condensed Matter Physics and Computational Physics. During his Master's thesis in 2020, he advanced existing NLP Transformer architectures for timeseries applications. This involved Robert working extensively with uncertainty quantifications and normalizing flows. Since the beginning of 2021, he has been employed at Paretos, where the primary focus of his work lies in Timeseries Forecasting, specifically demand forecasting. Robert has a keen interest in combining traditional statistical methods with deep learning techniques. Talk #2 Leon Ruddat (AI Research Specialist @ SNOCKS) How to Leverage the Full Potential of LLMs for Your Business with Langchain In this talk, Leon will introduce Langchain, a framework for developing applications powered by language models. He'll explore how to work with it, which use cases are most suited for this framework, and most importantly, which features of Langchain can deliver the greatest value from his perspective. Leon lives in Heidelberg. He holds a Master's degree in Mechatronics with a focus on Artificial Intelligence from Hochschule Mannheim. During his studies there, he was employed as an AI Scientist for almost three years, with his research mainly focusing on autonomous driving. Since April, he has been working at Snocks as an AI Scientist. There, he delves into the exciting world of Large Language Models (LLMs) and explores their applications in e-commerce. He currently leads a team of three developers with whom he collaborates on innovative AI solutions. His expertise lies particularly in working with PyTorch, CV-based AI frameworks, Langchain, and well-known LLMs such as GPT. What drives him? The "Why-Not-Spirit". His motto: "Just do it and build awesome AI tools." ---- Lightning Talks: 1. Locating the spiral arms of the Milky Way using t-sne - Dr. Bertrand Lemasle 2. Developing an AI-based Business Data Analyst using OpenAI Function Calling - Bernhard Schäfer 3. Open-Source Science (OSSci) - Tim Bonnemann Acknowledgements Also a big thank you to our sponsors:
Contact If you have any questions or suggestions, please feel free to contact us via:
|
PyData Heidelberg #11 - TimeSeries Forecasting & LLM Langchain
|
|
PyData Heidelberg #10 - Gaël Varoquaux & Daniel Stemmer @ Mathematikon
2023-06-27 · 16:00
MACHINELEARNING #DATA #DATACLEANSING #BESTPRACTICE #AIThis event will be in English. In person event. Live stream: https://youtube.com/live/I4Sksd93bZ4 Agenda 18:00 Doors open 18:30 Welcome 18:45 Skrub: Prepping Tables for Machine Learning Gets Easier - Gaël Varoquaux Research Director, Inria, France 19:30 Networking with snacks and beverages 20:15 Using Embeddings and Deep Neural Networks as a technique for AutoML Demand Forecasting - Daniel Stemmer 20:45 Lightning Talks 21:00 Networking with snacks and beverages 21:30 End Lightning Talks Join us by contributing a five-minute lightning talk! Fill out this form. How to sign up It's important for us to make this meet up happen in a responsible way. We have limited seats available only. ---- About this meetup: Talk #1 Skrub: Prepping Tables for Machine Learning Gets Easier Gaël Varoquaux, Research Director, Inria, France In standard data-science practice, a significant effort is spent on preparing the data before statistical learning. One reason is that the data come from various tables, each with its own subject matter, its specificities. These must be transformed to a format that can be injested by machine-learning modeled: assembled, aggregated, encoded. I will present some results from our research in developing machine-learning models that can more easily injest raw, messy data. I will also discuss how we are using this understanding to make a new software package that facilitate preparing tables for machine learning. It's called skrub, it's in progress, not released, but I'm excited! Gaël Varoquaux is a research director working on data science at Inria (French Computer Science National research) where he leads the Soda teamon computational and statistical methods to understand health and society with data. Varoquaux is an expert in machine learning, with an eye on applications in health and social science. He develops tools to make machine learning easier, suited for real-life, messy data. He co-funded scikit-learn, one of the reference machine-learning toolboxes, and helped build various central tools for data analysis in Python. He currently develops data-intensive approaches for epidemiology and public health, and worked for 10 years on machine learning for brain function and mental health. Varoquaux has a PhD in quantum physics supervised by Alain Aspect and is a graduate from Ecole Normale Superieure, Paris. https://gael-varoquaux.info/about.html Talk #2 Using Embeddings and Deep Neural Networks as a technique for AutoML Demand Forecasting Daniel Stemmer Daniel Stemmer studied physics at the KIT Karlsruhe, Diploma 2011, since 2011 data scientist @ BlueYonder, Currently Product Owner of LDE - the Demand Forecasting engine @ BlueYonder ---- Lightning Talks: 1. EU AI ACT\, XAI\, fair ML and banks - Dr. Christoph Anders 2. This is your slot! 3. TBA Acknowledgements Also a big thank you to our sponsors:
## Contact If you have any questions or suggestions, please feel free to contact us via:
|
PyData Heidelberg #10 - Gaël Varoquaux & Daniel Stemmer @ Mathematikon
|