talk-data.com
People (96 results)
See all 96 →Companies (1 result)
Activities & events
| Title & Speakers | Event |
|---|---|
|
Optimize first, parallelize second: a better path to faster data processing
2025-11-20 · 08:30
You’re processing a large amount of data with Python, and your code is too slow. One obvious way to getting faster results is adding multithreading or multiprocessing, so you can use multiple CPU cores. Unfortunately, switching straight to parallelism is almost always premature, often unnecessary, and sometimes impossible. |
|
|
Product Development: When and How to Build an AI Agent
2025-11-20 · 07:15
AI technologies are disrupting how we solve problems. For many companies including ours, we need to learn how to weave AI solutions into our problem space in a seamless way. This is the classic product development principle: first identifying a problem statement then using appropriate technologies to address the challenges. Without discipline, the AI hype often pushes us to do the opposite, forcefully embedding AI into a problem. |
|
|
Copenhagen dbt Meetup vol. 6
2025-03-26 · 16:30
Sixth dbt Meetup in Copenhagen Save the date: Wednesday, 26th of March 2025 🧡. We are excited to welcome three speakers who will share their knowledge and experiences with dbt. Event Details
Agenda & Speakers 17:30 - Kick off the evening by networking over drinks and snacks 🥤 17:45 - Event introduction and welcome
18:00-20:00 - Speaker Sessions 💬
20:00 - 🍕 Food & Socializing - Continue conversations over food and drinks. 20:45 - Closing remarks and wrap-up 🙌🏼 To attend, please read the Health and Safety Policy and Terms of Participation: https://www.getdbt.com/legal/health-and-safety-policy ➡️ Join the dbt Slack community: https://www.getdbt.com/community/ For the best Meetup experience, join the #local-denmark channel in dbt Slack: https://slack.getdbt.com/ |
Copenhagen dbt Meetup vol. 6
|
|
Revolutionizing Cancer Treatment
2024-10-17 · 15:00
📣 PyData Yerevan announces the October meetup of its new series! Aleksandr Sarachakov, Biomedical Imaging Team Lead at BostonGene, will deliver a talk on “Revolutionizing Cancer Treatment: Harnessing AI, Zarr, and AnnData for High-Speed Biomedical Imaging.” Zarr and AnnData, Python-based technologies, are revolutionizing the landscape of biomedical image processing, especially when paired with self-supervised learning (SSL). Zarr, a chunked and compressed data storage format, enables the efficient handling of datasets found in biomedical applications. AnnData, a specialized framework for multi-dimensional annotated data, plays a crucial role in managing and analyzing large-scale biomedical datasets. In the context of SSL, these technologies boost the processing speed and reduce the computational load for handling high-resolution images and complex datasets. Zarr's ability to store multi-terabyte data in distributed and parallelized environments allows for faster processing and analysis of biomedical images. AnnData complements this by providing structured, annotated data that SSL models can efficiently learn from without extensive labeling. This combination reduces memory usage, making it feasible to handle biomedical images on a large scale. These advancements are pivotal for applications like cancer diagnosis, where rapid, accurate image analysis is critical. During the talk, our speaker will explore: 🟣 how Zarr and AnnData facilitate scalable biomedical image processing, 🟣 outline their integration with SSL for cutting-edge research, 🟣 and discuss future developments in optimizing biomedical workflows. Save the date to attend the meetup on October 17, at 19:00, in the PMI Science R&D Center (Teryan 105, 13th Building). 🔗Register here: https://forms.gle/hWdTwCBfcSprjAgc7 |
Revolutionizing Cancer Treatment
|
|
Join our comprehensive and interactive webinar to master Postgres Table Partitioning, a game-changer for very large databases! Learn how to:
Through interactive demonstrations, real-world examples, and expert guidance, become a PostgreSQL Professional and unlock the full potential of your databases! Gain the skills and knowledge to:
Don't miss out! Sign up now and take your database skills to the next level! Presented by James Vanderpoel, Senior Principal Consultant This event is co-hosted by the New York Oracle Users Group (www.nyoug.org) and Oracle Professional Services firm, Viscosity North America (www.viscosityna.com) REGISTER HERE: https://viscosityna.com/unlock-the-power-of-postgres-table-partitioning-nyoug |
Unlock the Power of Postgres Table Partitioning! Boost Query Performance and Dat
|
|
Time Series Forecasting 📈
2024-04-10 · 22:30
Join PyData NYC at 11 Times Square (Microsoft) on April 10th at 6:30 pm for a workshop night with Jorn Mossel and Thomas J. Fan. Please bring your laptops to code along. 🍕 Food, drinks and venue sponsored by Microsoft Reactor - thank you ❤️ Agenda: Introduction to Time Series Forecasting: 30 mins Speaker: Jorn Mossel Time series problems, such as sales forecasting, energy demand, and peak server load predictions, are widespread in data science, yet often overlooked in introductory machine learning courses. This tutorial offers a brief introduction to time series, highlighting how they differ from other machine learning problems. We then demonstrate both traditional and more advanced time series techniques through explicit examples in Python. We conclude by giving an overview of some of the latest developments and available Python packages for time series analysis. Jorn is a Physics Ph.D., a former quant on Wall Street, and a senior data scientist working on energy demand forecasting. Time Series EDA with STUMPY: 25 minutes Speaker: Thomas J. Fan STUMPY is a robust and scalable Python library for computing a matrix profile, which can create valuable insights about our time series. STUMPY is built with Numba, which parallelizes computing on CPUs and accelerates it with GPUs. In this talk, we learn about the matrix profile and the various methods to compute it. Afterwards, we explore how to use the matrix profile in applications such as pattern discovery, anomaly detection, semantic segmentation, and time series chains. Thomas J. Fan is a senior machine learning engineer at Union.ai and a maintainer for scikit-learn. At scikit-learn, he led the development of DataFrame interoperability and GPU support through PyTorch. Previously, as a researcher at Columbia University, Thomas collaborated with NASA to automate machine learning workflows. Networking Connect with fellow data enthusiasts, professionals, and community leaders. Build meaningful connections and forge collaborations. ---------------------------------------------------------------- RSVP is required; please note that walk-ins will not be accepted. Note: Per building policy, RSVPs will close at 12 pm on Apr 8 Doors @ 6 pm Event @ 6:30 - 8:30 pm Venue provided by MSFT: 11 Times Square ---------------------------------------------------------------- The building requires a government-issued photo ID for entrance. This, and all PyData NYC events, is an all-level event. Newcomers and beginners are welcome. This, and all NumFOCUS-affiliated events and spaces, both in-person and online, are governed by a Code of Conduct. More at https://pydata.org/code-of-conduct/ This event may be recorded. |
Time Series Forecasting 📈
|
|
[PyMCon Web Series] Enabling Uncertainty Quantification - Q&A - #2
2024-01-30 · 06:30
🎙️ Speakers: Anne Reinarz & Linus Seelinger \| ⏰ Time: 6:30 UTC / 12 pm IST / 5:30 pm Australia / 3:30 pm Japan Treating uncertainties is essential in the design of safe aircraft, medical decision making, and many other fields. UM-Bridge enables straightforward uncertainty quantification (UQ) on advanced models by removing technical barriers. Complex numerical models often consist of large code bases that are difficult to integrate with UQ packages such as PyMC, holding back many interesting applications. UM-Bridge is a universal interface for linking UQ and models, greatly accelerating development from prototype to high-performance computing. This hands-on tutorial teaches participants how to build UQ applications using PyMC and UM-Bridge. We cover a range of practical exercises ranging from basic toy examples all the way to controlling parallelized models on a live cloud cluster. Beyond that, we encourage participants to bring their own methods and problems. Why UM-Bridge is needed: The main idea is to make UQ more widely accessible and accelerate the development of UQ methods by establishing:
Our approach closes the gap between advanced UQ methods and advanced models by removing the technical hurdle of integrating complex software stacks:
If you are interested, you can read more on the interface and the benchmarks here:
Content: 📽️ Interview video: https://youtu.be/BdHQfn_3vjQ 🎙️ Async Talk: Part - 1: Intro 👉 https://www.youtube.com/watch?v=ueoLjv6egZg Part - 2: Hands-on tutorial 👉 https://www.youtube.com/watch?v=rdQ4CXnkKF0 📝 We highly recommend following along with the tutorial on UM-Bridge docs: https://um-bridge-benchmarks.readthedocs.io/en/docs/tutorial.html 👉 Presentation Slides 👉 Discourse Post for more details and discussion: https://discourse.pymc.io/t/13583 📢 Note: This session is exclusively for Q&A. We kindly request you to watch the Async component before joining the event. For additional information and further discussion, please refer to this Discourse post. |
[PyMCon Web Series] Enabling Uncertainty Quantification - Q&A - #2
|
|
Transit Techies #21: Navigating Big Data Challenges in the MTA
2024-01-29 · 22:00
Please register using NYU's Google form by January 26th: https://docs.google.com/forms/d/e/1FAIpQLSfScQ0oPdfzJ4Yfv3OJboFlThgeYyNgJlDCc1k5n5Vm1CP5yg/viewform Join us for an engaging talk on how the MTA, America's largest bus service, tackles a major 'Big Data' challenge. Discover how our team uses GPS data from over 5,000 buses across 300+ routes to improve performance metrics. We'll dive into the complexities of matching buses to routes and stops using a cutting-edge parallelized geospatial algorithm. Learn about public data feeds, signal processing, MLOps, and more in this session that illustrates successful intra-agency collaboration. Please register using NYU's Google form by January 26th: https://docs.google.com/forms/d/e/1FAIpQLSfScQ0oPdfzJ4Yfv3OJboFlThgeYyNgJlDCc1k5n5Vm1CP5yg/viewform HAPPY HOUR AFTERWARDS Good news! There's a happy hour afterwards around the block at Hana House at 345 Adams Street. It's around the block from 370 Jay Street where the event is. Looking forward to alcoholic and non-alcoholic drinks after the talk! Agenda: 5-6:30 - Presentation 6:30-7:15 - Q&A 7:15 - 9PM - Happy hour at Hana House! |
Transit Techies #21: Navigating Big Data Challenges in the MTA
|
|
[PyMCon Web Series] Enabling Uncertainty Quantification - Q&A - #1
2024-01-29 · 15:00
🎙️ Speakers: Anne Reinarz & Linus Seelinger \| ⏰ Time: 15:00 UTC / 7 am PT / 10 am ET / 4 pm Berlin Treating uncertainties is essential in the design of safe aircraft, medical decision making, and many other fields. UM-Bridge enables straightforward uncertainty quantification (UQ) on advanced models by removing technical barriers. Complex numerical models often consist of large code bases that are difficult to integrate with UQ packages such as PyMC, holding back many interesting applications. UM-Bridge is a universal interface for linking UQ and models, greatly accelerating development from prototype to high-performance computing. This hands-on tutorial teaches participants how to build UQ applications using PyMC and UM-Bridge. We cover a range of practical exercises ranging from basic toy examples all the way to controlling parallelized models on a live cloud cluster. Beyond that, we encourage participants to bring their own methods and problems. Why UM-Bridge is needed: The main idea is to make UQ more widely accessible and accelerate the development of UQ methods by establishing:
Our approach closes the gap between advanced UQ methods and advanced models by removing the technical hurdle of integrating complex software stacks:
If you are interested, you can read more on the interface and the benchmarks here:
Content: 📽️ Interview video: https://youtu.be/BdHQfn_3vjQ 🎙️ Async Talk: Part - 1: Intro 👉 https://www.youtube.com/watch?v=ueoLjv6egZg Part - 2: Hands-on tutorial 👉 https://www.youtube.com/watch?v=rdQ4CXnkKF0 📝 We highly recommend following along with the tutorial on UM-Bridge docs: https://um-bridge-benchmarks.readthedocs.io/en/docs/tutorial.html 👉 Presentation Slides 👉 Discourse Post for more details and discussion: https://discourse.pymc.io/t/13583 📢 Note: This session is exclusively for Q&A. We kindly request you to watch the Async component before joining the event. For additional information and further discussion, please refer to this Discourse post. |
[PyMCon Web Series] Enabling Uncertainty Quantification - Q&A - #1
|
|
Christmas Tech Talks: Dive into DuckDB & Hopsworks
2023-12-14 · 16:30
Christmas is just around the corner and what better way to end the year with talks around DuckDB? Our last meetup of the year will welcome Max from DuckDB Labs and Fabio from Hopsworks. Max will introduce DuckDB, an innovative embedded data management system optimized for analytical SQL workloads and Fabio will introduce feature stores and the challenges & learnings of integrating DuckDB and Arrow Flight into the Hopsworks platform. Agenda: 17:30 - 18:00: Doors open 18:00 - 18:10: Welcome 18:10 - 18:40: DuckDB: Transforming Data Management and Analytics 18:40 - 19:10: Snacks & Refreshments 19:10 - 19:40: MLOps on the fly: Optimizing a feature store with DuckDB and ArrowFlight 19:40 - 20:30: Networking – Presentations: DuckDB: Transforming Data Management and Analytics Max Gabrielsson - Software Engineer at DuckDB Labs In this talk we present DuckDB, a novel embedded data management system designed for analytical SQL workloads. By incorporating decades of clever techniques and algorithms from the database research community, DuckDB empowers data engineering on a single machine to reach a whole new level of scale and performance, without the hassle and operational overhead commonly associated with traditional database and data warehouse systems. One way DuckDB aims to achieve this goal is through its unique in-depth integration with Python, allowing for seamless interoperability with the existing data science ecosystem through familiar APIs and zero-copy data sharing between staple libraries like Numpy, Pandas and Polars. This makes DuckDB an essential tool for the practical data scientist looking to squeeze the most out of their system without having to leave their comfort zone. We will introduce and explain the main strengths and characteristics of DuckDB such as its parallelized vectorized query execution engine, out-of-core beyond memory capabilities and transparent compression, and demonstrate how these features can be leveraged in a typical python-based data science workflow through a series of examples mixing both SQL and dataframes. We will also showcase DuckDBs flexible extension system and illustrate how it can be used to bridge different data sources and domains. Speaker Bio: Max Gabrielsson is a software engineer at DuckDB Labs where he works on the DuckDB database system. While he generally tries to not stay confined to any specific part of the stack, he has a particular interest in geospatial data management and is the primary maintainer of the DuckDB spatial GIS extension. Max holds a BSc in Computer Science from Uppsala University and hopes to one day finish his MSc with a thesis on the topic of database systems. In his spare time he enjoys kickboxing and hacking on side projects, usually involving compilers, databases or cartography. MLOps on the fly: Optimizing a feature store with DuckDB and ArrowFlight Fabio Buso - VP of Engineering at Hopsworks Feature Stores are a vital part of the MLOps stack for managing machine learning features and ensuring data consistency. This talk introduces Feature Stores and the underlying data management architecture. We’ll then discuss the challenges and learnings of integrating DuckDB and Arrow Flight into our Feature Store platform, and share benchmarks showing up to 30x speedups compared to Spark/Hive. Discover how DuckDB and ArrowFlight can also speedup your data management and machine learning pipelines. Speaker Bio: Fabio Buso is VP of Engineering at Hopsworks, leading the Feature Store development team. Fabio holds a master’s degree in Cloud Computing and Services with a focus on data intensive applications. – About the event Date: December 14th, 17:30 - 20:30 Location: Hopsworks Office (Åsögatan 119, Plan 2, 116 24 Stockholm) The venue this time is at the Hopsworks Office. As the office is sometimes difficult to locate we have made this map for everyone to follow. See you then! Directions: 2-minute walk from Medborgarplatsen. Tickets: Sign up required. Anyone who is not on the list will not get in. The event is free of charge. Capacity: Space is limited to 70 participants. If you are signed up but unable to attend, please let us know by December 13th. Food and drinks: Snacks and drinks will be provided. Questions: Please contact the meetup organizers. – Code of Conduct The NumFOCUS Code of Conduct applies to this event; please familiarize yourself with it before attending. If you have any questions or concerns regarding the Code of Conduct, please contact the organizers. |
Christmas Tech Talks: Dive into DuckDB & Hopsworks
|
|
How dbt Labs tunes model performance and optimizes cloud data platform costs - Coalesce 2023
2023-10-25 · 02:21
Elize Papineau
– Sr. Data Engineer
@ dbt Labs
In the dbt Labs on dbt series, you get a behind-the-scenes look at how dbt Labs uses data. You’ll learn how dbt Labs thinks about the role of data, how data developers collaborate with business leaders, and the technical decisions we’ve made in our own dbt project. In this session, Elize Papineau, Senior Data Engineer at dbt Labs, digs deeper into the technical details of the cost optimization project at dbt Labs. You'll learn how the team leveraged query tags in dbt to make model performance monitoring possible, the process for analyzing model performance, the implementation of warehouse specific configurations at the model level, and how the team measures the effectiveness of optimizations and translates it into cost savings. Watch to learn about dbt Labs' journey and leave with ideas that you can implement in your dbt project today. Speaker: Elize Papineau, Sr. Data Engineer, dbt Labs Register for Coalesce at https://coalesce.getdbt.com |
dbt Coalesce 2023 |
|
OVERVIEW OF POWER BI AS A SELF SERVICE AND ENTERPRISE BI TOOL
2023-06-15 · 17:00
Join LEARNTOR FREE webinar on June 15th at 6 PM WAT via Zoom for an informative session on "Overview of Power BI as a Self Service and Enterprise BI Tool". Connect with us here => https://bit.ly/LEARNTORQR 🚦 💃💃"Overview of Power BI as a Self Service and Enterprise BI Tool". Our guest speaker, Oyebola Omoya, is a Microsoft Certified Trainer, who have trained quite a number of individuals who are also interested in leveraging on analytical tool (Microsoft Power BI, Azure SQL) for excellent and impeccable delivery of their work for about 2years and counting. She will be sharing her knowledge and experience on using Power BI as a Self Service and Enterprise BI Tool. Time: June 15, 2023 06:00 PM West Central Africa Venue: Zoom Join Zoom Meeting https://us02web.zoom.us/j/87690932689?pwd=ZkQ3OHZpYWRKempMMjB3RGRGdERVQT09 Meeting ID: 876 9093 2689 Passcode: LEARNTOR Whether you're a Data analyst, Power BI Expert, or simply interested in learning more about data analysis, this meetup is for you.There will also be time for Q&A and networking, so you can connect with like-minded individuals and share your own experiences. Don't miss out on this opportunity to learn from an expert and take your Power BI skills to the next level. RSVP now to secure your spot!🚀 Meet the host: 🎙🎙🎙 Mercy George-Igbafe is a multi-award-winning tech evangelist committed to closing the gender gap in Africa with the goal to develop 10,000 women and youth. An Educator, founder, and mentor in championing, Artificial Intelligence (AI), Robotics, Agile, Scrum, Kanban, Data Analytics, Business Analysis, Cyber Security, and Digital Marketing with over 25 years of experience. A professional Scrum Master, and Microsoft Certified Trainer with Professional Diploma in digital marketing. We have trained 2,380 tech talent and 75% of our learners are women Tech first-timer. Despite challenges and overcoming insurmountable hardships not excluding rape with zero support as an orphan girl I was determined to make something of my life with a commitment to help other women win as I have. Hence my commitment to helping as many women get into tech to become African Change-agent . My Tech journey started in 2000, Technology as an employee of Ecobank, while in 2017 where I fully embraced Tech, confronted with a mid-life crisis unsure and in despair, Today I am on a mission to evangelize Tech to more women with a renewed of purpose as a customer-centric, value-driven and problem-solving individual devoted to social reforms. Today, I have almost 15 certifications. A Microsoft Certified Trainer, Data Analyst, Professional Scrum Master, Certified Business Analyst, Professional Digital marketing, and leading Agile Digital Transformation campaign and implementation for my clients and students. My tech journey began in 2000 when I joined Ecobank as an employee. It wasn't until 2017 that I fully embraced tech where I now hold over 15 certifications. An Agile Digital Transformation professional, a Microsoft Certified Trainer who is also a Data Analyst, Professional Scrum Master, Certified Business Analyst, Professional Digital Marketer, and Certified Data Analyst. I founded LEARNTOR Foundation and LEARNTOR with the goal to develop 10,000 women into tech by 2025 to mitigate the high rate of unemployment and employability problems whilst bridging the digital skills gap and advocating Agile best practices and principles. Our vision aligns with UN MDG goals (4, 5 & 8) Quality Education, Gender Equality Decent Work, and Economic Growth. Our traction and impact earned me an executive board appointment as an executive board member for Women in Agile (USA) for Diversity and Inclusion (DEI) and the convener of Women in Agile Africa organizing the most inclusive African Agile conference in 3 languages (French, English, and Portuguese). My commitment to advancing Tech in Nigeria and developing women has earned me features on major media platforms like, Channels TV (The Beam, Tech Trend, Business Morning), Wazobia MaxTV, Television Continental, ThisDay, The Nation Newspaper, Guardian, Arise TV, Daily Independent, Vanguard, Wazobia TV, Max FM, Guardian Woman, She Leads Africa etc LinkedIn Profile: https://www.linkedin.com/in/mercygeorgeigbafe/ |
OVERVIEW OF POWER BI AS A SELF SERVICE AND ENTERPRISE BI TOOL
|
|
Workshop: dbt Packages you didn’t know you needed
2022-10-25 · 19:09
You’re probably familiar with the dbt-utils package, but how many others have you explored? If you’re looking to cut development time, make your next audit less painful, or wield dbt metrics confidently, join Elize and Dave as they dig into three of their most essential dbt packages—codegen, audit_helper, and metrics —in this hands-on workshop. Coalesce 2023 is coming! Register for free at https://coalesce.getdbt.com/. |
dbt Coalesce 2022 |