talk-data.com talk-data.com

Topic

Data Science

machine_learning statistics analytics

619

tagged

Activity Trend

68 peak/qtr
2020-Q1 2026-Q1

Activities

619 activities · Newest first

Before StatQuest became the go-to learning companion for millions of AI and ML practitioners… Before the “BAM! Double BAM! Triple BAM!” became a teaching tool that many learners adore...

There was just one guy in a genetics lab, trying desperately to explain his data analysis to coworkers so they didn't think he was working magic.

In this deeply personal and inspiring episode, Joshua Starmer (CEO & Founder | StatQuest) shares the real story behind his rise — a journey shaped by strategy, struggle, blunt feedback, and a relentless desire to make complicated ideas simple.

What you’ll discover: 🔹How Josh went from helping colleagues in a genetics lab to becoming a renowned educator, treasuring his first 9 views and 2 subscribers as a big win. 🔹How early feedback Josh received as a kid became a quiet spark — motivating him to improve how he explained things and ultimately shaping the teaching style millions now rely on. 🔹How his method for breaking down complex topics with unique tools like his iconic BAM! help make learning lighter and less intimidating. 🔹His thoughts on AI tutors, avatars, and interactive learning and how ethics, bias, and hallucinations relate to next-gen learning.

This is more than a conversation about statistics, data science, AI, education, or YouTube. It’s the story of a researcher who never imagined starting a learning platform, yet became one of the most trusted teachers in statistics and machine learning—turning frustration into clarity, confusion into curiosity, and small beginnings into a massive global impact.

📌 If you’ve ever struggled with PCA, logistic regression, K-means clustering, neural networks, or any tricky stats and ML concepts… chances are StatQuest made it click. Now, hear from the creator himself about what goes on behind the scenes. Now you’ll finally understand how he made it click.

🔹A must-listen for: AI/ML learners, data scientists, educators, content creators, self-taught enthusiasts, and anyone who’s faced the fear of “I’m not good at explaining things.”Prepare to walk away inspired — and with a renewed belief that clarity is a superpower anyone can learn.

Data science leadership is about more than just technical expertise—it’s about building trust, embracing AI, and delivering real business impact. As organizations evolve toward AI-first strategies, data teams have an unprecedented opportunity to lead that transformation. But how do you turn a traditional analytics function into an AI-driven powerhouse that drives decision-making across the business? What’s the right structure to balance deep technical specialization with seamless business integration? From building credibility through high-impact forecasting to creating psychological safety around AI adoption, effective data leadership today requires both technical rigor and visionary communication. The landscape is shifting fast, but with the right approach, data science can stand as a true pillar of innovation alongside engineering, product, and design. Bilal Zia is currently the Head of Data Science & Analytics at Duolingo, an EdTech company whose mission is to develop the best education in the world and make it universally available. Previously, he spent two years helping to build and lead an interdisciplinary Central Science team at Amazon, comprising economists, data and applied scientists, survey specialists, user researchers, and engineers. Before that, he spent fifteen years in the Research Department of the World Bank in Washington, D.C., pursuing an applied academic career. He holds a Ph.D. in Economics from the Massachusetts Institute of Technology, and his interests span economics, data science, machine learning/AI, psychology, and user research. In the episode, Richie and Bilal explore rebuilding an underperforming data team, fostering trust with leadership, embedding data scientists within product teams, leveraging AI for productivity, the future of synthetic A/B testing, and much more. Links Mentioned in the Show: DuolingoDuolingo Blog: How machine learning supercharged our revenue by millions of dollarsConnect with BilalAI-Native Course: Intro to AI for WorkRelated Episode: The Future of Data & AI Education Just Arrived with Jonathan Cornelissen & Yusuf SaberRewatch RADAR AI  New to DataCamp? Learn on the go using the DataCamp mobile appEmpower your business with world-class data and AI skills with DataCamp for business

podcast_episode
by Nathalie Miebach (School of Data Science, University of Virginia) , Alex Gates (University of Virginia)

Here we explore the intersections of data, art, and storytelling. Our guest, Nathalie Miebach, is an internationally-recognized data artist and the School of Data Science’s inaugural Artist-in-Residence.

Using materials like reed and paper, she transforms complex datasets into woven sculptures and musical scores, inviting us to view and even hear data in new ways. Joining her is Alex Gates, assistant professor of data science at the University of Virginia research examines how patterns of connection shape creativity, innovation, and discovery.

Together, they discuss what happens when data meets art.

Chapters (00:00:01) - Data Points: When Art Meets Science(00:00:46) - Ian and Nicole: Introduction(00:06:18) - How Stories Get Made(00:09:59) - Basket Weaving Visualizing Data(00:20:33) - Wonders of the World(00:25:47) - Data and Artist Residency(00:27:50) - Breaking Habits in Creativity(00:30:06) - What is Data Science: Craftsmanship?(00:34:50) - How Art Affects Our Understanding of Data

The future of education is being reshaped by AI-powered personalization. Traditional online learning platforms offer static content that doesn't adapt to individual needs, but new technologies are creating truly interactive experiences that respond to each learner's context, pace, and goals. How can personalized AI tutoring bridge the gap between mass education and the gold standard of one-on-one human tutoring? What if every professional could have a private tutor that understands their industry, role, and specific challenges? As organizations invest in upskilling their workforce, the question becomes: how can we leverage AI to make learning more engaging, effective, and accessible for everyone? As the Co-Founder & CEO of DataCamp, Jonathan Cornelissen has helped grow DataCamp to upskill over 10M+ learners and 2800+ teams and enterprise clients. He is interested in everything related to data science, education, and entrepreneurship. He holds a Ph.D. in financial econometrics and was the original author of an R package for quantitative finance. Yusuf Saber is a technology leader and entrepreneur with extensive experience building and scaling data-driven organizations across the Middle East. He is the Founder of Optima and a Venture Partner at COTU Ventures, with previous leadership roles at talabat, including VP of Data and Senior Director of Data Science and Engineering. Earlier in his career, he co-founded BulkWhiz and Trustious, and led data science initiatives at Careem. Yusuf holds research experience from ETH Zurich and began his career as an engineering intern at Mentor Graphics. In the episode, Richie, Jo and Yusuf explore the innovative AI-driven learning platform Optima, its unique approach to personalized education, the potential for AI to enhance learning experiences, the future of AI in education, the challenges and opportunities in creating dynamic, context-aware learning environments, and much more. Links Mentioned in the Show: Read more about the announcementTry the AI-Native Courses:Intro to SQLIntro to AI for Work New to DataCamp? Learn on the go using the DataCamp mobile appEmpower your business with world-class data and AI skills with DataCamp for busines

AI and data analytics are transforming business, and your data career can’t afford to be left behind. 🎙️ In this episode of Data Career School, I sit down with Ketan Mudda, Director of Data Science & AI Solutions at Walmart, to explore how AI is reshaping retail, analytics, and decision-making—and what it means for students, job seekers, and early-career professionals in 2026.

We dive into: How AI is driving innovation and smarter decisions in retail and business Essential skills data professionals need to thrive in an AI-first world How AI tools like ChatGPT are changing the way analysts work What employers look for beyond technical expertise Strategies to future-proof your data career

Ketan also shares his journey from Credit Risk Analyst at HSBC to leading AI-driven initiatives at one of the world’s largest retailers.

Whether you’re starting your data career, exploring AI’s impact on business, or curious about analytics in action, this episode is packed with actionable insights, inspiration, and career guidance.

🎙️ Hosted by Amlan Mohanty — creator of Data Career School, where we explore AI, data analytics, and the future of work. Follow me: 📺 YouTube 🔗 LinkedIn 📸 Instagram

🎧Listen now to level up your data career!

Chapters 00:00 The Journey of Ketan Mudda05:18 AI's Transformative Impact on Industries12:49 Responsible AI Practices14:28 The Role of Education in Data Science23:18 AI and the Future of Jobs28:03 Embracing AI Tools for Success29:44 The Importance of Networking31:40 Curiosity and Continuous Learning32:50 Storytelling in Data Science Leadership36:22 Focus on AI Ethics and Change Management41:03 Learning How to Learn44:57 Identifying Problems Over Tools

In this show, we're joined by Sean Chandler, Director of BI at CenterWell Home Health, to explore what it really means to thrive in BI today. Sean shares his personal journey, including his move into teaching, and offers practical insights on building a career in BI, self-learning for advancement, and fostering a strong partnership between BI and data science teams. Whether you're an aspiring BI analyst, a data scientist aiming to improve collaboration, or a career changer eyeing the BI space, this episode is for you. What You'll Learn: How to successfully transition from other roles into BI, and how to know if it's the right fit for you What good collaboration between BI and data science actually looks like, and how to recognize when it's broken How self-taught skills can accelerate your BI career, even without a formal background   🤝 Follow Sean on LinkedIn!   Register for free to be part of the next live session: https://bit.ly/3XB3A8b   Follow us on Socials: LinkedIn YouTube Instagram (Mavens of Data) Instagram (Maven Analytics) TikTok Facebook Medium X/Twitter

In this episode of Data Skeptic's Recommender Systems series, Kyle sits down with Aditya Chichani, a senior machine learning engineer at Walmart, to explore the darker side of recommendation algorithms. The conversation centers on shilling attacks—a form of manipulation where malicious actors create multiple fake profiles to game recommender systems, either to promote specific items or sabotage competitors. Aditya, who researched these attacks during his undergraduate studies at SPIT before completing his master's in computer science with a data science specialization at UC Berkeley, explains how these vulnerabilities emerge particularly in collaborative filtering systems. From promoting a friend's ska band on Spotify to inflating product ratings on e-commerce platforms, shilling attacks represent a significant threat in an industry where approximately 4% of reviews are fake, translating to $800 billion in annual sales in the US alone. The discussion delves deep into collaborative filtering, explaining both user-user and item-item approaches that create similarity matrices to predict user preferences. However, these systems face various shilling attacks of increasing sophistication: random attacks use minimal information with average ratings, while segmented attacks strategically target popular items (like Taylor Swift albums) to build credibility before promoting target items. Bandwagon attacks focus on highly popular items to connect with genuine users, and average attacks leverage item rating knowledge to appear authentic. User-user collaborative filtering proves particularly vulnerable, requiring as few as 500 fake profiles to impact recommendations, while item-item filtering demands significantly more resources. Aditya addresses detection through machine learning techniques that analyze behavioral patterns using methods like PCA to identify profiles with unusually high correlation and suspicious rating consistency. However, this remains an evolving challenge as attackers adapt strategies, now using large language models to generate more authentic-seeming fake reviews. His research with the MovieLens dataset tested detection algorithms against synthetic attacks, highlighting how these concerns extend to modern e-commerce systems. While companies rarely share attack and detection data publicly to avoid giving attackers advantages, academic research continues advancing both offensive and defensive strategies in recommender systems security.

Help us become the #1 Data Podcast by leaving a rating & review! We are 67 reviews away! I wouldn't try to become a data analyst next here. Here's 4 reasons why and what I'd do instead. 👩‍💻 Want to land a data job in less than 90 days? 👉 https://www.datacareerjumpstart.com/daa ⌚ TIMESTAMPS 00:32 - Reason 1 not to be data scientist 03:22 - Reason 2 not to be data scientist 04:55 - Reason 3 not to be data scientist 07:33 - Reason 4 not to be data scientist 11:28 - What to do instead 🍿 OTHER EPISODES MENTIONED Data Analyst Roadmap: https://datacareerpodcast.com/episode/136-how-i-would-become-a-data-analyst-in-2025-if-i-had-to-start-over-again Get Paid to Learn Data: https://datacareerpodcast.com/episode/137-get-paid-1000s-to-master-data-analytics-skills-in-2025 Get You Master's Paid For (Thomas): https://datacareerpodcast.com/episode/128-meet-the-math-teacher-who-landed-a-data-job-in-60-days-thomas-gresco Get You Master's Paid For (Rachael): https://datacareerpodcast.com/episode/125-how-she-landed-a-business-intelligence-analyst-job-in-less-than-100-days-w-rachael-finch My review of Georgia Tech's Master's: https://datacareerpodcast.com/episode/38-masters-in-data-analytics-from-georgia-tech-is-it-worth-it 💌 Join 30k+ aspiring data analysts & get my tips in your inbox weekly 👉 https://www.datacareerjumpstart.com/newsletter 🆘 Feeling stuck in your data journey? Come to my next free "How to Land Your First Data Job" training 👉 https://www.datacareerjumpstart.com/training 👩‍💻 Want to land a data job in less than 90 days? 👉 https://www.datacareerjumpstart.com/daa 👔 Ace The Interview with Confidence 👉 https://www.datacareerjumpstart.com//interviewsimulator 🔗 CONNECT WITH AVERY 🎥 YouTube Channel 🤝 LinkedIn 📸 Instagram 🎵 TikTok 💻 Website Mentioned in this episode: Join the last cohort of 2025! The LAST cohort of The Data Analytics Accelerator for 2025 kicks off on Monday, December 8th and enrollment is officially open!

To celebrate the end of the year, we’re running a special End-of-Year Sale, where you’ll get: ✅ A discount on your enrollment 🎁 6 bonus gifts, including job listings, interview prep, AI tools + more

If your goal is to land a data job in 2026, this is your chance to get ahead of the competition and start strong.

👉 Join the December Cohort & Claim Your Bonuses: https://DataCareerJumpstart.com/daa https://www.datacareerjumpstart.com/daa

podcast_episode
by Data Talks Club (DataTalks.Club) , Sebastian Ayala Ruano (Multiomics Network Analytics Group, DTU Biosustain)

In this talk, Sebastian, a bioinformatics researcher and software engineer, shares his inspiring journey from wet lab biotechnology to computational bioinformatics. Hosted by Data Talks Club, this session explores how data science, AI, and open-source tools are transforming modern biological research — from DNA sequencing to metagenomics and protein structure prediction.

You’ll learn about: - The difference between wet lab and dry lab workflows in biotechnology - How bioinformatics enables faster insights through data-driven modeling - The MCW2 Graph Project and its role in studying wastewater microbiomes - Using co-abundance networks and the CC Lasso algorithm to map microbial interactions - How AlphaFold revolutionized protein structure prediction - Building scientific knowledge graphs to integrate biological metadata - Open-source tools like VueGen and VueCore for automating reports and visualizations - The growing impact of AI and large language models (LLMs) in research and documentation - Key differences between R (BioConductor) and Python ecosystems for bioinformatics

This talk is ideal for data scientists, bioinformaticians, biotech researchers, and AI enthusiasts who want to understand how data science, AI, and biology intersect. Whether you work in genomics, computational biology, or scientific software, you’ll gain insights into real-world tools and workflows shaping the future of bioinformatics.

Links: - MicW2Graph: https://zenodo.org/records/12507444 - VueGen: https://github.com/Multiomics-Analytics-Group/vuegen - Awesome-Bioinformatics: https://github.com/danielecook/Awesome-Bioinformatics

TIMECODES00:00 Sebastian’s Journey into Bioinformatics06:02 From Wet Lab to Computational Biology08:23 Wet Lab vs Dry Lab Explained12:35 Bioinformatics as Data Science for Biology15:30 How DNA Sequencing Works19:29 MCW2 Graph and Wastewater Microbiomes23:10 Building Microbial Networks with CC Lasso26:54 Protein–Ligand Simulation Basics29:58 Predicting Protein Folding in 3D33:30 AlphaFold Revolution in Protein Prediction36:45 Inside the MCW2 Knowledge Graph39:54 VueGen: Automating Scientific Reports43:56 VueCore: Visualizing OMIX Data47:50 Using AI and LLMs in Bioinformatics50:25 R vs Python in Bioinformatics Tools53:17 Closing Thoughts from Ecuador Connect with Sebastian Twitter - https://twitter.com/sayalaruanoLinkedin - https://linkedin.com/in/sayalaruano Github - https://github.com/sayalaruanoWebsite - https://sayalaruano.github.io/ Connect with DataTalks.Club: Join the community - https://datatalks.club/slack.htmlSubscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQCheck other upcoming events - https://lu.ma/dtc-eventsGitHub: https://github.com/DataTalksClubLinkedIn - https://www.linkedin.com/company/datatalks-club/Twitter - https://twitter.com/DataTalksClub - Website - https://datatalks.club/

In this episode, we explore how data science is helping researchers simulate and understand some of the most extreme physical events on Earth, from floods in Texas to hypersonic flight. Our guests are Stephen Baek, a leading expert in geometric deep learning and associate professor of data science at the University of Virginia, and Jack Beerman, a Ph.D. student whose work is already shaping real-world applications.

Together, they discuss how AI is transforming fields like weather forecasting, materials design, sports performance, and military innovation—and why graduate researchers like Jack are essential to moving this work forward.

Data interviews do not have to feel messy. In this episode, I share a simple AI Interview Copilot that works for data analyst, data scientist, analytics engineer, product analyst, and marketing analyst roles. What you will learn today: How to Turn a Job Post into a Skills Map: Know Exactly What to Study First.How to build role-specific SQL drills (joins, window functions, cohorts, retention, time series).How to practice product/case questions that end with a decision and a metric you can defend.How to prepare ML/experimentation basics (problem framing, features, success metrics, A/B test sanity checks).How to plan take-home assignments (scope, assumptions, readable notebook/report structure).How to create a 6-story STAR bank with real numbers and clear outcomes.How to follow a 7-day rhythm so you make steady progress without burnout.How to keep proof of progress so your confidence comes from evidence, not hope.Copy-and-use prompts from the show: JD → Skills Map: “Parse this job post. Table: Skill/Theme | Where mentioned | My level (guess) | Study action | Likely interview questions. Then give 5 bullets: what they are really hiring for.”SQL Drill Factory (Analyst/Product/Marketing): “Create 20 SQL tasks + hint + how to check results using orders, users, events, campaigns. Emphasize joins, windows, conditional agg, cohorts, funnels, retention, time windows.”Case Coach (Data/Product): “Run a 15-minute case: key metric is down. Ask one question at a time. Score clarity, structure, metrics, trade-offs. End with gaps + practice list.”ML/Experimentation Basics (Data Science): “Create a 7-step outline for framing a modeling problem (goal, data, features, baseline, evaluation, risks, comms). Add an A/B test sanity checklist (power, SRM, population, metric guardrails).”Take-Home Planner: “Given this brief, propose scope, data assumptions, 3–5 analysis steps, visuals, and a short results section. Output a clear report outline.”Behavioral STAR Bank: “Draft 6 STAR stories (120s) for conflict, ambiguity, failure, leadership without title, stakeholder influence, measurable impact. Put numbers in Results.”

In this episode, we chat with Dashel Ruiz, whose journey spans semiconductors, machine learning, and teaching. Dashel shares how he transitioned from hardware to data science, navigated complex projects in diverse industries, and now combines technical expertise with a passion for teaching. Tune in to hear insights on building a career in data, mastering new technologies, and making an impact both in the lab and the classroom.

TIMECODES 00:00 Dashel's unique career path from music to semiconductors 06:16 The transition into data and software engineering at Microchip 11:44 Discovering machine learning to solve real problems in semiconductor manufacturing 20:40 How Dashel found and his experience with the Machine Learning Zoomcamp 29:33 The practical advantages of DataTalks.Club courses over other platforms 39:52 Overcoming challenges and the value of the learning community 48:10 Hands-on project experience: From image classification to Kaggle competitions 54:12 Staying motivated throughout the long-term course 59:55 The importance of deployment and full-stack ML skills 1:07:36 Closing thoughts on teaching and future courses

Connect with Dashel Linkedin - https://www.linkedin.com/in/dashel-ruiz-perez-2b036172/ Connect with DataTalks.Club: Join the community - https://datatalks.club/slack.htmlSubscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/r?cid=ZjhxaWRqbnEwamhzY3A4ODA5azFlZ2hzNjBAZ3JvdXAuY2FsZW5kYXIuZ29vZ2xlLmNvbQCheck other upcoming events - https://lu.ma/dtc-eventsGitHub: https://github.com/DataTalksClubLinkedIn - https://www.linkedin.com/company/datatalks-club/ Twitter - https://twitter.com/DataTalksClub Website - https://datatalks.club/

If you're thinking about Data Analyst or Data Scientist career paths, then this one is for you! In this episode with Data Career Jumpstart Founder Avery Smith, you'll learn about the differences between Analyst and Data Scientist career paths, and hear some practical advice to help you on your journey. You'll leave with a better understanding of different data roles, which might be the better fit for you, and a concrete roadmap for taking action and accelerating your career. What You'll Learn: Key differences between Data Analyst and Data Science roles The critical tools to focus on to land a job in either role A step by step playbook for building the skills you need to succeed   This session was part of our OPEN CAMPUS week in October, which included 6 days of live expert sessions.   Register for free to be part of the next live session: https://bit.ly/3XB3A8b

Follow us on Socials: LinkedIn YouTube Instagram (Mavens of Data) Instagram (Maven Analytics) TikTok Facebook Medium X/Twitter

Who’s the most clutch quarterback in NFL history — Tom Brady, Patrick Mahomes, Aaron Rodgers, or someone completely unexpected? We’ll use Python + Data Science to figure it out.  👉 Try Sphinx for free - https://www.sphinx.ai ⏱️ TIMESTAMPS00:00 - Who’s the most clutch QB? 00:40 - Python + Sphinx AI: analyzing 1M NFL plays 02:00 - Defining “clutch” in football (data-driven approach) 03:15 - “TV Clutch” Top 10 07:50 - Using AI to processes play-by-play data 11:10 - Advanced Clutch Factor 17:00 - Advanced Top 10 24:30 - Build your own analysis 🔗 RESOURCES & LINKS💌 Join 20k+ aspiring data analysts — https://www.datacareerjumpstart.com/newsletter 🎯 Free Training: How to Land Your First Data Job — https://www.datacareerjumpstart.com/training 👩‍💻 Accelerator Program: Data Analytics Accelerator — https://www.datacareerjumpstart.com/daa 💼 Interview Prep Tool: Interview Simulator — https://www.datacareerjumpstart.com/interviewsimulator 📱 CONNECT WITH AVERY🎥 YouTube: @averysmith 🤝 LinkedIn: https://www.linkedin.com/in/averyjsmith 📸 Instagram: https://instagram.com/datacareerjumpstart 🎵 TikTok: https://www.tiktok.com/@verydata 💻 Website: https://www.datacareerjumpstart.com 📱 CONNECT WITH SPHINX🐦Twitter/X - https://x.com/getsphinx 🔗Linkedin - https://www.linkedin.com/company/sphinx-ml/ Mentioned in this episode: Join the last cohort of 2025! The LAST cohort of The Data Analytics Accelerator for 2025 kicks off on Monday, December 8th and enrollment is officially open!

To celebrate the end of the year, we’re running a special End-of-Year Sale, where you’ll get: ✅ A discount on your enrollment 🎁 6 bonus gifts, including job listings, interview prep, AI tools + more

If your goal is to land a data job in 2026, this is your chance to get ahead of the competition and start strong.

👉 Join the December Cohort & Claim Your Bonuses: https://DataCareerJumpstart.com/daa https://www.datacareerjumpstart.com/daa

Think you need a fancy degree to start a career in data? Think again. In this episode of Data Career School, Amlan Mohanty breaks down exactly how you can launch a successful data career and land your first job in data analytics, data science, or business intelligence without a traditional degree. Discover how to build in-demand skills, create a portfolio that gets noticed, and land your first data job using practical, actionable strategies. Whether you’re self-taught, switching careers, or just curious about the data field, this episode gives you the perfect roadmap to break into a data career.

At PyData Berlin, community members and industry voices highlighted how AI and data tooling are evolving across knowledge graphs, MLOps, small-model fine-tuning, explainability, and developer advocacy.

  • Igor Kvachenok (Leuphana University / ProKube) combined knowledge graphs with LLMs for structured data extraction in the polymer industry, and noted how MLOps is shifting toward LLM-focused workflows.
  • Selim Nowicki (Distill Labs) introduced a platform that uses knowledge distillation to fine-tune smaller models efficiently, making model specialization faster and more accessible.
  • Gülsah Durmaz (Architect & Developer) shared her transition from architecture to coding, creating Python tools for design automation and volunteering with PyData through PyLadies.
  • Yashasvi Misra (Pure Storage) spoke on explainable AI, stressing accountability and compliance, and shared her perspective as both a data engineer and active Python community organizer.
  • Mehdi Ouazza (MotherDuck) reflected on developer advocacy through video, workshops, and branding, showing how creative communication boosts adoption of open-source tools like DuckDB.

Igor Kvachenok Master’s student in Data Science at Leuphana University of Lüneburg, writing a thesis on LLM-enhanced data extraction for the polymer industry. Builds RDF knowledge graphs from semi-structured documents and works at ProKube on MLOps platforms powered by Kubeflow and Kubernetes.

Connect: https://www.linkedin.com/in/igor-kvachenok/

Selim Nowicki Founder of Distill Labs, a startup making small-model fine-tuning simple and fast with knowledge distillation. Previously led data teams at Berlin startups like Delivery Hero, Trade Republic, and Tier Mobility. Sees parallels between today’s ML tooling and dbt’s impact on analytics.

Connect: https://www.linkedin.com/in/selim-nowicki/

Gülsah Durmaz Architect turned developer, creating Python-based tools for architectural design automation with Rhino and Grasshopper. Active in PyLadies and a volunteer at PyData Berlin, she values the community for networking and learning, and aims to bring ML into architecture workflows.

Connect: https://www.linkedin.com/in/gulsah-durmaz/

Yashasvi (Yashi) Misra Data Engineer at Pure Storage, community organizer with PyLadies India, PyCon India, and Women Techmakers. Advocates for inclusive spaces in tech and speaks on explainable AI, bridging her day-to-day in data engineering with her passion for ethical ML.

Connect: https://www.linkedin.com/in/misrayashasvi/

Mehdi Ouazza Developer Advocate at MotherDuck, formerly a data engineer, now focused on building community and education around DuckDB. Runs popular YouTube channels ("mehdio DataTV" and "MotherDuck") and delivered a hands-on workshop at PyData Berlin. Blends technical clarity with creative storytelling.

Connect: https://www.linkedin.com/in/mehd-io/

In this episode, we talk with Daniel, an astrophysicist turned machine learning engineer and AI ambassador. Daniel shares his journey bridging astronomy and data science, how he leveraged live courses and public knowledge sharing to grow his skills, and his experiences working on cutting-edge radio astronomy projects and AI deployments. He also discusses practical advice for beginners in data and astronomy, and insights on career growth through community and continuous learning.TIMECODES00:00 Lunar eclipse story and Daniel’s astronomy career04:12 Electromagnetic spectrum and MEERKAT data explained10:39 Data analysis and positional cross-correlation challenges15:25 Physics behind radio star detection and observation limits16:35 Radio astronomy’s advantage and machine learning potential20:37 Radio astronomy progress and Daniel’s ML journey26:00 Python tools and experience with ZoomCamps31:26 Intel internship and exploring LLMs41:04 Sharing progress and course projects with orchestration tools44:49 Setting up Airflow 3.0 and building data pipelines47:39 AI startups, training resources, and NVIDIA courses50:20 Student access to education, NVIDIA experience, and beginner astronomy programs57:59 Skills, projects, and career advice for beginners59:19 Starting with data science or engineering1:00:07 Course sponsorship, data tools, and learning resourcesConnect with Daniel Linkedin -   / egbodaniel   Connect with DataTalks.Club: Join the community - https://datatalks.club/slack.htmlSubscribe to our Google calendar to have all our events in your calendar - https://calendar.google.com/calendar/...Check other upcoming events - https://lu.ma/dtc-eventsGitHub: https://github.com/DataTalksClubLinkedIn -   / datatalks-club   Twitter -   / datatalksclub   Website - https://datatalks.club/

Ken Jee has spent a decade in sports analytics, working at the intersection of data science and athlete performance. Now, he's building The Exponential Athlete, a podcast dedicated to exploring what makes athletes reach their highest potential. In this show, Ken shares: His 10-year journey in sports analytics and the lessons data can, and can't teach us about performance. How his background in data science set him up to successfully launch The Exponential Athlete. The limits of analytics — why diagnosis is easy, but decision-making is complex. How mental visualization (seeing success before it happens) plays a crucial role in athletic and personal excellence. The intersection of training philosophy, psychology, and data in shaping elite performers. Whether you're passionate about sports, data science, entrepreneurship, or personal growth, this episode offers practical insights you can apply immediately. 🤝 Follow Ken on LinkedIn!   Register for free to be part of the next live session: https://bit.ly/3XB3A8b   Follow us on Socials: LinkedIn YouTube Instagram (Mavens of Data) Instagram (Maven Analytics) TikTok Facebook Medium X/Twitter

podcast_episode
by Chris Bruehl (Institute for Advanced Analytics (IAA) at NC State)

In this episode, we're joined by Maven's own Chris Bruehl to unpack the 2025 data science landscape and explore what it really takes to break into the field today. If you're curious about what data scientists actually do — and how to become one — you won't want to miss this! What You'll Learn: How the data scientist role compares to other data careers The essential skills you need to land a data science job in 2025 Smart strategies to position yourself before applying   🤝 Follow Chris on LinkedIn!   Register for free to be part of the next live session: https://bit.ly/3XB3A8b   Follow us on Socials: LinkedIn YouTube Instagram (Mavens of Data) Instagram (Maven Analytics) TikTok Facebook Medium X/Twitter

Você já parou para pensar quais viéses seu algoritmo pode carregar e como isso impacta suas análises? Neste episódio, conversamos com Andressa Freires, fundadora da diversiData e Data Science Specialist, sobre como as perspectivas dos desenvolvedores de AIs e modelos podem transpassar no conteúdo criado por essas tecnologias. Além disso, discutimos como a falta de diversidade pode impactar as ferramentas que são amplamente utilizadas pelo mundo e as consequências desse movimento. Lembrando que você pode encontrar todos os podcasts da comunidade Data Hackers no Spotify, iTunes, Google Podcast, Castbox e muitas outras plataformas. Nossa Bancada Data Hackers: Paulo Vasconcellos — Co-founder da Data Hackers e Principal Data Scientist na Hotmart. Monique Femme — Head of Community Management na Data Hackers Referências: https://mitsloanreview.com.br/quebrando-correntes-e-liderando-com-proposito/ https://linktr.ee/diversidata https://www.amazon.com/Unmasking-AI-Mission-Protect-Machines/dp/0593241835 https://www.amazon.com/Weapons-Math-Destruction-Increases-Inequality/dp/0553418815