talk-data.com talk-data.com

Event

DataTalks.Club

2020-11-21 – 2025-11-28 Podcasts Visit website ↗

Activities tracked

201

DataTalks.Club - the place to talk about data!

Sessions & talks

Showing 51–75 of 201 · Newest first

Search within this event →

Navigating Challenges and Innovations in Search Technologies - Atita Arora

2023-12-27 Listen
podcast_episode

We talked about:

Atita’s background How NLP relates to search Atita’s experience with Lucidworks and OpenSource Connections Atita’s experience with Qdrant and vector databases Utilizing vector search Major changes to search Atita has noticed throughout her career RAG (Retrieval-Augmented Generation) Building a chatbot out of transcripts with LLMs Ingesting the data and evaluating the results Keeping humans in the loop Application of vector databases for machine learning Collaborative filtering Atita’s resource recommendations

Links:

LinkedIn: https://www.linkedin.com/in/atitaarora/
Twitter: https://x.com/atitaarora Github: https://github.com/atarora Human-in-the-Loop Machine Learning: https://www.manning.com/books/human-in-the-loop-machine-learning Relevant Search: https://www.manning.com/books/relevant-search Let's learn about Vectors: https://hub.superlinked.com/ Langchain: https://python.langchain.com/docs/get_started/introduction Qdrant blog: https://blog.qdrant.tech/ OpenSource Connections Blog: https://opensourceconnections.com/blog/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

The Entrepreneurship Journey: From Freelancing to Starting a Company - Adrian Brudaru

2023-12-19 Listen
podcast_episode

We talked about:

Adrian’s background The benefits of freelancing Having an agency vs freelancing What let Adrian switch over from freelancing The conception of DLT (Growth Full Stack) The investment required to start a company Growth through the provision of services Growth through teaching (product-market fit) Moving on to creating docs Adrian’s current role Strategic partnerships and community growth through DocDB Plans for the future of DLT DLT vs Airbyte vs Fivetran Adrian’s resource recommendations

Links:

Adrian's LinkedIn: https://www.linkedin.com/in/data-team/ Twitter: https://twitter.com/dlt_library Github: https://github.com/dlt-hub/dlt Website: https://dlthub.com/docs/intro

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Become a Data Freelancer - Dimitri Visnadi

2023-12-17 Listen
podcast_episode
Dimitri Visnadi (The DataFreelancer)

We talked about:

Dimitri’s background The first steps of transitioning into freelance Working with recruiters (contracting) Deciding on what to charge for your services Establishing your network Self-marketing Contracting vs freelancing Which channel is better for those starting out? Cutting out the middleman Where to look for clients and how to vet them The different way of getting into freelancing Going back to a full-time job after freelancing Common mistakes freelancers make Dimitri’s resource suggestions Reaching out to Dimitri

Links:

LinkedIn profile: http://www.linkedin.com/in/visnadi The DataFreelancer website: https://thedatafreelancer.com/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

AI for Digital Health - Maria Bruckert

2023-12-04 Listen
podcast_episode

We talked about:

Maria’s background Deciding to go into telecare (healthcare) Current difficulties in healthcare Getting into the healthcare industry as a lifestyle brand The importance of a plan B and being flexible What is SQIN and the importance of communication Going from lipstick to skin health analysis The importance of community and broadening your audience The importance of feedback and communicating benefits The current state and growth of SQIN Convincing investors and the importance of proving profitability Maria’s role at SQIN Balancing a newborn child and a new company

Links:

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Cracking the Code: Machine Learning Made Understandable - Christoph Molnar

2023-11-26 Listen
podcast_episode

We talked about:

Christoph’s background Kaggle and other competitions How Christoph became interested in interpretable machine learning Interpretability vs Accuracy Christoph’s current competition engagement How Christoph chooses topics for books Why Christoph started the writing journey with a book Self-publishing vs via a publisher Christoph’s other books What is conformal prediction? Christoph’s book on SHAP Explainable AI vs Interpretable AI Working alone vs with other people Christoph’s other engagements and how to stay hands-on Keeping a logbook Does one have to be an expert on the topic to write a book about it? Writing in the open and other feedback gathering methods Advice for those who want to be technical writers Self-publishing tools Finding Christoph online

Links:

LinkedIn: https://www.linkedin.com/in/christoph-molnar/ Website: https://christophmolnar.com/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

The Unwritten Rules for Success in Machine Learning - Jack Blandin

2023-11-20 Listen
podcast_episode

We talked about:

Jack’s background Transitioning from IC to management Lesson not taught in traditional school The importance of people’s perception, trust, and respect How soft skills are relevant to machine learning How to put on a salesman hat in machine learning management The importance of visuals and building a POC as fast as possible 1st Rule of Machine Learning – don’t be afraid to start without machine learning The importance of understanding the reality that data represents The importance of putting yourself in the shoes of customers The importance of software engineering skills in machine learning Where to find Jack’s content Jack’s next venture

Links:

Jack's LinkedIn profile: https://www.linkedin.com/in/jackblandin/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

From a Research Scientist at Amazon to a Machine learning/AI Consultant - Verena Webber

2023-11-10 Listen
podcast_episode
Verena Webber (Amazon)

Links:

Mini sound bath: https://www.youtube.com/watch?v=g-lDrcSqcrQ

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

From Marketing to Product Owner in Search - Lera Kaimashnіkova

2023-11-05 Listen
podcast_episode

We talked about:

Lera’s background Lera’s move from Ukraine to Germany The transition from Marketing to Product Ownership The importance of communication and one-on-ones The role of Product Owner Utilizing Scrum as a Product Owner Building teams and cross-functionality Lera’s experience learning about search The importance of having both technical knowledge and business context Open developer positions at AUTODOC What experience Lera came to AUTODOC with How marketing skills helped Lera in her current role Lera’s resource recommendations Everything is possible

Links:

Post: https://www.linkedin.com/posts/leracaiman_elasticsearch-ecommerce-activity-7106615081588674560-5WQO

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Collaborative Data Science in Business - Ioannis Mesionis

2023-10-27 Listen
podcast_episode

Links:

LinkedIn: https://www.linkedin.com/in/ioannis-mesionis/
Github: https://github.com/ioannismesionis Website: https://ioannismesionis.github.io/

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Bridging Data Science and Healthcare - Eleni Stamatelou

2023-10-20 Listen
podcast_episode

Free ML Engineering course: http://mlzoomcamp.com

Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

DataTalks.Club Anniversary Interview - Alexey Grigorev, Johanna Bayer

2023-10-12 Listen
podcast_episode

Free ML Engineering course: http://mlzoomcamp.com Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Data Engineering for Fraud Prevention - Angela Ramirez

2023-10-06 Listen
podcast_episode
Angela Ramirez (Sam's Club)

We talked about:

Angela's background Angela's role at Sam's Club The usefulness of knowing ML as a data engineer Angela's career path Transitioning from data analyst to data engineer/system designer Best practices for system design and data engineering Working with document databases Working with network-based databases Detecting fraud with a network-based database Selecting the database type to work with Neo4j vs Postgres The importance of having software engineering knowledge in data engineering Data quality check tooling The greatest challenges in data engineering Debugging and finding the root cause of a failed job What kinds of tools Angela uses on a daily basis Working with external data sources Angela's resource recommendations

Links:

LinkedIn: https://www.linkedin.com/in/aramirez1305/ Twitter: https://twitter.com/angelamaria__r Github: https://github.com/aramir62 Previous podcast talk: https://twitter.com/i/spaces/1OwGWwZAZDnGQ?s=20

Free ML Engineering course: http://mlzoomcamp.com

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From Data Manager to Data Architect - Loïc Magnien

2023-09-29 Listen
podcast_episode

We talked about:

Loïc's background Data management Loïc's transition to data engineer Challenges in the transition to data engineering What is a data architect? The output of a data architect's work Establishing metrics and dimensions The importance of communication Setting up best practices for the team Staying relevant and tech-watching Setting up specifications for a pipeline Be agile, create a POC, iterate ASAP, and build reusable templates Reaching out to Loïc for questions

Links:

Loiic LinkedIn: https://www.linkedin.com/in/loicmagnien/

Free ML Engineering course: http://mlzoomcamp.com

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Pragmatic and Standardized MLOps - Maria Vechtomova

2023-09-08 Listen
podcast_episode
Maria Vechtomova (Marvelous MLOps)

We talked about:

Maria's background Marvelous MLOps Maria's definition of MLOps Alternate team setups without a central MLOps team Pragmatic vs non-pragmatic MLOps Must-have ML tools (categories) Maturity assessment What to start with in MLOps Standardized MLOps Convincing DevOps to implement Understanding what the tools are used for instead of knowing all the tools Maria's next project plans Is LLM Ops a thing? What Ahold Delhaize does Resource recommendations to learn more about MLOps The importance of data engineering knowledge for ML engineers

Links:

LinkedIn: https://www.linkedin.com/company/marvelous-mlops/

Website: https://marvelousmlops.substack.com/

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Democratizing Causality - Aleksander Molak

2023-08-25 Listen
podcast_episode

We talked about:

Aleksander's background Aleksander as a Causal Ambassador Using causality to make decisions Counterfactuals and and Judea Pearl Meta-learners vs classical ML models Average treatment effect Reducing causal bias, the super efficient estimator, and model uplifting Metrics for evaluating a causal model vs a traditional ML model Is the added complexity of a causal model worth implementing? Utilizing LLMs in causal models (text as outcome) Text as treatment and style extraction The viability of A/B tests in causal models Graphical structures and nonparametric identification Aleksander's resource recommendations

Links:

The Book of Why: https://amzn.to/3OZpvBk Causal Inference and Discovery in Python: https://amzn.to/46Pperr Book's GitHub repo: https://github.com/PacktPublishing/Causal-Inference-and-Discovery-in-Python The Battle of Giants: Causality vs NLP (PyData Berlin 2023): https://www.youtube.com/watch?v=Bd1XtGZhnmw New Frontiers in Causal NLP (papers repo): https://bit.ly/3N0TFTL

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Mastering Data Engineering as a Remote Worker - José María Sánchez Salas

2023-08-18 Listen
podcast_episode

We talked about:

José's background How José relocated to Norway and his schedule Tech companies in Norway and José role Challenges of working as a remote data engineer José's newsletter on how to make use of data The process of making data useful Where José gets inspiration for his newsletter Dealing with burnout When in Norway, do as the Norwegians do The legalities of working remotely in Norway The benefits of working remotely

Links:

LinkedIn: https://www.linkedin.com/in/jmssalas Github: https://github.com/jmssalas Website & Newsletter: https://jmssalas.com

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

The Good, the Bad and the Ugly of GPT - Sandra Kublik

2023-08-04 Listen
podcast_episode

We talked about:

Sandra's background Making a YouTube channel to break into the LLM space The business cases for LLMs LLMs as amplifiers The befits of keeping a human in the loop when using LLMs (AI limitations) Using LLMs as assistants Building an app that uses an LLM Prompt whisperers and how to improve your prompts Sandra's 7-day LLM experiment Sandra's LLM content recommendations Finding Sandra online

Links:

LinkedIn: https://www.linkedin.com/in/sandrakublik/ Twitter: https://twitter.com/sandra_kublik Youtube: https://www.youtube.com/@sandra_kublik

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

LLMs for Everyone - Meryem Arik

2023-07-28 Listen
podcast_episode
Meryem Arik (TitanML)

We talked about:

Meryam's background The constant evolution of startups How Meryam became interested in LLMs What is an LLM (generative vs non-generative models)? Why LLMs are important Open source models vs API models What TitanML does How fine-tuning a model helps in LLM use cases Fine-tuning generative models How generative models change the landscape of human work How to adjust models over time Vector databases and LLMs How to choose an open source LLM or an API Measuring input data quality Meryam's resource recommendations

Links:

Website: https://www.titanml.co/ Beta docs: https://titanml.gitbook.io/iris-documentation/overview/guide-to-titanml... Using llama2.0 in TitanML Blog: https://medium.com/@TitanML/the-easiest-way-to-fine-tune-and-inference-llama-2-0-8d8900a57d57 Discord: https://discord.gg/83RmHTjZgf Meryem LinkedIn: https://www.linkedin.com/in/meryemarik/

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html

Investing in Open-Source Data Tools - Bela Wiertz

2023-07-21 Listen
podcast_episode
Bela Wiertz (TKM Family Office)

We talked about:

Bela's background Why startups even need investors Why open source is a viable go-to-market strategy Building a bottom-up community The investment thesis for the TKM Family Office and the blurriness of the funding round naming convention Angel investors vs VC Funds vs family offices Bela's investment criteria and GitHub stars as a metric Inbound sourcing, outbound sourcing, and investor networking Making a good impression on an investor Balancing open and closed source parts of a product The future of open source Recent successes of open source companies Bela's resource recommendations

Links:

Understand who is engaging with your open source project article: https://www.crowd.dev/ Top 6 Books on Developer Community Building: https://www.crowd.dev/post/top-6-books-on-developer-community-building Which open source software metrics matter: https://www.bvp.com/atlas/measuring-the-engagement-of-an-open-source-software-community#Which-open-source-software-metrics-matter

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Why Machine Learning Design is Broken - Valerii Babushkin

2023-07-14 Listen
podcast_episode

Links:

Book: https://www.manning.com/books/machine-learning-system-design?utm_source=AGMLBookcamp&utm_medium=affiliate&utm_campaign=book_babushkin_machine_4_25_23&utm_content=twitter Discount: poddatatalks21 (35% off) Evidently: https://www.evidentlyai.com/ Article: https://medium.com/people-ai-engineering/design-documents-for-ml-models-bbcd30402ff7

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Interpretable AI and ML - Polina Mosolova

2023-07-07 Listen
podcast_episode

We talked about:

Polina's background How common it is for PhD students to build ML pipelines end-to-end Simultaneous PhD and industry experience Support from both the academic and industry sides How common the industrial PhD setup is and how to get into one Organizational trust theory How price relates to trust How trust relates to explainability The importance of actionability Explainability vs interpretability vs actionability Complex glass box models Does the explainability of a model follow explainability? What explainable AI bring to customers and end users Can all trust be turned into KPI?

Links:

LinkedIn: https://www.linkedin.com/in/polina-mosolova/ Neural Additive Models paper: https://proceedings.neurips.cc/paper/2021/file/251bd0442dfcc53b5a761e050f8022b8-Paper.pdf Neural Basis Model paper: https://arxiv.org/pdf/2205.14120.pdf Interpretable Feature Spaces paper: https://kdd.org/exploration_files/vol24issue1_1._Interpretable_Feature_Spaces_revised.pdf

From Scratch to Success: Building an MLOps Team and ML Platform - Simon Stiebellehner

2023-06-30 Listen
podcast_episode

We talked about:

Simon's background What MLOps is and what it isn't Skills needed to build an ML platform that serves 100s of models Ranking the importance of skills The point where you should think about building an ML platform The importance of processes in ML platforms Weighing your options with SaaS platforms The exploratory setup, experiment tracking, and model registry What comes after deployment? Stitching tools together to create an ML platform Keeping data governance in mind when building a platform What comes first – the model or the platform? Do MLOps engineers need to have deep knowledge of how models work? Is API design important for MLOps? Simon's recommendations for furthering MLOps knowledge

Links:

LinkedIn: https://www.linkedin.com/in/simonstiebellehner/ Github: https://github.com/stiebels Medium: https://medium.com/@sistel

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From MLOps to DataOps - Santona Tuli

2023-06-23 Listen
podcast_episode

We talked about:

Santona's background Focusing on data workflows Upsolver vs DBT ML pipelines vs Data pipelines MLOps vs DataOps Tools used for data pipelines and ML pipelines The “modern data stack” and today's data ecosystem Staging the data and the concept of a “lakehouse” Transforming the data after staging What happens after the modeling phase Human-centric vs Machine-centric pipeline Applying skills learned in academia to ML engineering Crafting user personas based on real stories A framework of curiosity Santona's book and resource recommendations

Links:

LinkedIn: https://www.linkedin.com/in/santona-tuli/ Upsolver website: upsolver.com Why we built a SQL-based solution to unify batch and stream workflows: https://www.upsolver.com/blog/why-we-built-a-sql-based-solution-to-unify-batch-and-stream-workflows

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Data Developer Relations - Hugo Bowne-Anderson

2023-06-16 Listen
podcast_episode
Hugo Bowne-Anderson (DataCamp)

We talked about:

Hugo's background Why do tools and the companies that run them have wildly different names Hugo's other projects beside Metaflow Transitioning from educator to DevRel What is DevRel? DevRel vs Marketing How DevRel coordinates with developers How DevRel coordinates with marketers What skills a DevRel needs The challenges that come with being an educator Becoming a good writer: nature vs nurture Hugo's approach to writing and suggestions Establishing a goal for your content Choosing a form of media for your content Is DevRel intercompany or intracompany? The Vanishing Gradients podcast Finding Hugo online

Links:

Hugo Browne's github: http://hugobowne.github.io/ Vanishing Gradients: https://vanishinggradients.fireside.fm/ MLOps and DevOps: Why Data Makes It Differenthttps://www.oreilly.com/radar/mlops-and-devops-why-data-makes-it-different/ Evaluate Metaflow for free, right from your Browser: https://outerbounds.com/sandbox/

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Lessons Learned from Freelancing and Working in a Start-up - Antonis Stellas

2023-06-09 Listen
podcast_episode

We talked about;

Antonis' background The pros and cons of working for a startup Useful skills for working at a startup and the Lean way to work How Antonis joined the DataTalks.Club community Suggestions for students joining the MLOps course Antonis contributing to Evidently AI How Antonis started freelancing Getting your first clients on Upwork Pricing your work as a freelancer The process after getting approved by a client Wearing many hats as a freelancer and while working at a startup Other suggestions for getting clients as a freelancer Antonis' thoughts on the Data Engineering course Antonis' resource recommendations

Links:

Lean Startup by Eric Ries: https://theleanstartup.com/ Lean Analytics: https://leananalyticsbook.com/ Designing Machine Learning Systems by Chip Huyen: https://www.oreilly.com/library/view/designing-machine-learning/9781098107956/ Kafka Streaming with python by Khris Jenkins tutorial video: https://youtu.be/jItIQ-UvFI4

Free MLOps course: https://github.com/DataTalksClub/mlops-zoomcamp Join DataTalks.Club: https://datatalks.club/slack.html Our events: https://datatalks.club/events.html