talk-data.com talk-data.com

Event

DataTalks.Club

2020-11-21 – 2025-11-28 Podcasts Visit website ↗

Activities tracked

179

DataTalks.Club - the place to talk about data!

Filtering by: HTML ×

Sessions & talks

Showing 101–125 of 179 · Newest first

Search within this event →

Decoding Data Science Job Descriptions - Tereza Iofciu

2022-08-05 Listen
podcast_episode

We talked about:

DataTalks.Club intro Tereza’s background Working as a coach Identifying the mismatches between your needs and that of a company How to avoid misalignments Considering what’s mentioned in the job description, what isn’t, and why Diversity and culture of a company Lack of a salary in the job description Way of doing research about the company where you will potentially work How to avoid a mismatch with a company other than learning from your mistakes Before data, during data, after data (a company’s data maturity level) The company’s tech stack Finding Tereza online

Links: 

Decoding Data Science Job Descriptions (talk): https://www.youtube.com/watch?v=WAs9vSNTza8 Talk at ConnectForward: https://www.youtube.com/watch?v=WAs9vSNTza8 Slides: https://www.slideshare.net/terezaif/decoding-data-science-job-descriptions-250687704 Talk at DataLift: https://www.youtube.com/watch?v=pCtQ0szJiLA Slides: https://www.slideshare.net/terezaif/lessons-learned-from-hiring-and-retaining-data-practitioners

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Data Science for Social Impact - Christine Cepelak

2022-07-29 Listen
podcast_episode

We talked about:

Christine’s Background Private sector vs Public sector Public policy The challenges of being a community organizer How public policy relates to political science Programs that teach data science for public policy Data science for public policy vs regular data science The importance of ethical data science in public policy How data science in social impact project differs from other projects Other resources to learn about data science for public policy Challenges with getting data in data science for public policy The problems with accessing public datasets about recycling Christine’s potential projects after Master’s degree Gender inequality in STEM fields Corporate responsibility and why organizations need social impact data scientists What you need to start making a social impact with data science 80,000 hours Other use cases for public policy data science Coffee, Ethics & AI Finding Christine online

Links:

Explore some Data Science for Social Good projects: http://www.dssgfellowship.org/projects/ Bi-weekly Ethics in AI Coffee Chat: https://www.meetup.com/coffee-ethics-ai/ Make a Social Impact with your Job: https://tinyurl.com/80khours Course in Data Ethics: https://ethics.fast.ai/ Data Science for Social Good Berlin: https://dssg-berlin.org/ CorrelAid: https://correlaid.org/ DataKind: https://www.datakind.org/ Christine's LinkedIn: https://www.linkedin.com/in/christinecepelak/ Christine's Twitter: https://twitter.com/CLcep 

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Hiring Data Science Talent - Olga Ivina

2022-07-22 Listen
podcast_episode

We talked about:

Olga’s career journey Hiring data scientists now vs 7 years ago The two qualities of an excellent data scientist What makes Alexey do this podcast How Alexey get the latest information on data science How Olga checks a candidate’s technical skills How to make an answer stand out (showing your depth of knowledge) A strong mathematical background vs a strong engineering background When Auto ML will replace the need to have data scientists Should data scientists transition into management? (the importance of communication in an organization) Switching from a data analyst role to a data scientist Attracting female talent in data science Changing a job description to find talent Long gaps in the CV Eierlegende Wollmilchsau

Links:

Olga's LinkedIn: https://www.linkedin.com/in/olgaivina/  Olga's Twitter: https://twitter.com/olgaivina

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From Open-Source Maintainer to Founder - Will McGugan

2022-07-15 Listen
podcast_episode
Will McGugan (Textualize)

We talked about: 

Will’s background Will’s open source projects S3Fs and PyFile systems Inspiration for open source projects Will as a freelancer Starting a company from a tweet (Rich and Textual) Building in public (Will’s approach to social media) The workforce and roadmap of Textualize.io The importance of working on open source for Textualize employees The workflow of and contributions to Textualize Getting your first thousand GitHub Stars (going viral) Suggestions for those who wish to start in the open-source space Finding Will online

Links: 

Twitter: https://twitter.com/willmcgugan Textualize website: https://www.textualize.io/ Textualize GitHub: https://github.com/textualize

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Designing a Data Science Organization - Lisa Cohen

2022-07-08 Listen
podcast_episode

We talked about:

Lisa’s background Centralized org vs decentralized org Hybrid org (centralized/decentralized) Reporting your results in a data organization Planning in a data organization Having all the moving parts work towards the same goals Which approach Twitter follows (centralized vs decentralized) Pros and cons of a decentralized approach Pros and cons of a centralized approach Finding a common language with all the functions of an org Finding the right approach for companies that want to implement data science How many data scientists does a company need? Who do data scientists report huge findings to? The importance of partnering closely with other functions of the org The role of Product Managers in the org and across functions Who does analytics at Twitter (analysts vs data scientists) The importance of goals, objectives and key results Conflicting objectives The importance of research Finding Lisa online

Links:

LinkedIn: https://www.linkedin.com/in/cohenlisa/ Twitter: https://twitter.com/lisafeig Medium: https://medium.com/@lisa_cohen Lisa Cohen's YouTube videos: https://www.youtube.com/playlist?list=PLRhmnnfr2bX7-GAPHzvfUeIEt2iYCbI3w

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Developer Advocacy Engineer for Open-Source - Merve Noyan

2022-07-01 Listen
podcast_episode
Merve Noyan (Hugging Face)

We talked about:

Merve’s background Merve’s first contributions to open source What Merve currently does at Hugging Face (Hub, Spaces) What is means to be a developer advocacy engineer at Hugging Face The best way to get open source experience (Google Summer of Code, Hacktoberfest, and sprints) The peculiarities of hiring as it relates to code contributions Best resources to learn about NLP besides Hugging Face Good first projects for NLP The most important topics in NLP right now NLP ML Engineer vs NLP Data Scientist Project recommendations and other advice to catch the eye of recruiters Merve on Twitch and her podcast Finding Merve online Merve and Mario Kart

Links:

Hugging Face Course: https://hf.co/course Natural Language Processing in TensorFlow: https://www.coursera.org/learn/natural-language-processing-tensorflow Github ML Poetry: https://github.com/merveenoyan/ML-poetry Tackling multiple tasks with a single visual language model: https://www.deepmind.com/blog/tackling-multiple-tasks-with-a-single-visual-language-model Hugging Face big science/TOpp: https://huggingface.co/bigscience/T0pp Pathways Language Model (PaLM) blog: https://ai.googleblog.com/2022/04/pathways-language-model-palm-scaling-to.html

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Data Scientists at Work - Mısra Turp

2022-06-24 Listen
podcast_episode

We talked about:

Misra’s background What data scientists do Consultant data scientists vs in-house data scientists (and freelancers) Expectations for data scientists The importance of keeping up to date with AI developments (FOMA) How does DALL·E 2 work and should you care? Going to conferences to stay up to date The most pressing issue for data scientists Fighting FOMA and imposter syndrome Knowing when you have enough knowledge of a framework The “best” type of data scientist Being a generalist vs a specialist Advice for entry-level data entering an oversaturated market Catching the eye of big AI companies Choosing a project for your portfolio The importance of having a Ph.D. or Master’s degree in data science Finding Misra online

Links:

Mısra's YouTube channel: https://www.youtube.com/channel/UCpNUYWW0kiqyh0j5Qy3aU7w Twitter: https://twitter.com/misraturp Hands-on Data Science: Complete Your First Portfolio Project: https://www.soyouwanttobeadatascientist.com/hods 

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.htm

Freelancing and Consulting with Data Engineering - Adrian Brudaru

2022-06-17 Listen
podcast_episode

We talked about:

Adrian’s background Freelancing vs Employment Risk and occupancy rate in freelancing The scariest part of freelancing Adrian’s first projects Freelancing 5 years later Pay rates in freelancing Acquiring skills while freelancing Working with recruitment agencies and networking Looking for projects and getting clients Freelancing vs consulting Clarity in clients’ expectations (scope of work) Building your network Freelancing platforms Adrian’s data loading prototype Going from freelancing to making your own product (and other investments) The usefulness of a portfolio Introverts in freelancing Is it possible to work for 3 months a year in freelancing? Choosing projects and skill-building strategy (focusing on interests) Freelancing in Berlin Clients’ expectations for freelancers vs employees Working with more than one client at the same time Adrian’s freelance cooperative on Slack Other advice for novice freelancers (networking) Finding Adrian online

Links:

Github: https://github.com/scale-vector Slack Community: https://join.slack.com/t/berlindatacol-szn7050/shared_invite/zt-19dp8msp0-pP4Av3_fVFBbsdrzPROEAg

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Getting a Data Engineering Job (Summary and Q&A) - Jeff Katz

2022-06-10 Listen
podcast_episode
Jeff Katz (JigsawLabs.io)

We talked about:

Summary of “Getting a Data Engineering Job” webinar Python and engineering skills  Interview process Behavioral interviews Technical interviews Learning Python and SQL from scratch Is having non-coding experience a disadvantage? Analyst or engineer? Do you need certificates? Do I need a master’s degree? Fully remote data engineering jobs Should I include teaching on my resume? Object-oriented programming for data engineering Python vs Java/Scala SQL and Python technical interview questions GCP certificates Is commercial experience really necessary? From sales to engineering Solution engineers Wrapping up

Links:

Getting a Data Engineering Job (webinar): https://www.youtube.com/watch?v=yvEWG-S1F_M The Flask Mega-Tutorial Part I - Hello, World! blog: https://blog.miguelgrinberg.com/post/the-flask-mega-tutorial-part-i-hello-world Mode SQL Tutorial: https://mode.com/sql-tutorial/

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Using Data for Asteroid Mining - Daynan Crull

2022-06-03 Listen
podcast_episode

We talked about:

Daynan’s background Astronomy vs cosmology Applications of data science and machine learning in astronomy Determining signal vs noise What the data looks like in astronomy Determining the features of an object in space Ground truth for space objects Why water is an important resource in the space economy Other useful resources that can be found in asteroids Sources of asteroids The data team at an asteroid mining company Open datasets for hobbyists Mission and hardware design for asteroid mining Partnerships and hires

Links: 

LinkedIn: https://www.linkedin.com/in/daynan/ We're looking for a Sr Data Engineer: https://boards.eu.greenhouse.io/karmanplus/jobs/4027128101?gh_jid=4027128101 Minor Planet Center: https://minorplanetcenter.net/- JPL Horizons has a nice set of APIs for accessing data related to small bodies (including asteroids): https://ssd.jpl.nasa.gov/api.html ESA has NEODyS: https://newton.spacedys.com/neodys   IRSA catalog that contains image and catalog data related to the WISE/NEOWISE data (and other infrared platforms): https://irsa.ipac.caltech.edu/frontpage/ NASA also has an archive of data collected from their various missions, including a node related to small bodies: https://pds-smallbodies.astro.umd.edu/ Sub-node directly related to asteroids: https://sbn.psi.edu/pds/ Size, Mass, and Density of Asteroids (SiMDA) is a nice catalog of observed asteroid attributes (and an indication of how small our sample size is!): https://astro.kretlow.de/?SiMDA The source survey data, several are useful for asteroids: Pan-STARRS (https://outerspace.stsci.edu/display/PANSTARRS)

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Machine Learning in Marketing - Juan Orduz

2022-05-27 Listen
podcast_episode

We talked about:

Juan’s background Typical problems in marketing that are solved with ML Attribution model Media Mix Model – detecting uplift and channel saturation Changes to privacy regulations and its effect on user tracking User retention and churn prevention A/B testing to detect uplift Statistical approach vs machine learning (setting a benchmark) Does retraining MMM models often improve efficiency? Attribution model baselines Choosing a decay rate for channels (Bayesian linear regression) Learning resource suggestions Bayesian approach vs Frequentist approach Suggestions for creating a marketing department Most challenging problems in marketing The importance of knowing marketing domain knowledge for data scientists Juan’s blog and other learning resources Finding Juan online

Links: 

Juan's PyData talk on uplift modeling: https://youtube.com/watch?v=VWjsi-5yc3w Juan's website: https://juanitorduz.github.io Introduction to Algorithmic Marketing book: https://algorithmic-marketing.online Preventing churn like a bandit: https://www.youtube.com/watch?v=n1uqeBNUlRM

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From Academia to Data Analytics and Engineering - Gloria Quiceno

2022-05-20 Listen
podcast_episode

We talked about: 

Gloria’s background Working with MATLAB, R, C, Python, and SQL Working at ICE Job hunting after the bootcamp Data engineering vs Data science Using Docker Keeping track of job applications, employers and questions Challenges during the job search and transition Concerns over data privacy Challenges with salary negotiation The importance of career coaching and support Skills learned at Spiced Retrospective on Gloria’s transition to data and advice Top skills that helped Gloria get the job Thoughts on cloud platforms Thoughts on bootcamps and courses Spiced graduation project Standing out in a sea of applicants The cohorts at Spiced Conclusion

Links:

LinkedIn: https://www.linkedin.com/in/gloria-quiceno/ Github: https://github.com/gdq12

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Teaching Data Engineers - Jeff Katz

2022-05-13 Listen
podcast_episode
Jeff Katz (JigsawLabs.io)

We talked about:

Jeff’s background Getting feedback to become a better teacher Going from engineering to teaching Jeff on becoming a curriculum writer Creating a curriculum that reinforces learning Jeff on starting his own data engineering bootcamp Shifting from teaching ML and data science to teaching data engineering Making sure that students get hired Screening bootcamp applicants Knowing when it’s time to apply for jobs The curriculum of JigsawLabs.io The market demand of Spark, Kafka, and Kubernetes (or lack thereof) Advice for data analysts that want to move into data engineering The market demand of ETL/ELT and DBT (or lack thereof) The importance of Python, SQL, and data modeling for data engineering roles Interview expectations How to get started in teaching The challenges of being a one-person company Teaching fundamentals vs the “shiny new stuff” JigsawLabs.io Finding Jeff online

Links: 

Jigsaw Labs: https://www.jigsawlabs.io/free Teaching my mom to code: https://www.youtube.com/watch?v=OfWwfTXGjBM Getting a Data Engineering Job Webinar with Jeff Katz: https://www.eventbrite.de/e/getting-a-data-engineering-job-tickets-310270877547

MLOps Zoomcamp: https://github.com/DataTalksClub/mlops-zoomcamp

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From Roasting Coffee to Backend Development - Jessica Greene

2022-05-06 Listen
podcast_episode

We talked about: 

Jessica’s background Giving a talk at a tech conference about coffee Jessica’s transition into tech (How to get started) Going from learning to actually making money Landing your first job in tech Does your age matter when you’re trying to get a job? Challenges that Jessica faced in the beginning of her career Jessica’s role at PyLadies Fighting the Imposter Syndrome Generational differences in digital literacy and how to improve it Events organized by PyLadies Jessica’s beginnings at PyLadies (organizing events) Jessica’s experience with public speaking The impact of public speaking on your career Tips for public speaking Jessica’s work at Ecosia Discrimination in the tech industry (and in general) Finding Jessica online

Links:

Ecosia's website: https://www.ecosia.org/ Ecosia's blog: https://blog.ecosia.org/ecosia-financial-reports-tree-planting-receipts/ PyLadies Berlin: https://berlin.pyladies.com/ PyLadies' Meetup: https://meetup.com/PyLadies-Berlin Code Academy: https://www.codecademy.com/ Freecodecamp: https://www.freecodecamp.org/ Coursera Machine Learning: https://www.coursera.org/learn/machine-learning ML Bookcamp code: https://github.com/alexeygrigorev/mlbookcamp-code/tree/master/course-zoomcamp Google Summer code: https://summerofcode.withgoogle.com/ Outreachy website: https://www.outreachy.org/ Alumni Interview: https://railsgirlssummerofcode.org/blog/2020-03-17-alumni-interview-jessica Python pizza: https://python.pizza/ Pycon: https://pycon.it/en Pycon 2022: https://2022.pycon.de/

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Recruiting Data Engineers - Nicolas Rassam

2022-04-29 Listen
podcast_episode
Nicolas Rassam (Onfido)

We talked about: 

Nicolas’ background The tech talent market in different countries Hiring data scientists vs data engineers A spike in interest for data engineering roles The importance of recruiters having  technical knowledge The main challenges of hiring data engineers The difference in hiring junior, mid, and senior level data engineers Things recruiters look for in people who switch to a data engineering role The importance of knowing cloud tools The importance of knowing infrastructure tools Preparing for the interview The importance of a formal education The importance having a project portfolio How your current domain influence the interview Conclusion

Links: 

Nicolas' Twitter: https://twitter.com/n_rassam  Nicolas' LinkedIn: https://www.linkedin.com/in/nicolasrassam/  Onfido is hiring: https://onfido.com/engineering-technology/  Interview with Alicja about recruiting data scientists: https://datatalks.club/podcast/s07e02-recruiting-data-professionals.html Webinar "Getting a Data Engineering Job" with Jeff Katz: https://eventbrite.com/e/310270877547

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Storytime for DataOps - Christopher Bergh

2022-04-22 Listen
podcast_episode
Christopher Bergh (DataKitchen)

We talked about:

Christopher’s background The essence of DataOps Also known as Agile Analytics Operations or DevOps for Data Science Defining processes and automating them (defining “done” and “good”) The balance between heroism and fear (avoiding deferred value) The Lean approach Avoiding silos The 7 steps to DataOps Wanting to become replaceable DataOps is doable Testing tools DataOps vs MLOps The Head Chef at Data Kitchen What’s grilling at Data Kitchen? The DataOps Cookbook

Links:

DataOps Manifesto website: https://dataopsmanifesto.org/en/ DataOps Cookbook: https://dataops.datakitchen.io/pf-cookbook Recipes for DataOps Success: https://dataops.datakitchen.io/pf-recipes-for-dataops-success DataOps Certification Course: https://info.datakitchen.io/training-certification-dataops-fundamentals DataOps Blog: https://datakitchen.io/blog/ DataOps Maturity Model: https://datakitchen.io/dataops-maturity-model/ DataOps Webinars: https://datakitchen.io/webinars/

Join DataTalks.Club: https://datatalks.club/slack.html  

Our events: https://datatalks.club/events.html

Machine Learning and Personalization in Healthcare - Stefan Gudmundsson

2022-04-15 Listen
podcast_episode
Stefan Gudmundsson (Sidekick Health)

We talked about:

Stefan’s background Applications of machine learning in healthcare Sidekick Health – gamified therapeutics How is working for King different from Sidekick Health? The rewards systems in gamified apps The importance of building a strong foundation for a data science team The challenges of building an app in the healthcare industry Dealing with ethics issues Sidekick Health’s personalized recommendations and content The importance of having the right approach in A/B tests (strong analytics and good data) The importance of having domain knowledge to work as a data professional in the healthcare industry Making a data-driven company Risks for Sidekick Health Sidekick Health growth strategy Using AI to help people live better lives

Links:

LinkedIn: https://www.linkedin.com/in/stefanfreyrgudmundsson/  Job listings: https://sidekickhealth.bamboohr.com/jobs/

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Innovation and Design for Machine Learning - Liesbeth Dingemans

2022-04-08 Listen
podcast_episode

We talked about:

Liesbeth’s background What is design? The importance of interaction in design Design as a process (Double Diamond technique) How long does it take to go from an idea to finishing the second diamond? Design thinking (Google’s PAIR) What is a Design Sprint and who should participate in it? Why should data specialists care about design? Challenging your task-giver (asking “why”) How to avoid the “Chinese whisper game” (reiterating the problem) Defining the roadmap for data science teams What is innovation? Bringing innovation to your management Task force-team approach to solving problems Innovation, resource management issues, and using data to back your ideas Words of advice for those interested in design and innovation

Links:

LinkedIn: https://www.linkedin.com/in/liesbeth-dingemans/ Medium posts on design, innovation, art and AI: https://medium.com/@liesbethmd

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Hacking Your Data Career - Marijn Markus

2022-04-01 Listen
podcast_episode

We talked about:

Marijn’s background Standing out in data science Doing the opposite of what people tell you Don’t shoot the messenger (carefully sharing your findings) Advising the seniors Bite off more than you can chew, then chew Marijn’s side projects (finding value in doing things you find interesting) Building a project portfolio Marijn’s NGO project The importance of a team Open source intelligence (OSINT) The importance of soft skills for data experts Marijn’s LinkedIn growth strategy and tips

Links:  

Twitter: https://twitter.com/MarijnMarkus LinkedIn: https://www.linkedin.com/in/marijnmarkus/

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Visualising Machine Learning - Meor Amer

2022-03-25 Listen
podcast_episode
Meor Amer (kDimensions)

We talked about:

kDimensions Being self-employed Visual engineering Constrain yourself to get creative Coming up with ideas Visualising difficult concepts The process of creating visuals Creating visuals Learning to create visuals for engineers Consuming with intention to create Learning by breaking code Earning with visuals Adding visuals to blog posts Meor’s book: visual introduction to deep learning

Links:  

A Visual Introduction to Deep Learning by Meor Amer: https://gumroad.com/a/63231091 kDimensions website: https://kdimensions.com/ Book to learn about Figma: https://figmabook.com/ Jack Butcher's approach: https://www.youtube.com/watch?v=azhqc4K-GAE 

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From Math Teacher to Analytics Engineer - Juan Pablo

2022-03-18 Listen
podcast_episode

We talked about:

Juan Pablo's Backround Data engineering resources Teaching calculus Transitioning to Analytics Data Analytics bootcamp Getting money while studying Going to meetups to get a job Looking for uncrowded doors Using LinkedIn Portfolio Talking to people on meetups Eight tips to get your first analytics job Consider contracts and temporary roles Getting experience with non-profits Create your own internship Networking Website for hosting a portfolio I’m a math teacher. What should I learn first? Analytics engineering Best suggestion: keep showing up Networking on online conferences Communication skills and being organized

Links:

Website: https://www.thatjuanpablo.com/ Twitter: https://twitter.com/thatjuanpablo BROKE teacher to FAANG engineer Twitter thread: https://twitter.com/thatjuanpablo/status/1475806246317875203 LinkedIn: https://www.linkedin.com/in/thatjuanpablo/

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

From Data Science to Data Engineering - Ellen König

2022-03-11 Listen
podcast_episode

We talked about:

Ellen’s background Why Ellen switched from data science to data engineering The overlap between data science and data engineering Skills to learn and improve for data engineering Ways to pick up and improve skills (advice for making the transition) What makes a data engineering course “good” Languages to know for data engineering The easiest part of transitioning into data engineering The hardest part of transitioning into data engineering Common data engineering team distributions People who are both data scientists and data engineers Pet projects and other ways to pick up development skills Dealing with cloud processing costs (alerts, billing reports, trial periods) Advice for getting into entry level positions Which cloud platform should data engineers learn?

Links:

Twitter: https://twitter.com/ellen_koenig LinkedIn: https://www.linkedin.com/in/ellenkoenig/

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Becoming a Data Engineering Manager - Rahul Jain

2022-03-04 Listen
podcast_episode
Rahul Jain (Mentoring Club)

We talked about:

Rahul’s background What do data engineering managers do and why do we need them? Balancing engineering and management Rahul’s transition into data engineering management The importance of updating your skill set Planning the transition to manager and other challenges Setting expectations for the team and measuring success Data reconciliation GDPR compliance Data modeling for Big Data Advice for people transitioning into data engineering management Staying on top of trends and enabling team members The qualities of a good data engineering team The qualities of a good data engineer candidate (interview advice) The difference between having knowledge and stuffing a CV with buzzwords Advice for students and fresh graduates An overview of an end-to-end data engineering process

Links:

Rahul's LinkedIn: https://www.linkedin.com/in/16rahuljain/

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

A/B Testing - Jakob Graff

2022-02-25 Listen
podcast_episode
Jakob Graff (Inkitt)

We talked about:

Jakob’s background The importance of A/B tests Statistical noise A/B test example A/B tests vs expert opinion Traffic splitting, A/A tests, and designing experiments Noisy vs stable metrics – test duration and business cycles Z-tests, T-tests, and time series A/B test crash course advice Frequentist approach vs Bayesian approach A/B/C/D tests Pizza dough

Links: 

Jakob's LinkedIn: https://www.linkedin.com/in/jakob-graff-a6113a3a/ Product Analyst role at Inkitt: https://jobs.lever.co/inkitt/d2b0427a-f37f-4002-975d-28bd60b56d70

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html

Machine Learning System Design Interview - Valerii Babushkin

2022-02-18 Listen
podcast_episode

We talked about:

Valerii’s background Who goes through an ML system design interview System design VS ML System design Preparing for ML system design interviews Machine learning project checklist The importance of defining a goal and ways of measuring it What to do after you set a goal Typical components of an ML system Applying ML systems to real-world problems System design and coding in interviews for new graduates Humans in the validation of model performance

Links:

Valerii's telegram channel (in Russian): t.me/cryptovalerii

Join DataTalks.Club: https://datatalks.club/slack.html

Our events: https://datatalks.club/events.html