talk-data.com talk-data.com

Filter by Source

Select conferences and events

Activities & events

Title & Speakers Event
TidyTuesday 2026-01-27 · 23:00

Join R-Ladies Ottawa for a casual evening of programming on Tuesday, January 27th. We'll be participating in TidyTuesday, a weekly data visualization challenge organized by the R for Data Science community.

What is TidyTuesday?

Every week, a new dataset is posted online on the TidyTuesday GitHub repo, and folks from around the world create data visualizations using the dataset. It's an opportunity to put your programming skills into practice using real-world data in a way that's fun! It's also a great way for everyone to learn from each other, by sharing their visualizations and code.

What will the dataset be?

Even we don't know that (yet)! We'll have to wait until the day before the event to know what data we'll be working with. If you're interested in seeing some past datasets, take a look at the examples below, or visit the TidyTuesday GitHub repo to see all of the datasets dating back to 2018.

Examples from past TidyTuesdays:

Do I have to use R?

No! You can use any programming language or visualization software that you want. In fact, Python users from around the globe participate in "TyDyTuesday" on a weekly basis.

Who is this event for?

No previous programming experience is required to participate, and we'll have experienced programmers in the room who can help you get started (or unstuck), if needed.

...But if you want to get the most out of the event, a good way to prepare is to watch the recording of the introduction to data visualization workshop we hosted back in 2024. :)

What should I bring?

  • Please bring a laptop so you can code along. We recommend that you have RStudio or another IDE (such as VS Code or Positron) installed ahead of time, but we can help you get one installed if needed!
  • Come ready to learn, share, and contribute to a safe and welcoming community!

How will this event work?

  • First few minutes of the event: Introductions, and taking a look at the dataset together as a group.
  • Time to create a data visualization using the language or software of your choice, either on your own or with a (new) friend! Grab a free snack while you're at it :)
  • Last \~30 minutes of the event: Show and tell session for anyone who would like to share their creation with the group.

What else do I need to know?

This event (like all R-Ladies events) is totally FREE to attend.

The event will take place at Bayview Yards, which is located just a few steps away from the Bayview O-Train station. There is also a free parking lot available for those who are driving. You can find us in the "Training Room", which is on the second floor of the Bayview Yards building.

This is an in-person event with limited space! Please only RSVP if you are able to attend in-person!

***Please note that the mission of R-Ladies is to increase gender diversity in the R community. This event is intended to provide a safe space for women and gender minorities. We ask for male allies to be invited by and accompanied by a woman or gender minority.***

We’re grateful to be part of the Bayview Meetups initiative and extend our thanks to Bayview Yards for generously providing the venue space.

TidyTuesday
Introduction to R: Part 2 2025-12-03 · 17:00

Continuation of the Introduction to R workshop series. This session covers data processing with dplyr, data visualization with ggplot2, and concepts such as loops, functions, and conditional statements. Participants are encouraged to bring their own data and follow along.

r dplyr ggplot2
Workshop - Introduction to R (Part 2)
Introduction to R 2025-10-29 · 17:00

1.5-hour introductory workshop on the R programming language. Topics covered include: R as a calculator; basic data types and structures; reading data into R and exporting for external purposes; basic data processing; basic data visualization; loops, functions, and conditional statements.

r rstudio
Workshop - Introduction to R

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/1417304925933/WN_ExHqDQGkSTqiLb0etd4bfg

Workshop: Medical AI and Introduction to Predictive Modeling For Clinical Decision Making

Agenda:

(PST) 9:00 am - 9:05 am Arrival, socializing, and Opening (PST) 9:05 am - 10:00 am Dr. Yasin Ceran, "Medical AI and Introduction to Predictive Modeling For Clinical Decision Making" (PST) 10:00 am - 10:05 am Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/1417304925933/WN_ExHqDQGkSTqiLb0etd4bfg

Medical AI and Introduction to Predictive Modeling For Clinical Decision Making

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/1417304925933/WN_ExHqDQGkSTqiLb0etd4bfg

Workshop: Medical AI and Introduction to Predictive Modeling For Clinical Decision Making

Agenda:

(PST) 9:00 am - 9:05 am Arrival, socializing, and Opening (PST) 9:05 am - 10:00 am Dr. Yasin Ceran, "Medical AI and Introduction to Predictive Modeling For Clinical Decision Making" (PST) 10:00 am - 10:05 am Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/1417304925933/WN_ExHqDQGkSTqiLb0etd4bfg

Medical AI and Introduction to Predictive Modeling For Clinical Decision Making

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/1417304925933/WN_ExHqDQGkSTqiLb0etd4bfg

Workshop: Medical AI and Introduction to Predictive Modeling For Clinical Decision Making

Agenda:

(PST) 9:00 am - 9:05 am Arrival, socializing, and Opening (PST) 9:05 am - 10:00 am Dr. Yasin Ceran, "Medical AI and Introduction to Predictive Modeling For Clinical Decision Making" (PST) 10:00 am - 10:05 am Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/1417304925933/WN_ExHqDQGkSTqiLb0etd4bfg

Medical AI and Introduction to Predictive Modeling For Clinical Decision Making

Join us for the R-Ladies New Delhi Chapter Meetup! This session is designed to introduce new members and attendees to the R programming language and the mission of R-Ladies, a global organization promoting gender diversity in the R community. The event will feature:

  1. Introduction to R-Ladies and R Programming: Learn about the community and how R can empower your data analysis skills.
  2. Hands-on Workshop: Dive into data visualization using popular R libraries.

What to Bring:

  1. A laptop (optional but encouraged for the hands-on session).
  2. Snacks: Light snacks will be provided, courtesy of the Cluster Innovation Centre, University of Delhi. This is a great opportunity to learn, network, and explore the potential of R. Beginners and experienced R users alike are welcome! For any questions, contact Kritika Verma at [email protected]. We look forward to seeing you there!
Visualizing Data with R: An R-Ladies Introduction

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Join us for a hands-on, two-week workshop to master data manipulation with Python's pandas library. This workshop is perfect for students, data enthusiasts, and professionals looking to enhance their data analysis skills. Basic knowledge of Python is recommended.

Workshop Outline:

Week 1: Introduction and Basic Data Manipulation Session 1: Pandas basics, Series, and DataFrame structures, loading/saving data, data selection, and filtering. Session 2: Handling missing data, data transformation, managing duplicates, and combining DataFrames. Week 2: Advanced Techniques and Visualization Session 3: Grouping and aggregation, pivot tables, cross-tabulation, and working with time series data. Session 4: Data visualization with pandas and other libraries

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 11:50 am Dr. Yasin Ceran, "Data Manipulation with Pandas" (PDT) 11:50 am - 12:00 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Webinar Passcode 953375

Data Manipulation with Pandas

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Join us for a hands-on, two-week workshop to master data manipulation with Python's pandas library. This workshop is perfect for students, data enthusiasts, and professionals looking to enhance their data analysis skills. Basic knowledge of Python is recommended.

Workshop Outline:

Week 1: Introduction and Basic Data Manipulation Session 1: Pandas basics, Series, and DataFrame structures, loading/saving data, data selection, and filtering. Session 2: Handling missing data, data transformation, managing duplicates, and combining DataFrames. Week 2: Advanced Techniques and Visualization Session 3: Grouping and aggregation, pivot tables, cross-tabulation, and working with time series data. Session 4: Data visualization with pandas and other libraries

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 11:50 am Dr. Yasin Ceran, "Data Manipulation with Pandas" (PDT) 11:50 am - 12:00 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Webinar Passcode 953375

Data Manipulation with Pandas

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Join us for a hands-on, two-week workshop to master data manipulation with Python's pandas library. This workshop is perfect for students, data enthusiasts, and professionals looking to enhance their data analysis skills. Basic knowledge of Python is recommended.

Workshop Outline:

Week 1: Introduction and Basic Data Manipulation Session 1: Pandas basics, Series, and DataFrame structures, loading/saving data, data selection, and filtering. Session 2: Handling missing data, data transformation, managing duplicates, and combining DataFrames. Week 2: Advanced Techniques and Visualization Session 3: Grouping and aggregation, pivot tables, cross-tabulation, and working with time series data. Session 4: Data visualization with pandas and other libraries

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 11:50 am Dr. Yasin Ceran, "Data Manipulation with Pandas" (PDT) 11:50 am - 12:00 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Webinar Passcode 953375

Data Manipulation with Pandas

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Join us for a hands-on, two-week workshop to master data manipulation with Python's pandas library. This workshop is perfect for students, data enthusiasts, and professionals looking to enhance their data analysis skills. Basic knowledge of Python is recommended.

Workshop Outline:

Week 1: Introduction and Basic Data Manipulation Session 1: Pandas basics, Series, and DataFrame structures, loading/saving data, data selection, and filtering. Session 2: Handling missing data, data transformation, managing duplicates, and combining DataFrames. Week 2: Advanced Techniques and Visualization Session 3: Grouping and aggregation, pivot tables, cross-tabulation, and working with time series data. Session 4: Data visualization with pandas and other libraries

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 11:50 am Dr. Yasin Ceran, "Data Manipulation with Pandas" (PDT) 11:50 am - 12:00 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Webinar Passcode 953375

Data Manipulation with Pandas

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Join us for a hands-on, two-week workshop to master data manipulation with Python's pandas library. This workshop is perfect for students, data enthusiasts, and professionals looking to enhance their data analysis skills. Basic knowledge of Python is recommended.

Workshop Outline:

Week 1: Introduction and Basic Data Manipulation Session 1: Pandas basics, Series, and DataFrame structures, loading/saving data, data selection, and filtering. Session 2: Handling missing data, data transformation, managing duplicates, and combining DataFrames. Week 2: Advanced Techniques and Visualization Session 3: Grouping and aggregation, pivot tables, cross-tabulation, and working with time series data. Session 4: Data visualization with pandas and other libraries

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 11:50 am Dr. Yasin Ceran, "Data Manipulation with Pandas" (PDT) 11:50 am - 12:00 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/7817214087648/WN_wWxCYMFhSJmpWy-hTzC2xw

Webinar Passcode 953375

Data Manipulation with Pandas

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Time series forecasting plays a crucial role in machine learning, requiring special attention to capture the impact of time-related components such as trends and seasonality. Join us in this immersive workshop where we will delve into the depths of time series analysis and forecasting using Python.

Workshop Curriculum:

Module 1: Introduction to Time Series Analysis Module 2: Exploratory Data Analysis for Time Series Module 3: Traditional Time Series Forecasting Models Module 4: Advanced Time Series Forecasting Models Module 5: Model Evaluation and Selection Module 6: Handling Seasonality and Trends

Prerequisites: * Familiarity with the Jupiter Notebooks interface * Basic understanding of foundational statistical techniques * Proficiency in data reading and preprocessing tasks, including data cleaning using Python

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 12:00 pm Dr. Yasin Ceran, "Mastering Time Series Forecasting with Python A Comprehensive Workshop on Forecast Models and Techniques" (PDT) 12:00 pm - 12:10 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Webinar Passcode 325997

Mastering Time Series Forecasting with Python A Comprehensive Workshop

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Time series forecasting plays a crucial role in machine learning, requiring special attention to capture the impact of time-related components such as trends and seasonality. Join us in this immersive workshop where we will delve into the depths of time series analysis and forecasting using Python.

Workshop Curriculum:

Module 1: Introduction to Time Series Analysis Module 2: Exploratory Data Analysis for Time Series Module 3: Traditional Time Series Forecasting Models Module 4: Advanced Time Series Forecasting Models Module 5: Model Evaluation and Selection Module 6: Handling Seasonality and Trends

Prerequisites: * Familiarity with the Jupiter Notebooks interface * Basic understanding of foundational statistical techniques * Proficiency in data reading and preprocessing tasks, including data cleaning using Python

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 12:00 pm Dr. Yasin Ceran, "Mastering Time Series Forecasting with Python A Comprehensive Workshop on Forecast Models and Techniques" (PDT) 12:00 pm - 12:10 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Webinar Passcode 325997

Mastering Time Series Forecasting with Python A Comprehensive Workshop

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Time series forecasting plays a crucial role in machine learning, requiring special attention to capture the impact of time-related components such as trends and seasonality. Join us in this immersive workshop where we will delve into the depths of time series analysis and forecasting using Python.

Workshop Curriculum:

Module 1: Introduction to Time Series Analysis Module 2: Exploratory Data Analysis for Time Series Module 3: Traditional Time Series Forecasting Models Module 4: Advanced Time Series Forecasting Models Module 5: Model Evaluation and Selection Module 6: Handling Seasonality and Trends

Prerequisites: * Familiarity with the Jupiter Notebooks interface * Basic understanding of foundational statistical techniques * Proficiency in data reading and preprocessing tasks, including data cleaning using Python

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 12:00 pm Dr. Yasin Ceran, "Mastering Time Series Forecasting with Python A Comprehensive Workshop on Forecast Models and Techniques" (PDT) 12:00 pm - 12:10 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Webinar Passcode 325997

Mastering Time Series Forecasting with Python A Comprehensive Workshop

Please register using the zoom link to get a reminder:

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Time series forecasting plays a crucial role in machine learning, requiring special attention to capture the impact of time-related components such as trends and seasonality. Join us in this immersive workshop where we will delve into the depths of time series analysis and forecasting using Python.

Workshop Curriculum:

Module 1: Introduction to Time Series Analysis Module 2: Exploratory Data Analysis for Time Series Module 3: Traditional Time Series Forecasting Models Module 4: Advanced Time Series Forecasting Models Module 5: Model Evaluation and Selection Module 6: Handling Seasonality and Trends

Prerequisites: * Familiarity with the Jupiter Notebooks interface * Basic understanding of foundational statistical techniques * Proficiency in data reading and preprocessing tasks, including data cleaning using Python

Agenda:

(PDT) 10:00 am - 10:05 am Arrival, socializing, and Opening (PDT) 10:05 am - 12:00 pm Dr. Yasin Ceran, "Mastering Time Series Forecasting with Python A Comprehensive Workshop on Forecast Models and Techniques" (PDT) 12:00 pm - 12:10 pm Q&A

About Dr. Yasin Ceran:

Yasin Ceran is passionate about all things data and holds a vast experience in data analysis, mathematical modeling and Apache Spark, and in SQL, Python and R. He is currently an associate professor at KAIST, South Korea, as well as teaching at San Jose State University at the heart of Silicon Valley. Yasin has worked rigorously on an array of data-related projects encompassing data mining, statistics, modeling, and is dedicated to sharing his experience and expertise with learners.

https://us02web.zoom.us/webinar/register/9617152905708/WN_Nv0aUvB1T-W6oQ22HtSjZA

Webinar Passcode 325997

Mastering Time Series Forecasting with Python A Comprehensive Workshop
Introduction to R 2024-05-01 · 16:30

Do you want to work with R but have no idea where to start? Are you intimidated by the idea of learning a programming language? Have you always been interested in R but found the amount of online resources overwhelming?

R-ladies Amsterdam is hosting a beginner's event to give you a gentle introduction to the R programming language. This event will be an in-person workshop at the University of Amsterdam. We will explain some basic concepts of R and run code together. You get the opportunity to ask questions and work together with other people who are learning R. We will also provide some snacks :)

Itinerary: 18.30 - 18.45 Welcome 18.45 - 20.30 Workshop 20.30 - 21.00 Networking

We kindly ask you to bring your own laptop and install R and RStudio before you arrive! You can find instructions on how to do so here: https://posit.co/download/rstudio-desktop/

Introduction to R

In a previous Meetup our own Mari Plaza has presented an introduction of Golem which allows you for sure to write a ShinyApp with the production ready mind-set. That means you write a ShinyApp like a package, including all relevant components of Software Engineering such modules, tests, dependencies, git, code coverage and so on therefore your Web Application developed in R like a ShinyApp has the correct architecture to go to production. This time is not an introduction anymore, it is a 3 hours workshop, starting with 1 hour presentation about how you plan your application and 2 hours practical experience on how you use Golem.

Bring your computer and be ready to ask questions and test by yourself what we are talking about.

Since the content is interesting and dense, you could join us in the preview activities starting at the end of February 2024, but you can always catch-up, even on the day of the event. The preview activities includes:

  • Join our SLACK dedicated channel for discussing the concepts and review related materials.
  • Follow posts in our LinkedIn accounts
  • Regular updates through the messages of this event on MeetUp.

We are really happy to host this event with care and willingness to learn more about the great world of software engineering.

Is ShinyApp in R Production ready?

As R users, we often read in data directly from files, such as .csv files. However, developing an understanding of SQL gives us an edge when working with big data, as we can connect to databases directly from R, pulling in only the bits that we need for analysis and saving memory. SQL can also be used to efficiently perform simple data manipulation tasks like sorting, filtering, and aggregation on large datasets, which would take significant computational resources in R.

In this RLadies workshop we will run through some basic SQL queries that can be used to retrieve and filter tables from databases, and learn a little bit about how relational databases work. We will then look at how to write and run SQL queries directly from R, and some use cases for this.

In order to fully participate in this workshop, please make sure you have R and RStudio installed on your laptop, and make sure you have the package ‘DBI’ installed by running `install.packages("DBI")` before the workshop. We will use this package to connect to a SQL database via R.

After the workshop, we will have ample time for networking and chatting over some snacks and drinks (courtesy of the Software Sustainability Institute).

Chloe Brook is a Data Science Applications Consultant at EPCC, University of Edinburgh. Having studied Biochemistry at university, she is keen to enthuse others who haven’t come directly from computer science backgrounds about the joys of coding and data. She is also an online tutor alongside her day job, and enjoys encouraging people to believe in their ability to learn and grow!

Database-ics: an introduction to SQL for R users

To complete topics that we did not manage to fit into the first workshop, we will host the second part of the Introduction to R Workshop in our next Meetup. This time we will cover:

  • Reading in data
  • Basic data visualization
  • R packages
  • R Markdown for writing reports

The workshop is open to everyone, in particular also if you did not make it to the first part! To catch up you can look at the first few sections of the workshop material prepared by Pia: Intro to R (up to Data Structures) or you could read through the Getting Started section of https://moderndive.netlify.app/1-getting-started.html

As before, we recommend you bring your own laptop where you have R and RStudio installed (https://cran.r-project.org/ and https://www.rstudio.com/products/rstudio/download/). In case you want to install Rmarkdown you can follow the instructions at https://bookdown.org/yihui/rmarkdown/installation.html

Workshop - Introduction to R - Part 2