Use dplyr to effectively handle data in R [Online]: 2021-07-21 to 2021-07-22

 


 

Dates
Wednesday, July 21, 2021 - 3:00pm to 4:30pm
Thursday, July 22, 2021 - 3:00pm to 4:30pm
Location
via Zoom (link will be sent to participants 24 hours before the workshop/event begins)

All times are Pacific Time Zone (Vancouver, BC, Canada).

This workshop for intermediate R users will show you how to use dplyr (a package in the tidyverse) to filter, transform, and aggregate your data more effectively. If you've used R before, you know that transforming data in base R into the form your analysis needs often takes complex blocks of code for what should be simple operations. dplyr is designed to express these operations concisely, so that a simple data transformation requires only simple code.
 
This workshop assumes that you are already comfortable in base R. For example, you should be able to: 
  • Import data from a CSV or text file 
  • Extract and create columns in a data frame, and filter rows according to different conditions 
  • Write an R script that can run on its own without manual user intervention.

By the end of the workshop you will be able to: 

  • Use the primary dplyr functions for selecting, mutating, filtering, summarizing, and re-ordering data 
  • Recognize complicated blocks of base R code and simplify them with dplyr 
  • Use tidyr (a tidyverse package) to reshape data between long and wide formats. 

Requirements:

  • Participants will need to have R and RStudio installed on their device prior to attending the workshop
  • Participants should be familiar with R and the RStudio environment, including basic functionality such as object assignment, data structures, and running scripts

Attendance requirement:

You need to attend BOTH days. Each day covers different topics, and the second day builds on material from the first, so if you miss a day, we won't have the resources to help you catch up in this online environment.

 

Facilitator
Matthew McKitrick
Pegah Ziabari
