Data Wrangling and Preprocessing With R [Online]

About the workshop

This workshop for intermediate R users will show you how to leverage dplyr (a package in tidyverse) to more effectively filter, transform, and aggregate your data. If you've used R before, then you know that getting base R to transform data into a form needed for your analysis is usually a challenge, with complex blocks of code to perform what should be simple operations. Luckily for us, dplyr is designed to help easily express these operations so that what is essentially a simple data transformation only requires simple code.
 
This workshop assumes that you are already comfortable in base R. For example, you should be able to: 
  • Import data from a CSV or text file 
  • Extract and create columns in a data frame, and filter rows according to different conditions 
  • Write an R script that can run on its own without manual user intervention.

By the end of the workshop you will be able to: 

  • Use the primary dplyr functions for selecting, mutating, filtering, summarizing, and re-ordering data 
  • Recognize existing complicated base R code blocks and simplify them down using dplyr 
  • Use tidyr (a tidyverse package) to easily reshape data both to and from long and wide formats. 
Audience

Upcoming workshops

No upcoming workshops available.