This workshop for intermediate R users will show you how to leverage dplyr (a package in tidyverse) to more effectively filter, transform, and aggregate your data. If you've used R before, then you know that getting base R to transform data into a form needed for your analysis is usually a challenge, with complex blocks of code to perform what should be simple operations. Luckily for us, dplyr is designed to help easily express these operations so that what is essentially a simple data transformation only requires simple code.
This workshop assumes that you are already comfortable in base R. For example, you should be able to:
- Import data from a CSV or text file
- Extract and create columns in a data frame, and filter rows according to different conditions
- Write an R script that can run on its own without manual user intervention.
By the end of the workshop you will be able to:
- Use the primary dplyr functions for selecting, mutating, filtering, summarizing, and re-ordering data
- Recognize existing complicated base R code blocks and simplify them down using dplyr
- Use tidyr (a tidyverse package) to easily reshape data both to and from long and wide formats.