You are here

Web Scraping in R

Growing amount of data is available over the web. However, this data is usually presented in an unstructured HTML format which poses a challenge to researchers who want to automatically capture the data and convert it into a form appropriate for analysis. Web scraping is a computational method that offers means to meet such challenges. In this workshop you will learn how to scrape unstructured web pages using rvest R package and prepare the captured data for analysis. You will gain some hands-on experience working on a few small projects that underlie common scraping strategies/issues. The last project will include scraping of multiple web pages.

Prerequisites: Functional knowledge of commonly used base R commands (for an overview see https://www.rstudio.com/wp-content/uploads/2016/05/base-r.pdf).

Note: Workshop participants will need to bring their own laptops, with R and RStudio installed prior to attending the workshop.

Upcoming Workshops

Title Location Dates
Web Scraping in R : 2019-11-22 Burnaby, Bennett Library, Rm 7010, Research Commons Friday, November 22, 2019 - 9:30am to 4:30pm