Growing amount of data is available over the web. However, this data is usually presented in an unstructured HTML format which poses a challenge to researchers who want to automatically capture the data and convert it into a form appropriate for analysis. Web scraping is a computational method that offers means to meet such challenges. In this workshop you will learn how to scrape unstructured web pages using rvest R package and prepare the captured data for analysis. You will gain some hands-on experience working on a few small projects that underlie common scraping strategies/issues. The last project will include scraping of multiple web pages.
Prerequisites: Functional knowledge of commonly used base R commands (for an overview see https://www.rstudio.com/wp-content/uploads/2016/05/base-r.pdf).
|Web Scraping in R : 2019-11-22||Burnaby, Bennett Library, Rm 7010, Research Commons||Friday, November 22, 2019 - 9:30am to 4:30pm|