Dates
Saturday, July 20, 2024 - 10:30am to 4:30pm
Location
Burnaby, Bennett Library, Rm 7010, Research Commons

This workshop is in the past and registrations are unavailable.

Registration dates
closed Friday, July 19, 2024 - 8:00am

If you register and then realize you cannot attend, you must cancel by Sunday, July 14, 2024 - 11:59pm to avoid a $25 non-attendance fee applied to your library account in accordance with SFU Library's Cancellation Policy.

All times are Pacific Time Zone (Vancouver, BC, Canada).

Note: This is an in-person workshop.

Growing amount of data is available over the web. However, this data is usually presented in an unstructured HTML format which poses a challenge to researchers who want to automatically capture the data and convert it into a form appropriate for analysis. Web scraping is a computational method that offers means to meet such challenges. In this workshop you will learn how to scrape unstructured web pages using rvest R package and prepare the captured data for analysis. You will gain some hands-on experience working on a few small projects that underlie common scraping strategies/issues. The last project will include scraping of multiple web pages.

Requirements 

  • Functional knowledge of commonly used base R commands (for an overview see https://www.rstudio.com/wp-content/uploads/2016/05/base-r.pdf)
  • Participants will need to bring their own laptops, with R and RStudio installed prior to attending the workshop
Facilitator(s)
Louis Arsenault-Mahjoubi
Payman Nickchi

Import this workshop into a calendar