
This workshop is in the past and registrations are unavailable.
If you register and then realize you cannot attend, you must cancel by 11:59pm on Sunday, July 14, 2024 to avoid a $25 non-attendance fee, which is applied to your library account in accordance with SFU Library's Cancellation Policy.
All times are in the Pacific Time Zone (Vancouver, BC, Canada).
A growing amount of data is available on the web. However, this data is usually presented as unstructured HTML, which poses a challenge to researchers who want to capture it automatically and convert it into a form suitable for analysis. Web scraping is a computational method for meeting this challenge. In this workshop, you will learn how to scrape unstructured web pages using the rvest R package and prepare the captured data for analysis. You will gain hands-on experience working on a few small projects that illustrate common scraping strategies and issues. The last project involves scraping multiple web pages.
Requirements
