AGENDA

  • 5 June 2020
  • 11:00-12:00

Measuring Progress Learning Track Webscraping Tutorial


On 5 June, tune into a practical tutorial on web scraping from the GLF Measuring Progress Learning Track coordination team. The tutorial will last about 35 minutes, during which time participants can ask questions using Whova’s Q&A function. Afterwards, there will be a live Q&A based on questions curated during the tutorial.

Tutorial content and objectives
Web scraping is a method of extracting and restructuring information from web pages. Given the enormous quantity of unstructured data that is now populating the internet, web scraping is an essential skill for making sense of the world as we enter an information-led epoch. This tutorial will introduce the concept and then move on to basic techniques for web scraping using R, which is the most common programming language for statistical analysis. By the end of the tutorial, participants will be able to harvest unstructured text data from any website of their choosing and then perform simple analytics on the data they have collected. The script will be available for download on GitHub.