Extract valuable data from blog posts, news sites, social networks, wikipedia, and more! Web scraping is a powerful tool for pulling data that’s locked away in web content.
In this two hour workshop we’ll cover the basic technologies involved in web scraping: HTML and CSS. Learn how HTML gives meaning and structure to data, and how we can get under the hood of popular websites to use that structure as a way to identify key data values. Then, craft powerful CSS selectors as a notation to identify the component parts of a web page that we’ll use to scrape data.
Please bring a laptop computer for your own use during the workshop. No specialized software is required to participate in this workshop.