Pragmatic Datafication: data cleaning, web scraping, Twitter gathering, and parsing


Preexisting, clean data sets such as the General Social Survey (GSS) or Census data are readily available, cover long periods of time, and have well-documented codebooks. Increasingly, however, researchers want to gather their own data from websites, which introduces a different layer of complexity: web data is often only semi-structured. This series shows how to use easily accessible tools to impose structure on that semi-structured data.


Series Dates: 2017-2018

Slides are divided into the following sections: