Data for the project was collected from the following online resources:

  1. Spanish flu (Chicago): www.Kaggle.com

Description: This dataset contains information on 8034 mortality cases, with “Unique Point Identifier (fid)”, case type as pneumonia (1) or flu/unknown(0), point-location of each death(x(m) and y(m)), and week when death was recorded (as of 9/28/1918), for all the cases. Data recorded was for 7 weeks, which was during the peak of the disease.

  1. SARS 2003 (Global + U.S.): WHO.int/csr/sars/

Description: Data collected from March 17, 2003 to April 2, 2003 on cumulative number of cases, number of deaths, number recovered, and countries for the given date. The tidy global dataset contains 96 rows of daily activity, and 5. The tidy U.S. dataset contains 94 rows of daily activity, and 8.

  1. H1N1 + Influenza B Cases (Global + U.S.): apps.who.int/flumart

Description: Flu data is available by a country or region, with range of years (starting with 1995) and range of weeks selected. It includes weekly information on the type of flu virus (A or B) detected by subtypes of five in group A and 3 in group B, total number of positive cases, total of flu-negative cases, and ILI (Influenza like Illness). ILI activity is recorded as regional, local or sporadic outbreak, with “no report” for many countries. We normalized the datasets to contain variables of pandemic duration, number of new cases, and number of cumulative cases. The tidy dataset for global H1N1 cases contains 79 rows and 3 columns. The dataset for U.S. H1N1 cases contains 52 rows and 3 columns. The dataset for global Inlfuenza B cases contains 79 rows and 3 columns. The dataset for Inlfuenza B cases in the U.S. contains 52 rows and 3 columns.

  1. H1N1 Deaths (Global + U.S.): www.Kaggle.com

Description: This dataset contains information about new and cumulative number of deaths resulting from the H1N1 pandemic from January 2009 to August 2010. We normalized the datasets to contain variables of pandemic duration, number of new deaths, and number of cumulative deaths. The tidy dataset for global H1N1 deaths contains 50 rows and 3 columns. The tidy dataset for U.S. H1N1 deaths contains 50 rows and 3 columns.

  1. COVID-19 (Global): covid19.who.int/table

Description: This data by World Health Organization provides information on total cumulative cases of COVID-19, newly reported cases in last 24hrs, total deaths and deaths in last 24 hrs, and method of transmission (community, cluster, sporadic or no cases) for a country or a region. The dataset used was taken from January 3, 2020 until November 10, 2020 and was normalized for the variables of pandemic duration, number of new case, number of cumulative cases, number of new deaths, and number of cumulative deaths. The tidy dataset contains 313 rows and 5 columns

  1. COVID-19 Cases (USA): data.cdc.gov

Description: The original dataset contained updated daily case counts of COVID-19 with many variables such as current status, sex, age, and ethnicity. The tidy dataset contains the variables pandemic duration, number of new cases, and number of cumulative cases and is comprised of 321 rows and 3 columns.

  1. COVID-19 Deaths (USA): data.cdc.gov

Description: This dataset contains updated weekly death counts attributed to COVID-19. The tidy dataset contains the variables pandemic duration, number of new cases, and number of cumulative cases and is comprised of 44 rows and 3 columns.

  1. COVID-19 (Chicago): data.cityofchicago.org

Description: This dataset included 7 rows of dates and 2 columns, date starting from 3/1/ 2020 to 11/22/2020. Data was collected for total cases, total deaths and total hospitalizations for each day, and stratified by age groups, sex, and ethnicity for all the cases, deaths and hospitalizations.

  1. \(R_0\) and other statistical summaries: thelancet.com

Description: This is an article with multiple tables, with table 1 containing information on Basic Reproductive Rates of various prior pandemics and Covid-19. Data-scraping was used to extract the relevant table and information on it.

  1. Country Spatial Polygon Data: GitHub repo

Description: This is a .JSON file containing spatial polygon data for 180 countries. Countries are identifiable by name or 3 letter country code.