Date of Award

Summer 2021

Document Type

Thesis

Degree Name

Master of Science (MS)

Department

Mathematical and Statistical Sciences

Program

Applied Statistics

First Advisor

Bansal, Naveen

Second Advisor

Rowe, Daniel

Third Advisor

Sanders, Rebecca

Abstract

Covid-19 is an epidemic disease caused by SARS-Cov-2 virus, which is a type of coronavirus. This virus is highly contiguous, and the confirmed cases of this disease have increased rapidly in a short period. After one month of the first reported case, the World Health Organization (WHO) claims that the Covid-19 will become an international public health emergency. The main purpose of this thesis is to predict the daily confirmed cases of Covid-19 in the midwestern central states in the U.S, by using Autoregression Integrated Moving Average (ARIMA) model and Long Short-Term Memory network (LSTM), which is a type of recurrent neural network. We compare the Root Mean Square Error (RMSE) for the prediction to determine the performance of the two methods. In this thesis, we show that the LSTM network has a smaller prediction RMSE. Also, both models capture the seasonality of the dataset. LSTM captures the trend of the dataset and has a higher prediction than expected values. ARIMA does not capture the trend of the dataset and will have a larger range. Therefore, we can conclude that LSTM is a better method for predicting daily confirmed cases of Covid-19 in the Midwestern central states in the U.S.

COinS