Volume 28, Issue 8 p. 1375-1377
Perspective
Free Access

A Primer on COVID-19 Mathematical Models

Diana M. Thomas

Corresponding Author

Diana M. Thomas

Department of Mathematical Sciences, United States Military Academy, West Point, New York, USA

Correspondence: Diana M. Thomas ([email protected])

Search for more papers by this author
Rodney Sturdivant

Rodney Sturdivant

Henry M. Jackson Foundation for the Advancement of Military Medicine, Bethesda, Maryland, USA

Search for more papers by this author
Nikhil V. Dhurandhar

Nikhil V. Dhurandhar

Department of Nutritional Sciences, Texas Tech University, Lubbock, Texas, USA

Search for more papers by this author
Swati Debroy

Swati Debroy

Department of Mathematics, University of South Carolina, Beaufort, Bluffton, South Carolina, USA

Search for more papers by this author
Nicholas Clark

Nicholas Clark

Department of Mathematical Sciences, United States Military Academy, West Point, New York, USA

Search for more papers by this author
First published: 09 May 2020
Citations: 27
Disclosure: The authors declared no conflict of interest.

The emergence of severe acute respiratory syndrome coronavirus 2 (or coronavirus disease [COVID]-19) has led to a widespread global pandemic ((1)). COVID-19 symptoms and mortality are disproportionately more severe in people with obesity and obesity-related comorbidities ((2, 3)). This is of concern for the United States, where ~42% have obesity, and of these, 85% have type 2 diabetes.

Without effective vaccines or therapies, there is an urgent need for prevention and treatment strategies for COVID-19 infections. Developing such strategies requires a thorough understanding of how COVID-19 infections spread in a population. Mathematical modeling predicts the spread of COVID-19 and permits testing different strategies to control and reduce population disease impacts.

The New York Times published COVID-19 predictions from five mathematical models with widely varying results ((4)). A recent review ((5)) also highlighted model differences and noted that a fluid situation dependent on many external variables should not be viewed as a “crystal ball.” Why do COVID-19 models yield such variable predictions? How are the different COVID-19 models developed? How should models be applied? Here, we provide a brief tutorial addressing these questions.

The majority of models fall into the following two categories, which are applied for different purposes: projections versus statistical forecasts. Projections are deterministic and they explain what could happen under a set of underlying hypotheses, while statistical forecasts use observed data to predict what will happen ((6)).

Dynamic Projection Models

In 1927, Kermack and McKendrick ((7)) developed the first continuous variable projection model of epidemic population dynamics, often referred to as a “susceptible, infected, and removed” (SIR) model. They compartmentalized a constant population into three states. The first state represents individuals susceptible to the disease with the number of susceptible individuals on day urn:x-wiley:19307381:media:oby22881:oby22881-math-0001 of the epidemic, denoted by urn:x-wiley:19307381:media:oby22881:oby22881-math-0002 The second state is the number of infected individuals on day urn:x-wiley:19307381:media:oby22881:oby22881-math-0003 of the epidemic, denoted by urn:x-wiley:19307381:media:oby22881:oby22881-math-0004 Finally, individuals removed from the infectious disease dynamics through death or recovery with immunity on day urn:x-wiley:19307381:media:oby22881:oby22881-math-0005 of the epidemic are denoted by urn:x-wiley:19307381:media:oby22881:oby22881-math-0006

R models assume susceptible individuals contract the infection by interacting with infected individuals. The term that models this interaction is represented as a proportion of the product of the susceptible and infected: urn:x-wiley:19307381:media:oby22881:oby22881-math-0007. If the mortality/recovery rate is modeled as a direct proportion of infected individuals, we arrive at the final Kermack-McKendrick Model. This flow diagram is depicted in Figure 1, with mathematical formulation as a system of three ordinary differential equations:
urn:x-wiley:19307381:media:oby22881:oby22881-math-0008
urn:x-wiley:19307381:media:oby22881:oby22881-math-0009
urn:x-wiley:19307381:media:oby22881:oby22881-math-0010
Details are in the caption following the image
Transition from the three possible states, susceptible urn:x-wiley:19307381:media:oby22881:oby22881-math-0011 infected, urn:x-wiley:19307381:media:oby22881:oby22881-math-0012, and removed urn:x-wiley:19307381:media:oby22881:oby22881-math-0013 The proportion of interactions between individuals in urn:x-wiley:19307381:media:oby22881:oby22881-math-0014 and urn:x-wiley:19307381:media:oby22881:oby22881-math-0015 that lead to infection per unit time is urn:x-wiley:19307381:media:oby22881:oby22881-math-0016, and the proportion of infected individuals that recover with immunity or die from disease per unit time is urn:x-wiley:19307381:media:oby22881:oby22881-math-0017. The total number of individuals within the system remains constant throughout calculation.

The key property of projection models like the Kermack-McKendrick system is that they are based on logical assumptions of the underlying mechanics of a process and they can be developed without data to immediately address “what if” questions.

It is from SIR-like models that we can see the effects of social distancing on flattening the curve. SIR models, however, are sensitive to underlying model assumptions. When these assumptions are modified, projections sometimes change dramatically. For example, we could assume there is a time lag to infection that accounts for the virus incubation period by modifying the term urn:x-wiley:19307381:media:oby22881:oby22881-math-0018 to urn:x-wiley:19307381:media:oby22881:oby22881-math-0019. The transmission rate, urn:x-wiley:19307381:media:oby22881:oby22881-math-0020, could also be assumed dependent on the currently observed infected population. Additionally, instead of assuming urn:x-wiley:19307381:media:oby22881:oby22881-math-0021 is constant, we could model urn:x-wiley:19307381:media:oby22881:oby22881-math-0022 as a function of urn:x-wiley:19307381:media:oby22881:oby22881-math-0023 by using the Hill function ((8)) as follows:
urn:x-wiley:19307381:media:oby22881:oby22881-math-0024

Here, r declines as the number of infected individuals gets higher, reflecting increased social distancing during peak infectivity. On the other hand, urn:x-wiley:19307381:media:oby22881:oby22881-math-0025 increases when the number of infected individuals decrease. The values urn:x-wiley:19307381:media:oby22881:oby22881-math-0026, K, and n are parameters that could be fit to data once available. The changes in assumptions alter the projections as depicted in Figure 2.

Details are in the caption following the image
Varying assumptions can alter the dynamic projection model outcomes. Assuming there is a time lag shifts the curve. Including social distancing that increases with increased infective populations and decreases with declining infected populations “flattens the curve” and results in an asymmetric projection. The parameter values were set with total population set as 1,000 individuals, urn:x-wiley:19307381:media:oby22881:oby22881-math-0027. For the time lag model, the time lag for infection was set at urn:x-wiley:19307381:media:oby22881:oby22881-math-0028 days and the time lag to mortality at 3 days. For the model where urn:x-wiley:19307381:media:oby22881:oby22881-math-0029 is represented by the Hill function, urn:x-wiley:19307381:media:oby22881:oby22881-math-0030 and urn:x-wiley:19307381:media:oby22881:oby22881-math-0031.

Once data is fit to model parameters, SIR models can predict when infections peak or how high the peak may be, but as pointed out in Jewell et al. ((5)) and observed in Figure 2, these are dependent on the underlying model assumptions.

Forecasting Models

Forecasting models are more useful after data has been gathered. For instance, most SIR COVID-19 models use fixed parameters, which lead to a constant referred to as urn:x-wiley:19307381:media:oby22881:oby22881-math-0032 or “ urn:x-wiley:19307381:media:oby22881:oby22881-math-0033 naught.” urn:x-wiley:19307381:media:oby22881:oby22881-math-0034 is defined as the number of secondary infections that will result from a single infected individual being introduced into a completely susceptible population. A calculated reproduction number during an ongoing epidemic is the “effective” reproduction number, urn:x-wiley:19307381:media:oby22881:oby22881-math-0035, which is the observed average number of secondary cases from a single primary case per unit time. urn:x-wiley:19307381:media:oby22881:oby22881-math-0036 accounts for changes to urn:x-wiley:19307381:media:oby22881:oby22881-math-0037 through control measures such as social distancing.

Within the SIR framework, urn:x-wiley:19307381:media:oby22881:oby22881-math-0038, where urn:x-wiley:19307381:media:oby22881:oby22881-math-0039 is the time to recover and urn:x-wiley:19307381:media:oby22881:oby22881-math-0040 is the population size. Absent of interventions, the estimated urn:x-wiley:19307381:media:oby22881:oby22881-math-0041 of COVID-19 is between 1.5 and 6.7 ((9)). However, this value is not constant but changing daily. Renewal equations allow us to estimate values ((10)) of urn:x-wiley:19307381:media:oby22881:oby22881-math-0042 that can be fit to a statistical model. Similar to Massad et al. ((6)), we fit an exponential decay model to New York’s data yielding urn:x-wiley:19307381:media:oby22881:oby22881-math-0043, though other statistical models such as one with an asymptote could be considered. Figure 3 depicts the fitted effective reproduction number versus the calculated urn:x-wiley:19307381:media:oby22881:oby22881-math-0044 values for New York. This forecast model will inform improved SIR model projections and generate prediction intervals. Projection models for the SARS-1 epidemic overestimate urn:x-wiley:19307381:media:oby22881:oby22881-math-0045 compared with statistical forecasting estimates of reproduction numbers ((6)). It is important that researchers use projection models for hypothetical scenario development and statistical forecasting models for forecasting, which are two different goals.

Details are in the caption following the image
The effective reproduction number urn:x-wiley:19307381:media:oby22881:oby22881-math-0046, sometimes referred to as urn:x-wiley:19307381:media:oby22881:oby22881-math-0047 or “R naught,” for New York fit to data with an exponential curve. urn:x-wiley:19307381:media:oby22881:oby22881-math-0048 represents the average number of secondary cases because of one infected person at the beginning of the epidemic, and in order for the epidemic to decline, urn:x-wiley:19307381:media:oby22881:oby22881-math-0049 should be less than 1. The points are the calculated daily effective reproduction number calculated by number of observed cases divided by the number of expected cases on a given day, while the gray curve forecasts future effective reproduction numbers that can be used in dynamic projection SIR models. Here, urn:x-wiley:19307381:media:oby22881:oby22881-math-0050 went below 1 (represented by the dashed line) around 28 days.

Including the differential effects of COVID-19 on individuals with obesity into both projection and forecasting models by separating each compartment into normal weight and obesity populations will be key to understanding the population-wide impacts of COVID-19 on obesity. Obesity researchers can apply both types of models to evaluate prevention and treatment strategies for COVID-19 in persons with obesity.

    The full text of this article hosted at iucr.org is unavailable due to technical difficulties.