Mathematical model
A mathematical model is a description of a system using mathematical concepts and language. The process of developing a mathematical model is termed mathematical modeling. Mathematical models are used in the natural sciences (such as physics, ...
s can project how
infectious diseases
An infection is the invasion of tissues by pathogens, their multiplication, and the reaction of host tissues to the infectious agent and the toxins they produce. An infectious disease, also known as a transmissible disease or communicable dise ...
progress to show the likely outcome of an
epidemic (including
in plants) and help inform
public health and plant health interventions. Models use basic assumptions or collected statistics along with mathematics to find
parameters for various infectious diseases and use those parameters to calculate the effects of different interventions, like mass
vaccination programs. The modelling can help decide which intervention(s) to avoid and which to trial, or can predict future growth patterns, etc.
History
The modelling of infectious diseases is a tool that has been used to study the mechanisms by which diseases spread, to predict the future course of an outbreak and to evaluate strategies to control an epidemic.
The first scientist who systematically tried to quantify
causes of death
The following is a list of the causes of human deaths worldwide for different years arranged by their associated mortality rates. In 2002, there were about 57 million deaths. In 2005, according to the World Health Organization (WHO) using the I ...
was
John Graunt in his book ''Natural and Political Observations made upon the Bills of Mortality'', in 1662. The bills he studied were listings of numbers and causes of deaths published weekly. Graunt's analysis of causes of death is considered the beginning of the "theory of competing risks" which according to Daley and Gani
is "a theory that is now well established among modern epidemiologists".
The earliest account of
mathematical modelling of spread of disease was carried out in 1760 by
Daniel Bernoulli
Daniel Bernoulli FRS (; – 27 March 1782) was a Swiss mathematician and physicist and was one of the many prominent mathematicians in the Bernoulli family from Basel. He is particularly remembered for his applications of mathematics to mechan ...
. Trained as a physician, Bernoulli created a mathematical model to defend the practice of inoculating against
smallpox. The calculations from this model showed that universal inoculation against smallpox would increase the
life expectancy from 26 years 7 months to 29 years 9 months.
Daniel Bernoulli's work preceded the modern understanding of
germ theory
The germ theory of disease is the currently accepted scientific theory for many diseases. It states that microorganisms known as pathogens or "germs" can lead to disease. These small organisms, too small to be seen without magnification, invade ...
.
In the early 20th century, William Hamer and
Ronald Ross applied the
law of mass action to explain epidemic behaviour.
The 1920s saw the emergence of compartmental models. The
Kermack–McKendrick epidemic model (1927) and the
Reed–Frost epidemic model (1928) both describe the relationship between
susceptible, infected and
immune individuals in a population. The Kermack–McKendrick epidemic model was successful in predicting the behavior of outbreaks very similar to that observed in many recorded epidemics.
Recently,
agent-based model
An agent-based model (ABM) is a computational model for simulating the actions and interactions of autonomous agents (both individual or collective entities such as organizations or groups) in order to understand the behavior of a system and wha ...
s (ABMs) have been used in exchange for simpler
compartmental models. For example, epidemiological ABMs have been used to inform public health (nonpharmaceutical) interventions against the spread of
SARS-CoV-2. Epidemiological ABMs, in spite of their complexity and requiring high computational power, have been criticized for simplifying and unrealistic assumptions. Still, they can be useful in informing decisions regarding mitigation and suppression measures in cases when ABMs are accurately calibrated.
Assumptions
Models are only as good as the assumptions on which they are based. If a model makes predictions that are out of line with observed results and the mathematics is correct, the initial assumptions must change to make the model useful.
* Rectangular and stationary
age distribution
A population pyramid (age structure diagram) or "age-sex pyramid" is a graphical illustration of the distribution of a population (typically that of a country or region of the world) by age groups and sex; it typically takes the shape of a pyramid ...
, i.e., everybody in the population lives to age ''L'' and then dies, and for each age (up to ''L'') there is the same number of people in the population. This is often well-justified for developed countries where there is a low infant mortality and much of the population lives to the life expectancy.
* Homogeneous mixing of the population, i.e., individuals of the population under scrutiny assort and
make contact at random and do not mix mostly in a smaller subgroup. This assumption is rarely justified because
social structure is widespread. For example, most people in London only make contact with other Londoners. Further, within London then there are smaller subgroups, such as the Turkish community or teenagers (just to give two examples), who mix with each other more than people outside their group. However, homogeneous mixing is a standard assumption to make the mathematics tractable.
Types of epidemic models
Stochastic
"Stochastic" means being or having a random variable. A stochastic model is a tool for estimating probability distributions of potential outcomes by allowing for random variation in one or more inputs over time. Stochastic models depend on the chance variations in risk of exposure, disease and other illness dynamics. Statistical agent-level disease dissemination in small or large populations can be determined by stochastic methods.
Deterministic
When dealing with large populations, as in the case of tuberculosis, deterministic or compartmental mathematical models are often used. In a deterministic model, individuals in the population are assigned to different subgroups or compartments, each representing a specific stage of the epidemic.
The transition rates from one class to another are mathematically expressed as derivatives, hence the model is formulated using differential equations. While building such models, it must be assumed that the population size in a compartment is differentiable with respect to time and that the epidemic process is deterministic. In other words, the changes in population of a compartment can be calculated using only the history that was used to develop the model.
Sub-exponential growth
A common explanation for the growth of epidemics holds that 1 person infects 2, those 2 infect 4 and so on and so on with the number of infected doubling every generation.
It is analogous to a game of
tag where 1 person tags 2, those 2 tag 4 others who've never been tagged and so on. As this game progresses it becomes increasing frenetic as the tagged run past the previously tagged to hunt down those who have never been tagged.
Thus this model of an epidemic leads to a curve that
grows exponentially until it crashes to zero as all the population have been infected. i.e. no
herd immunity and no peak and gradual decline as seen in reality.
Reproduction number
The ''basic reproduction number'' (denoted by ''R
0'') is a measure of how transferable a disease is. It is the average number of people that a single infectious person will infect over the course of their infection. This quantity determines whether the infection will increase sub-exponentially, die out, or remain constant: if ''R
0'' > 1, then each person on average infects more than one other person so the disease will spread; if ''R
0'' < 1, then each person infects fewer than one person on average so the disease will die out; and if ''R
0'' = 1, then each person will infect on average exactly one other person, so the disease will become ''endemic:'' it will move throughout the population but not increase or decrease.
Endemic steady state
An infectious disease is said to be
endemic when it can be sustained in a population without the need for external inputs. This means that, on average, each infected person is infecting ''exactly'' one other person (any more and the number of people infected will
grow sub-exponentially and there will be an
epidemic, any less and the disease will die out). In mathematical terms, that is:
:
The
basic reproduction number (''R''
0) of the disease, assuming everyone is susceptible, multiplied by the proportion of the population that is actually susceptible (''S'') must be one (since those who are not susceptible do not feature in our calculations as they cannot contract the disease). Notice that this relation means that for a disease to be in the
endemic steady state, the higher the basic reproduction number, the lower the proportion of the population susceptible must be, and vice versa. This expression has limitations concerning the susceptibility proportion, e.g. the ''R''
0 equals 0.5 implicates S has to be 2, however this proportion exceeds the population size.
Assume the rectangular stationary age distribution and let also the ages of infection have the same distribution for each birth year. Let the average age of infection be ''A'', for instance when individuals younger than ''A'' are susceptible and those older than ''A'' are immune (or infectious). Then it can be shown by an easy argument that the proportion of the population that is susceptible is given by:
:
We reiterate that ''L'' is the age at which in this model every individual is assumed to die. But the mathematical definition of the endemic steady state can be rearranged to give:
:
Therefore, due to the
transitive property
In mathematics, a relation on a set is transitive if, for all elements , , in , whenever relates to and to , then also relates to . Each partial order as well as each equivalence relation needs to be transitive.
Definition
A homog ...
:
:
This provides a simple way to estimate the parameter ''R''
0 using easily available data.
For a population with an
exponential age distribution,
:
This allows for the basic reproduction number of a disease given ''A'' and ''L'' in either type of population distribution.
Compartmental models in epidemiology
Compartmental models are formulated as
Markov chains.
A classic compartmental model in epidemiology is the SIR model, which may be used as a simple model for modelling epidemics. Multiple other types of compartmental models are also employed.
The SIR model

In 1927, W. O. Kermack and A. G. McKendrick created a model in which they considered a fixed population with only three compartments: susceptible,
; infected,
; and recovered,
. The compartments used for this model consist of three classes:
*
is used to represent the individuals not yet infected with the disease at time t, or those susceptible to the disease of the population.
*
denotes the individuals of the population who have been infected with the disease and are capable of spreading the disease to those in the susceptible category.
*
is the compartment used for the individuals of the population who have been infected and then removed from the disease, either due to immunization or due to death. Those in this category are not able to be infected again or to transmit the infection to others.
Other compartmental models
There are many modifications of the SIR model, including those that include births and deaths, where upon recovery there is no immunity (SIS model), where immunity lasts only for a short period of time (SIRS), where there is a latent period of the disease where the person is not infectious (
SEIS and SEIR), and where infants can be born with immunity (MSIR).
Infectious disease dynamics
Mathematical models need to integrate the increasing volume of
data being generated on
host
A host is a person responsible for guests at an event or for providing hospitality during it.
Host may also refer to:
Places
* Host, Pennsylvania, a village in Berks County
People
*Jim Host (born 1937), American businessman
* Michel Host ...
-
pathogen interactions. Many theoretical studies of the
population dynamics
Population dynamics is the type of mathematics used to model and study the size and age composition of populations as dynamical systems.
History
Population dynamics has traditionally been the dominant branch of mathematical biology, which has ...
, structure and evolution of
infectious diseases of
plants and animals, including humans, are concerned with this problem.
Research topics include:
*
antigenic shift
Antigenic shift is the process by which two or more different strains of a virus, or strains of two or more different viruses, combine to form a new subtype having a mixture of the surface antigens of the two or more original strains. The term is ...
*
epidemiological networks
*
evolution and spread of
resistance
Resistance may refer to:
Arts, entertainment, and media Comics
* Either of two similarly named but otherwise unrelated comic book series, both published by Wildstorm:
** ''Resistance'' (comics), based on the video game of the same title
** ''T ...
*
immuno-epidemiology
* intra-host dynamics
*
Pandemic
A pandemic () is an epidemic of an infectious disease that has spread across a large region, for instance multiple continents or worldwide, affecting a substantial number of individuals. A widespread endemic (epidemiology), endemic disease wi ...
* pathogen
population genetics
* persistence of pathogens within hosts
*
phylodynamics
* role and identification of
infection reservoir
In infectious disease ecology and epidemiology, a natural reservoir, also known as a disease reservoir or a reservoir of infection, is the population of organisms or the specific environment in which an infectious pathogen naturally lives and re ...
s
* role of host genetic factors
*
spatial epidemiology
* statistical and mathematical tools and innovations
*
Strain (biology)
In biology, a strain is a genetic variant, a subtype or a culture within a biological species. Strains are often seen as inherently artificial concepts, characterized by a specific intent for genetic isolation. This is most easily observed in mi ...
structure and interactions
*
transmission
Transmission may refer to:
Medicine, science and technology
* Power transmission
** Electric power transmission
** Propulsion transmission, technology allowing controlled application of power
*** Automatic transmission
*** Manual transmission
*** ...
, spread and control of infection
*
virulence
Mathematics of mass vaccination
If the proportion of the population that is immune exceeds the
herd immunity level for the disease, then the disease can no longer persist in the population and its transmission dies out. Thus, a disease can be eliminated from a population if enough individuals are immune due to either vaccination or recovery from prior exposure to disease. For example,
smallpox eradication, with the last wild case in 1977, and certification of the eradication of indigenous transmission of 2 of the 3 types of wild
poliovirus (type 2 in 2015, after the last reported case in 1999, and type 3 in 2019, after the last reported case in 2012).
The herd immunity level will be denoted ''q''. Recall that, for a stable state:
:
In turn,
:
which is approximately:
:
''S'' will be (1 − ''q''), since ''q'' is the proportion of the population that is immune and ''q'' + ''S'' must equal one (since in this simplified model, everyone is either susceptible or immune). Then:
:
Remember that this is the threshold level. Die out of transmission will only occur if the proportion of immune individuals ''exceeds'' this level due to a mass vaccination programme.
We have just calculated the critical immunization threshold (denoted ''q
c''). It is the minimum proportion of the population that must be immunized at birth (or close to birth) in order for the infection to die out in the population.
:
Because the fraction of the final size of the population ''p'' that is never infected can be defined as:
:
Hence,
:
Solving for
, we obtain:
:
When mass vaccination cannot exceed the herd immunity
If the vaccine used is insufficiently effective or the required coverage cannot be reached, the program may fail to exceed ''q
c''. Such a program will protect vaccinated individuals from disease, but may change the dynamics of transmission.
Suppose that a proportion of the population ''q'' (where ''q'' < ''q
c'') is immunised at birth against an infection with ''R''
0 > 1. The
vaccination programme changes ''R''
0 to ''R
q'' where
:
This change occurs simply because there are now fewer susceptibles in the population who can be infected. ''R
q'' is simply ''R''
0 minus those that would normally be infected but that cannot be now since they are immune.
As a consequence of this lower
basic reproduction number, the average age of infection ''A'' will also change to some new value ''A
q'' in those who have been left unvaccinated.
Recall the relation that linked ''R''
0, ''A'' and ''L''. Assuming that life expectancy has not changed, now:
:
:
But ''R''
0 = ''L''/''A'' so:
:
Thus, the vaccination program may raise the average age of infection, and unvaccinated individuals will experience a reduced
force of infection due to the presence of the vaccinated group. For a disease that leads to greater clinical severity in older populations, the unvaccinated proportion of the population may experience the disease relatively later in life than would occur in the absence of vaccine.
When mass vaccination exceeds the herd immunity
If a vaccination program causes the proportion of immune individuals in a population to exceed the critical threshold for a significant length of time, transmission of the infectious disease in that population will stop. If elimination occurs everywhere at the same time, then this can lead to
eradication.
; Elimination
: Interruption of endemic transmission of an infectious disease, which occurs if each infected individual infects less than one other, is achieved by maintaining vaccination coverage to keep the proportion of immune individuals above the critical immunization threshold.
; Eradication
: Elimination everywhere at the same time such that the infectious agent dies out (for example,
smallpox and
rinderpest).
Reliability
Models have the advantage of examining multiple outcomes simultaneously, rather than making a single forecast. Models have shown broad degrees of reliability in past pandemics, such as
SARS,
SARS-CoV-2,
[
] Swine flu,
MERS and
Ebola
Ebola, also known as Ebola virus disease (EVD) and Ebola hemorrhagic fever (EHF), is a viral hemorrhagic fever in humans and other primates, caused by ebolaviruses. Symptoms typically start anywhere between two days and three weeks after becom ...
.
See also
*
Pandemic
A pandemic () is an epidemic of an infectious disease that has spread across a large region, for instance multiple continents or worldwide, affecting a substantial number of individuals. A widespread endemic (epidemiology), endemic disease wi ...
*
Compartmental models in epidemiology
*
Contact tracing
*
Critical community size
*
Disease surveillance
*
Ecosystem model
An ecosystem model is an abstract, usually mathematical, representation of an ecological system (ranging in scale from an individual population, to an ecological community, or even an entire biome), which is studied to better understand the re ...
*
Force of infection
*
Landscape epidemiology Landscape epidemiology draws some of its roots from the field of landscape ecology. Just as the discipline of landscape ecology is concerned with analyzing both pattern and process in ecosystems across time and space, landscape epidemiology can be ...
*
Next-generation matrix In epidemiology, the next-generation matrix is used to derive the basic reproduction number, for a compartmental model of the spread of infectious diseases. In population dynamics it is used to compute the basic reproduction number for structured ...
*
Risk factor
*
Sexual network
A sexual network is a social network that is defined by the sexual relationships within a set of individuals.
Studies and discoveries
Like other forms of social networks, sexual networks can be formally studied using the mathematics of graph the ...
*
Transmission risks and rates
Transmission of an infection requires three conditions:
*an infectious individual
*a susceptible individual
*an effective contact between them
An effective contact is defined as any kind of contact between two individuals such that, if one individ ...
*
WAIFW matrix
In infectious disease modelling, a who acquires infection from whom (WAIFW) matrix is a matrix that describes the rate of transmission of infection between different groups in a population, such as people of different ages. Used with an SIR model, ...
References
Further reading
*
*
* An introductory book on infectious disease modelling and its applications.
*
*
*
External links
;Software
Model-Builder Interactive (GUI-based) software to build, simulate, and analyze ODE models.
GLEaMviz Simulator Enables simulation of emerging infectious diseases spreading across the world.
STEM Open source framework for Epidemiological Modeling available through the Eclipse Foundation.
*
R packag
surveillance Temporal and Spatio-Temporal Modeling and Monitoring of Epidemic Phenomena
{{Computer modeling
Epidemiology
Mathematical and theoretical biology
Vaccination
Medical statistics