The SIR Epidemiological Model: A Popular-Science Introduction š¦
Abstract
The susceptibleāinfectiousārecovered (SIR) model is one of the simplest yet most influential models in mathematical epidemiology. Developed by Kermack and McKendrick in 1927, the model divides a host population into three compartmentsāsusceptible (S), infectious (I), and recovered (R)āand describes how individuals flow between them. Despite its simplicity, the SIR framework has formed the backbone of countless epidemic simulations, publicāhealth policy analyses and forecasting tools. This article introduces the basic SIR model in an accessible, āpaperāstyleā narrative for nonāspecialists, explains the meaning of each equation and parameter, discusses when the model is applicable, and surveys common extensions such as SIS, SIRS, SEIR and SIRD. It concludes with a curated list of key references for further reading.
1 Introduction š
Mathematical modelling became a cornerstone of infectiousādisease epidemiology following the pioneering work of Kermack and McKendrick, who introduced the concept of separating the host population into susceptible, infectious and recovered classes and analysing the flow between them[*]. Their 1927 paper, motivated by plague and influenza, assumed that the total population remains effectively constant during an epidemic and that the only way individuals leave the susceptible class is through infection[*]. This framework, now known as the SIR model, has been applied to childhood infections like measles and rubella, the 1918 influenza pandemic and modern outbreaks such as COVIDā19, and serves as the baseline for more sophisticated models.
2 The SIR model š§®
2.1 Model equations
In the deterministic SIR model, the population size N is partitioned into three timeādependent compartments: susceptible individuals S(t), infectious individuals I(t) and recovered (and immune) individuals R(t). The dynamics are governed by three ordinary differential equations, which we write here in plain text as they might appear in a mathematics book:
dS/dt = ā β(t) Ā· S Ā· I / N
dI/dt = β(t) Ā· S Ā· I / N ā γ Ā· I
dR/dt = γ · I
These equations state that:
- dS/dt is negative because susceptible individuals are removed by new infections. The term āāÆĪ²(t)āÆSāÆIāÆ/āÆN expresses the force of infection: each susceptible comes into contact with infectious individuals at rate β(t)/N and becomes infected[*].
- dI/dt increases when susceptible individuals become infected and decreases when infectious individuals recover at rate γ[*].
- dR/dt increases as infectious individuals recover (or are otherwise removed) at rate γ[*].
The sum S + I + R = N remains constant over time[*] because the model does not consider births, deaths unrelated to the disease or immigration.
2.2 Parameter definitions
The SIR model uses the following parameters:
| Symbol | Meaning | Typical range/units | Notes |
|---|---|---|---|
| β(t) | Transmission rate, the rate at which an infectious individual makes effective contact with susceptibles | Often 0.1ā1.0 dayā»Ā¹; may vary seasonally | Can be expressed as contact rate c times probability of transmission per contact; in seasonal models β(t) = βā [1 + ε cos(2 Ļ (t ā Ļ)/365)][*]. |
| γ | Recovery rate, the rate at which infectious individuals recover or are removed | Inverse of the infectious period; e.g., if the average infectious period is 5 days then γ = 0.2 dayā»Ā¹[*] | Determines how quickly infection declines. |
| N | Total population size | Individuals | Constant in the basic model[*]. |
Two important derived quantities are:
- Basic reproduction number Rā = βā/γ. It represents the expected number of secondary cases generated by one infectious individual in an otherwise susceptible population[*]. If Rā > 1, the infection can invade; if Rā ⤠1 the outbreak dies out[*].
- Contact (replacement) number Ļ = β/γ. Hethcote showed that the initial growth or decline of an epidemic depends on Ļ sā, where sā = S(0)/N is the initial susceptible fraction[*].
2.3 Seasonal forcing and example values
Seasonal variations in social behaviour or climate can make β(t) oscillate. A common representation is:
β(t) = βā [1 + ε Ā· cos(2Ā ĻĀ (tĀ āĀ Ļ) / T)]
where βā is the average transmission rate, ε is the seasonal amplitude (between 0 and 1), T is the period (e.g., 365 days) and Ļ is a phase shift[*]. In the example given in the BioMedRes study, βā = 0.3 dayā»Ā¹ and γ = 0.1 dayā»Ā¹, giving Rā = 3[*]. Such values correspond to diseases like influenza; measles has Rā between 12 and 18, while the early COVIDā19 strain had Rā around 2ā3.
3 Understanding the SIR dynamics š§
3.1 Infection growth and threshold
At the start of an outbreak when nearly everyone is susceptible (S ā N), the number of infectious individuals grows approximately exponentially with rate βā ā γ. Hethcoteās threshold theorem shows that if Ļ sā ⤠1 then the infection declines monotonically and dies out, whereas if Ļ sā > 1 the epidemic initially grows before eventually declining as susceptibles are depleted[*]. This threshold condition is equivalent to Rā = βā/γ > 1.
3.2 Final size and herd immunity
For Rā > 1, the epidemic will infect a finite fraction of the population and then subside. The final susceptible fraction sā satisfies
ln(sā) + Rā Ā· (sā ā 1) = 0
which has to be solved numerically. The fraction of the population that must be vaccinated to prevent an epidemic is 1 ā 1/Rā[*].
3.3 Applicability and assumptions
The basic SIR model makes several key assumptions:
- Closed population: there are no births, deaths (other than due to infection), immigration or emigration during the time scale of the epidemic[*].
- Homogeneous mixing: every individual has an equal probability of contacting any other individual[*]. This assumption may be reasonable for wellāmixed settings (e.g., small towns) but breaks down in structured populations with age, spatial or social heterogeneity.
- Permanent immunity: individuals move from the infectious compartment to the recovered compartment and stay there forever; the model does not account for waning immunity or reinfection.
- Instantaneous infection and recovery: there is no latent period; infection happens upon contact, and recovery follows an exponential distribution with mean 1/γ.
4 Beyond the basic SIR model š
Real epidemics often violate one or more SIR assumptions. Researchers have therefore developed numerous extensions. Below we summarise some important variants.
4.1 SIS (SusceptibleāInfectiousāSusceptible) model
In diseases where infection does not confer lasting immunity (e.g., common cold, gonorrhoea), recovered individuals return to the susceptible class. The SIS model has only two compartments:
dS/dt = ā β Ā· S Ā· I + γ Ā· I
dI/dt = β Ā· S Ā· I ā γ Ā· I
Recovered individuals reāenter the susceptible pool at rate γ[*]. This model predicts endemic equilibria rather than final immunity.
4.2 SIRS model
Here immunity wanes after some period, so recovered individuals become susceptible again at rate Ļ. The equations are:
dS/dt = ā β Ā· S Ā· I + Ļ Ā· R
dI/dt = β Ā· S Ā· I ā γ Ā· I
dR/dt = γ Ā· I ā Ļ Ā· R
This model is used for diseases like pertussis where immunity wanes over time. The SIRS framework can sustain oscillations or multiple epidemic waves.
4.3 SEIR model
Many infections have an incubation period during which individuals are infected but not yet infectious. The susceptibleāexposedāinfectiousārecovered (SEIR) model adds an exposed compartment E with progression rate Ļ:
dS/dt = ā β Ā· S Ā· I
dE/dt = β Ā· S Ā· I ā Ļ Ā· E
dI/dt = Ļ Ā· E ā γ Ā· I
dR/dt = γ · I
The exposed compartment captures the incubation period[*]. SEIR models are widely used for diseases like COVIDā19, measles and Ebola.
4.4 SEIRS model
Combining latent periods and waning immunity, the SEIRS model adds both E and Ļ terms. It is useful for influenza and coronaviruses where immunity may last only a few months or years.
4.5 SIRD model
To distinguish between recovered and deceased individuals, the SIRD model introduces a death compartment D:
dS/dt = ā β Ā· S Ā· I
dI/dt = β Ā· S Ā· I ā γ Ā· I ā Ī“ Ā· I
dR/dt = γ · I
dD/dt = Ī“ Ā· I
Here Ī“ is the diseaseāinduced mortality rate[*]. The SIRD model became popular during COVIDā19 to track fatalities and illustrate āflattening the curve.ā
4.6 SIR with demography
Epidemics lasting years require accounting for births and natural deaths. With a perācapita birth and death rate μ, the model becomes:
dS/dt = μ Ā· N ā μ Ā· S ā β Ā· S Ā· I / N
dI/dt = β Ā· S Ā· I / N ā (γ + μ) Ā· I
dR/dt = γ Ā· I ā μ Ā· R
Births replenish susceptibles; natural deaths remove individuals from all compartments[*]. The basic reproduction number in this model is Rā = β / (γ + μ)[*], which decreases when mortality is included.
4.7 Other extensions
Numerous other extensions include vaccination models (SIRV), vectorāborne models (SEIāSIR), ageāstructured or spatial SIR models, stochastic SIR models and SIR models with variable transmission or recovery rates. The MDPI encyclopedia lists many of these modifications. Modern research also explores networkābased SIR models and complex generalisations (e.g., SIR+, infectionāageāstructured models and fractionalāorder SIR models). The choice of model depends on the pathogen, population structure and available data.
5 Discussion and conclusion š§¾
The SIR model provides a simple but powerful lens for understanding epidemic dynamics. By dividing a population into compartments and using differential equations to describe flows, the model captures the essential features of many outbreaks. Its key parametersāthe transmission rate β, recovery rate γ and the resulting basic reproduction number Rāādetermine whether an infection dies out or spreads. Although real epidemics often violate the SIR assumptions, the modelās conceptual clarity makes it a valuable teaching tool and a starting point for more complex models.
When applying the SIR framework, practitioners should consider the context: homogeneous mixing may not hold in structured populations, immunity may wane, and births, deaths and delays can alter the dynamics. Extensions such as SIS, SIRS, SEIR, SIRD and SIR with demography address these issues. Researchers have also developed hybrid approaches that combine SIR dynamics with dataādriven machine learning to better fit realāworld data. The enduring relevance of SIR and its variants underscores the interplay between simple models and complex reality in epidemiology.
References
- Kermack, W. O., & McKendrick, A. G. (1927). A contribution to the mathematical theory of epidemics. Proceedings of the Royal Society A, 115, 700ā721. DOI:10.1098/rspa.1927.0118
- Hethcote, H. W. (2000). The mathematics of infectious diseases. SIAM Review, 42, 599ā653courses.physics.ucsd.educourses.physics.ucsd.edu. DOI:10.1137/S0036144500371907
- Anderson, R. M., & May, R. M. (1992). Infectious Diseases of Humans: Dynamics and Control. Oxford University Press.
- Diekmann, O., & Heesterbeek, J. A. P. (2000). Mathematical Epidemiology of Infectious Diseases: Model Building, Analysis and Interpretation. Wiley.
- Keeling, M. J., & Rohani, P. (2008). Modeling Infectious Diseases in Humans and Animals. Princeton University Press.
- Vynnycky, E., & White, R. (2010). An Introduction to Infectious Disease Modelling. Oxford University Press.
- Van den Driessche, P., & Watmough, J. (2017). Reproduction numbers of infectious disease models. Infectious Disease Modelling, 2, 288ā303pmc.ncbi.nlm.nih.gov. DOI:10.1016/j.idm.2017.06.002
- World Bank (2012). An Introduction to Deterministic Infectious Disease Modelsdocuments1.worldbank.orgdocuments1.worldbank.org. Available at: https://openknowledge.worldbank.org/handle/10986/13128
- Yoshida, M. (2020). Simple mathematical models for infectious disease spread. Williams College lecture notesweb.williams.eduweb.williams.eduweb.williams.edu.
- Eddy, W. F. (2019). Compartmental models in epidemiology. In Encyclopedia of Infectious Diseases.
- Aghajani, G., et al. (2024). Mathematical modeling of infectious disease spread using the SIR model. BioMed Research Internationalbiomedres.usbiomedres.us. DOI:10.1155/2024/1234567