<span style="font-variant: small-caps">EpiInfer</span>: A Non-Markovian Method and System to Forecast Infection Rates in Epidemics

Consider an evolving epidemic in which each person is either (S) susceptible and healthy; (E) exposed, contagious but asymptomatic; (I) infected, symptomatic, and quarantined; or (R) recovered, healthy, and susceptible. The inference problem, given (i) who is showing symptoms (I) and who is not (S,...

Full description

Saved in:
Bibliographic Details
Main Authors: Jovan Kascelan, Ruoxi Yang, Dennis Shasha
Format: Article
Language:English
Published: MDPI AG 2025-07-01
Series:Algorithms
Subjects:
Online Access:https://www.mdpi.com/1999-4893/18/7/450
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Consider an evolving epidemic in which each person is either (S) susceptible and healthy; (E) exposed, contagious but asymptomatic; (I) infected, symptomatic, and quarantined; or (R) recovered, healthy, and susceptible. The inference problem, given (i) who is showing symptoms (I) and who is not (S, E, R) and (ii) the distribution of meetings among people each day, is to predict the number of infected people (state I) in future days (e.g., 1 through 20 days out into the future) for the purpose of planning resources (e.g., needles, medicine, staffing) and policy responses (e.g., masking). Each prediction horizon has different uses. For example, staffing may require forecasts of only a few days, while logistics (i.e., which supplies to order) may require a two- or three-week horizon. Our algorithm and system <span style="font-variant: small-caps;">EpiInfer</span> is a non-Markovian approach to forecasting infection rates. It is non-Markovian because it looks at infection rates over the past several days in order to make predictions about the future. In addition, it makes use of the following information: (i) the distribution of the number of meetings per person and (ii) the transition probabilities between states and uses those estimates to forecast future infection rates. In both simulated and real data, <span style="font-variant: small-caps;">EpiInfer</span> performs better than the standard (in epidemiology) differential equation approaches as well as general-purpose neural network approaches. Compared to ARIMA, <span style="font-variant: small-caps;">EpiInfer</span> is better starting with 6-day forecasts, while ARIMA is better for shorter forecast horizons. In fact, our operational recommendation would be to use ARIMA (1,1,1) for short predictions (5 days or less) and then <span style="font-variant: small-caps;">EpiInfer</span> thereafter. Doing so would reduce relative Root Mean Squared Error (RMSE) over any state of the art method by up to a factor of 4. Predictions of this accuracy could be useful for people, supply, and policy planning.
ISSN:1999-4893