Mitigate SIR epidemic spreading via contact blocking in temporal networks

Zhang, Shilun; Zhao, Xunyi; Wang, Huijuan

doi:10.1007/s41109-021-00436-w

Research
Open access
Published: 06 January 2022

Mitigate SIR epidemic spreading via contact blocking in temporal networks

Applied Network Science volume 7, Article number: 2 (2022) Cite this article

2740 Accesses
3 Citations
1 Altmetric
Metrics details

Abstract

Progress has been made in how to suppress epidemic spreading on temporal networks via blocking all contacts of targeted nodes or node pairs. In this work, we develop contact blocking strategies that remove a fraction of contacts from a temporal (time evolving) human contact network to mitigate the spread of a Susceptible-Infected-Recovered epidemic. We define the probability that a contact c(i, j, t) is removed as a function of a given centrality metric of the corresponding link l(i, j) in the aggregated network and the time t of the contact. The aggregated network captures the number of contacts between each node pair. A set of 12 link centrality metrics have been proposed and each centrality metric leads to a unique contact removal strategy. These strategies together with a baseline strategy (random removal) are evaluated in empirical contact networks via the average prevalence, the peak prevalence and the time to reach the peak prevalence. We find that the epidemic spreading can be mitigated the best when contacts between node pairs that have fewer contacts and early contacts are more likely to be removed. A strategy tends to perform better when the average number contacts removed from each node pair varies less. The aggregated pruned network resulted from the best contact removal strategy tends to have a large largest eigenvalue, a large modularity and probably a small largest connected component size.

Introduction

Networks, such as physical contact networks and online social networks, facilitate the spread of epidemics and information. The study of epidemic spreading first assumed the topology of networks to be static (Pastor-Satorras et al. 2015; Wang et al. 2013), while many real-world networks are not static as nodes and links can appear and disappear over time, thus can be better represented as temporal networks (Holme and Saramäki 2012). For example, human contact networks such as face-to-face contact networks (Zhao et al. 2011) are temporal networks, which can be described by a sequence of contacts (or temporal links) between pairs of individuals occurring at discrete time steps. The increasing availability of network data with temporal information has fostered research on how the temporal aspect of networks can affect dynamic processes such as the spreading of epidemics (Zhang et al. 2017; Karsai et al. 2011) and information (Scholtes et al. 2014) on temporal networks. Epidemic/information spreading can be mitigated via reducing physical contacts. Covid-19 measures like curfew, working at home, social distancing all aim to block physical contacts. These measures treat at least a subgroup of the population in the same way. In this work, we address the further question of how to mitigate the epidemic spreading more effectively via selecting the contacts to block heterogeneously and strategically. We propose to develop contact removal strategies utilizing the network properties of contacts.

We consider real-world physical contact networks, where only the connection between nodes evolves (appears when there is a contact and disappears) over time whereas the nature/type of nodes and contacts do not change . In this case, a temporal network observed within a time window [0, T] can be represented by $\mathcal {G}=(\mathcal {N},\mathcal {C})$, where $\mathcal {N}$ is the node set observed within [0, T], size $N=|\mathcal {N}|$ is the number of nodes in the network, $\mathcal {C}=\{c(i,j,t), t\in [0,T],i,j\in \mathcal {N}\}$ is the set of contacts between pairs of nodes in $\mathcal {N}$, with contact (i, j, t) representing the interaction between node i and node j at time step t. A contact c(i, j, t), also called a temporal link, describes interaction/connection between node i and j at a specific time t. A node without any contact at time t can be regarded as inactive or not observed at that time step. We confine ourselves to the Susceptible-Infected-Recovered (SIR) epidemic spreading model (Pastor-Satorras et al. 2015) on a temporal network instead of more realistic spreading processes: Initially at $t=0$, a seed node is selected to be infected whereas all the other nodes are susceptible; When a contact happens between an infected node and a susceptible node at any time step, the susceptible node becomes infected with a probability $\beta$; Each infected node becomes recovered with a probability $\gamma$ at each time step. A recovered node will neither be infected nor infect any other node. The contacts to block will be selected based on the (time) aggregated network $\mathcal {G_W}$ of the temporal network $\mathcal {G}$. Aggregated network represented as $\mathcal {G_W}=(\mathcal {N}, \mathcal {L})$ is a weighted network with the same node set $\mathcal {N}$ as temporal network $\mathcal {G}$, $\mathcal {L}$ is the set of weighted links, two nodes i and j in $\mathcal {G_W}$ are connected by a link l(i, j) if they have at least one contact in temporal network $\mathcal {G}$ and link l(i, j) is associated with a weight recording the number of contacts in $\mathcal {G}$ between the two nodes. In the rest of this paper, links refer to the links in the aggregated network, and contacts will not be called temporal links anymore to avoid confusion. Contacts between two nodes i and j can be regarded as the corresponding link l(i, j) in the aggregated network activated at specific time steps.

The objective is to mitigate the epidemic spreading via blocking a given percentage $\phi$ of contacts, selected based on the aggregated network. The fraction $\phi$ of contacts removed corresponds to the cost of the mitigation. To launch a contact removal intervention during the time window [0, T], the information of the aggregated network of the temporal network $\mathcal {G}$ observed in [0, T] needs to be known at $T=0$. Such aggregated network is assumed to be given in our work, whereas in practice, it can be estimated based on the temporal network observed before $T=0$. Predicting the aggregated network is more feasible compared to predicting the temporal network in [0, T]. The latter, i.e. long-term prediction of time specific and possibly noisy contacts challenges machine learning approaches that target at short-term predictions. Hence, we focus on the development of contact removal strategies based on the aggregated network, instead of the complete temporal network information which is difficult to obtain.

We propose probabilistic contact removal strategies. Specifically, the probability that a contact c(i, j, t) is removed is a generic function of a centrality metric (Newman 2018) of link l(i, j) in the aggregated network and the time t of the contact. Each centrality metric leads to a unique mitigation strategy in contact removal. The impact of an SIR epidemic spreading can be evaluated via the following performance measures, which will be used to evaluate the mitigation strategies: the average prevalence over time, where the prevalence at a time step is the number of infected nodes; the maximal prevalence, so called peak height, which suggests the maximal demand in e.g. hospital resources; the time to reach the peak prevalence, so called peak time, which indicates the time to prepare the medical resources for the peak demand.

The mitigation strategies that we have proposed are evaluated in 6 real-world temporal networks. We find that the mitigation effect is better when contacts between node pairs that have fewer contacts are removed with a higher probability. Removing contacts that occur earlier in time could further enhance the mitigation effect. A strategy tends to better mitigate the epidemic spreading if the average number of contacts removed varies less among node pairs. Furthermore, we analyze properties of the aggregated pruned network resulted from each contact blocking strategy. We find that the optimal strategy tends to lead to an aggregated pruned network with a large largest eigenvalue, a large modularity and a possibly a small largest connected component. Networks with a large modularity and a small largest connected component are difficult for an epidemic to spread. Static networks with a small largest eigenvalue have been shown to be robust against epidemic spreading i.e. have a high epidemic threshold for Susceptible-Infected-Susceptible epidemic. The resultant aggregated pruned network after contact removal, however, may lead to a low prevalence if its largest eigenvalue is large. This suggests that the temporal information of contacts, may lead to new phenomena that can not be captured by static network studied.

Recent work has been devoted to understand the influence of temporal networks on dynamic processes and especially the mitigation of epidemic spreading. A first line of reseach has studied the mitigation of epidemic spreading via node-level approaches. Génois et al. (2015) have shown that vaccination of individuals who act as bridges between communities in time-aggregated network can efficiently prevent epidemic outbreaks. Gemmetto et al. (2014) have investigated the epidemic mitigation via excluding a sub-group of nodes in a temporal network in school environments. Another line of research has focused on link-based approaches to suppress epidemic outbreaks. Link removal strategies based on link centrality metrics in the aggregated network has been studied in Zhan et al. (2019). These strategies select the links in the aggregated network to block, thus removing all contacts associated with the selected links. In this work, we investigate in-depth at contact level, i.e. how to select a given number of contacts to remove to suppress epidemic spreading. To the best of our knowledge, few works have studied contact-level approaches to suppress epidemic spreading. Our previous work (Zhao and Wang 2020) has addressed the same question, however, was confined to Susceptible-Infected (SI) model, which is a special case of SIR model. In this work, we consider the SIR model, broaden and deepen our investigation towards a more comprehensive evaluation of mitigation effect and a more systematic analysis of the properties of the pruned network to explain the performance of the strategies. In view of the uncertainty of realistic temporal network data, we further check the robustness of our finding in the relative effectiveness of proposed mitigation strategies when the temporal networks are under the perturbation, i.e. when the time (ordering) of contacts is uncertain.

Methods

We will firstly propose our contract removal strategies. Afterwards, we will introduce the real-world temporal networks and simulations that will be used to simulate the epidemic spreading process and further to evaluate the effect of the mitigation strategies.

Contact blocking strategies

We select the contacts to block based on a given centrality metric in the aggregated network and the time of each contact. Specifically, the probability that a contact c(i, j, t) is removed is defined as a function of the given centrality metric of the corresponding link l(i, j) in the aggregated network $\mathcal {G_W}$ and the time t of the contact. This function also ensures that a fraction $\phi$ of contacts are removed on average.

Link centrality metrics

We propose a set of link centrality metrics based on node centrality metrics for the aggregated network $\mathcal {G_W}$. The aggregated network $\mathcal {G_W}$ is a weighted network constructed from a temporal network $\mathcal {G}$. The weight of each link in the aggregated network represents the number of contacts between the two corresponding nodes in the temporal network. Each centrality metric below will lead afterwards to a unique mitigation strategy:

Degree product of a link l(i, j) refers to $d(i)\cdot d(j)$, where d(i) is the degree of node i defined as the number of links incident to node i in the aggregated network.
Strength product of a link l(i, j) refers to $s(i)\cdot s(j)$, where s(i) is the strength of node i defined as the total weights of all the links incident to node i in aggregated network. The strength of a node tells the total number of contacts the node has.
Betweenness is the number of shortest paths that traverse the link between all possibly node pairs in the unweighted aggregated network (Wang et al. 2008).
Link weight of a link l(i, j) in aggregated network refers to the total number of contacts between node i and j in the corresponding temporal network.
Weighted eigenvector component product is the product of the principal eigenvector components of the link’s two end nodes. The principal eigenvector is the eigenvector corresponds to the largest eigenvalue of the weighted aggregated network.
Unweighted eigenvector component product is the product of the principal eigenvector components of the link’s two end nodes. The principal eigenvector is the eigenvector corresponds to the largest eigenvalue of the unweighted aggregated network.

Besides the proposed strategies based on the aforementioned link centrality metrics, we introduce a baseline strategy called Random removal. In the Random removal strategy, the probability for each contact c(i, j, t) to be removed is independent of the centrality of l(i, j). Or equivalently, Random removal sets the centrality value as 1 for all links.

Contact removal probability

Given a link centrality metric m, we can derive the centrality $m_{ij}$ for each link l(i, j) in the aggregated network. Consider the simple case where the probability that a contact c(i, j, t) between i and j is removed is independent of the time t and we first propose the removal preference $p_{ij}$:

$$\begin{aligned} p_{ij} = m_{ij}\frac{\phi \sum _{lk} w_{lk}}{\sum _{lk} (w_{lk}m_{lk})} \end{aligned}$$

(1)

where $w_{ij}$ is the weight of link l(i, j) in the aggregated network or equivalently the number of contacts between i and j, $\phi$ is the expected fraction of contacts to be removed, thus we have $\sum _{ij} p_{ij}w_{ij}=\phi \sum _{lk} w_{lk}$, which is the expected number of contacts to be removed. The removal preference $p_{ij}$ of a contact between any node pair i and j is proportional to the centrality $m_{ij}$ of the corresponding link l(i, j).

We cannot use the removal preference $p_{ij}$ directly as the removal probability of a contact between node i and j in view of the following. Some centrality metrics could be highly heterogeneous. The removal preference $p_{ij}$ is possibly larger than 1 if the centrality measure $m_{ij}$ of the link l(i, j) is large. To deal with this issue, we propose an iterative process to derive the contact removal probability by re-normalizing $p_{ij}$, where ${i},{j}\in \mathcal {N}$: we assign removal probabilities 1 to those contacts whose removal preference $p_{ij}$ according to (1) is larger than one, and re-normalize $p_{ij}$ among the contacts with $p_{ij}\le 1$ to satisfy $\sum _{ij} p_{ij}w_{ij}=\phi \sum _{ij} w_{ij}$. We repeat this normalization process until the removal preference $p_{ij}$ of all contacts are between 0 and 1, while the actual average fraction of contacts blocked is $\phi$. Now we define $\tilde{p}_{ij}$ as the re-normalized $p_{ij}$ via the proposed iterative process, and $\tilde{p}_{ij}$ is used as the removal probability of each contact between node i and node j.

We further generalize the definition of the contact removal preference $p_{ij}$ as

$$\begin{aligned} p^*_{ij} = m_{ij}^{\alpha }\frac{\phi \sum _{lk} w_{lk}}{\sum _{lk} (w_{lk}m_{lk}^{\alpha })} \end{aligned}$$

(2)

The removal preference of a contact c(i, j, t) is proportional to a polynomial function of $m_{ij}$. The definition (1) of $p_{ij}$ is a special case when $\alpha =1$ of definition (2). The random strategy, i.e. all contacts have the same probability of being removed, corresponds to the case when $\alpha =0$. Consider (1) where the reciprocal metric $\frac{1}{m_{ij}}$ is taken as a new centrality metric. The corresponding strategy is equivalent to the general definition (2) where metric $m_{ij}$ is considered and $\alpha =-1$.

In this work, we consider the definition (1) of $p_{ij}$ using the aforementioned list of centrality metrics and their reciprocals as well as the random strategy, which correspond to the general definition of (2) where $\alpha =1,-1,0$, respectively.

Finally, we generalize our strategy by considering the timestamps of the contacts. This is motivated by the intuition that early intervention, e.g. blocking early contacts, could be possibly more effective. We propose a time-dependent contract removal preference $p_{ij}(t)$:

$$\begin{aligned} p_{ij}(t) = m_{ij}f(t)\frac{\phi \sum _{lk} w_{lk}}{\sum _{lk} (w_{lk}m_{lk}f(t))} \end{aligned}$$

(3)

where f(t) describes the preference to remove contacts at specific period. The preference that c(i, j, t) is removed is proportional to $m_{ij}\cdot f(t)$. The same aforementioned normalization process is applied to this generalized contact removal preference to derive the removal probability of each contact.

As a start, we consider $f(t)=4\cdot 1_{t\le T/2}+1_{t>T/2}$, $f(t)=1_{t\le T/2}+4 \cdot 1_{t>T/2}$ and $f(t)=1$, where the indicator function $1_{y}$ is one if the condition y is true, and otherwise it is 0. They correspond to the preference of removing contacts happening early in [1, T/2], late in (T/2, T] and no preference for the timestamps of the contacts, respectively.

Datasets

The following real-world physical contact networks will be considered:

HighSchool11&12 record the physical contacts between students in a high school in Marseilles, France (Fournet and Barrat 2014). The two datasets consider two different groups of students.
WorkPlace13&15 capture the contacts between individuals in an office building in France (Génois et al. 2015). The two datasets are measured from different groups of individuals respectively.
MIT are human contact network among students of the Massachusetts Institute of Technology (Kunegis 2013; Eagle and Pentland 2006). The MIT dataset has been measured for about 8 months.

All networks are undirected. Their properties are given in Table 1. The duration of each time step is either 1 s or 20 s in all the networks. For the MIT dataset, we choose randomly two observation period, each of about one-week time. The temporal networks corresponding to these two periods are called MIT1 and MIT2. In this way, all the six temporal networks (HighSchool11&12, WorkPlace13&15, MIT1&2) are comparable in observation window. They will be used to study the impact of the mitigation strategies on the average prevalence over time, the focus of this work.

Table 1 Basic properties of real-world networks: the number of nodes, links (in the aggregated network) and contacts, respectively

Mitigate SIR epidemic spreading via contact blocking in temporal networks

Abstract

Introduction

Methods

Contact blocking strategies

Link centrality metrics

Contact removal probability

Datasets

Simulation

Results

Performance evaluation

Average prevalence

Properties of the pruned network

Peak height and peak time

Robustness

Conclusions

Availability of data and materials

Notes

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords